We structure an algorithm that uses a group of document recognition libraries in order to extract information from both “estatutos tributarios municipales” and “presupuesto generales de la nación” that were provided in pdf format. We find that the tools we use are almost enough to capture the full information from these documents with few mistakes, so it can help people to standardize this information.