
Since the data was inconsistant, I had to do some manual cleansing to consolidate all the schools.After looking at the data, I realized it was unstructured and had categories within fields.Since the data was in pdfs, I had to convert the file to csv using an online converter and downloading the csv file.Structuring the data was the difficult and time consuming part. Generating scripts for conversion was pretty simple.Most data that had been reported by bootcamps was in pdf forms online reported by CIRR.So being that it is fairly new and unstructured it was hard to find consistent data.

Reporting data for technical bootcamps has only been reported since 16.XML Converter Script (scripts/convert_to_xml.py) JSON Converter Script(scripts/convert_to_json.py) Tools Usedĭata Source - csv clean file (data/bootcamp_data.csv)ĭata Output - json formatted file (data/bootcamp_data.json)ĭata Output - xml formatted file (data/bootcamp_data.xml)ĭata Dictionary - xlxs spreadsheet(data_dictionary.xlxs) Scripts The purpose of this project was to read a csv formatted file that contained bootcamp graduate data and create file conversions to json and xml formats. 11/15/17 Natalie Ramirez Project Description

Converts a csv file to json and xml file formats.
