Data Engineer Intern at Teal India in Bengaluru

Website Teal India

About the job

As a data engineer at TEAL, you’ll be taking the plunge into a rich data lake that includes everything from complex geospatial data to legal court orders to transactions data. You’ll be hustling and getting your hands dirty with every part of the data pipeline always having an implicit appreciation for how all of this data will ultimately power a revolutionary real estate risk platform.

Your day-to-day will largely include

  • Writing complex regular expressions and other parsers to extract usable data from messy PDF, HTML, JSON and other files that range in the millions to tens of millions.
  • Working with NLP tools, machine translators and language specialists to efficiently translate documents from languages ranging from Persian to Tamil.
  • Working with data annotators and QC experts to ensure data quality is at its highest.
  • Implementing methods to improve data reliability and quality.

About you

We are looking for someone who

  • Is proficient and has demonstrated experience using Python.
  • Has worked with large (millions to hundreds of millions of rows in a SQL database) interdisciplinary datasets.
  • Is patient and methodical with unstructured and messy data.
  • Is always hungry to learn newer and better technologies to make the data ecosystem faster, smoother and less silo-ed.

To apply for this job please visit careers.tealindia.in.