• Hi!
    I'm Sofia

    A Software Engineer by training and
    a Data Scientist in the making
    interested in working on challenging problems around data...

    Download Resume

About

Who Am I?

My name is Sofia Dutta, a Software Engineer by training and a Data Scientist in the making, I currently work for NewWave Telecom & Technologies, Inc. as a Data Scientist Intern. At NewWave, I am part of a team working on Machine Learning problems in their Medicaid Data Quality Assistant (MDQA) project from the healthcare domain.

I have worked on several Data Science, Machine Learning, and Deep Learning projects during my internship at NewWave and graduate studies at UMBC.

Previously, I used to work as a Software Developer and Data Analyst for Tata Consultancy Services from 2010 to 2018.

Python Programming

Deep Learning

Machine Learning

Master's in Data Science with a 4.0 GPA!

Hire me
Experience

Work Experience

Data Scientist Intern Present - 2020

Data Scientist Intern at NewWave Telecom and Technologies, Inc., Woodlawn, MD, USA

  • Built a system for computation of data quality analysis and exploration in a record time of a month.
  • Trained a machine learning model using 3000 rules for quality computation metrics.
  • Improved computation speed 10-fold by deploying analysis workflow in Google Cloud Platform (GCP) clusters and using Apache Spark for quality metrics computations.
  • Exported results to Looker for creating intuitive dashboards visualizations.
  • Performed data quality analysis on half a million healthcare records from Centers for Medicare & Medicaid.

Semantic Web Researcher 2020 - 2019

Graduate Student Researcher in Semantic Web and Smart Home Access Control, Ebiquity Group, UMBC, USA

  • Authored an Ontology for Smart Home Access Control by extending earlier research in Semantic Web.
  • Developed an Android app for handling context-sensitive access control in a Smart Home Environment.
  • Created YouTube videos for presentation to the National Institute of Standards and Technology.
  • Published a paper at the IEEE Big Data Security 2020 conference.

Software Developer and Data Analyst 2018 - 2010

Software Developer and Data Analyst at Tata Consultancy Services (TCS) Ltd., India

  • Led the design, development, and delivery management of seven projects for clients of TCS.
  • Created API interfaces using PL/SQL stored procedures for daily usage for clients of TCS.
  • Led meetings to capture requirements from DHL UK, Staples USA, Hyatt USA, Kaiser-Permanente USA.
  • Carried out change based regression analysis and documented software functional specifications.
  • Prepared test plans and executed system integration testing and user-acceptance testing.
  • Ensured client systems were up in four hours after migration activities saving millions of dollars in potential revenue lost.
  • Implemented scripts for data migration of a billion records while adhering to strict time SLA bounds.
  • Completed client data migration from legacy Oracle Apps (11i) to Oracle ERP Suite (R12).
  • From 2013 - 2018, managed continuous integration and continuous deployment in production environments.
  • Got certified in seven Oracle Apps competencies while working for client projects.
My Specialty

My Skills

I am a Data Scientist with experience in Data Analysis, Machine Learning and Software Development.

Python

95%

Java

80%

PyTorch

90%

Sci-kit Learn

85%

Keras

80%

Tensorflow

80%

MLlib

80%

PySpark

90%

TSQL, Oracle SQL, PL/SQL

95%

Oracle Apps, Oracle Fusion

90%

Google Cloud

80%

AWS S3

80%

LookML, Looker

85%

Knowledge Graph, OWL, SPARQL

75%
Education

Education

2020 - 2019
Working on my Master's in Data Science from University of Maryland, Baltimore County, Baltimore, USA. GPA: 4.0!
See transcript here.

2010 - 2006
Completed my Bachelor of Technology in Computer Science from West Bengal University of Technology, Kolkata, India with a 3.5 GPA!
See transcript here.

Data Science Projects
Client Projects
Software Certifications
Certificates & Awards
Publications

Publications

Android demos

Demos for research

Smart home controller app with rules demo