Data Engineering – Data engineering solution designing in Big data ecosystem & product development for global businesses while delivering tangible business results reflecting multiple data auditability, controls in data management lifecycle
Building a Databricks cloud optimized solution for providing enterprise data solution via implementing data management Strategy. Shell Scripting, Python, Databricks.
Provided Free lancing and trainings in Big data -Hadoop, Hive and also delivered product using Databricks,PySpark,Azure and AWS
Octopus -Task Orchestration and ETL tool- Build an in-house tool as part of replacement of Informatica in order to save the cost of Informatica. Myself alone build the entire tool for ETL processing and also providing the task orchestration feature. The tool was build using python - pandas framework and Oracle/PLSQL. Similar to Airflow, Luigi..