I have extensive data engineering and machine learning experience and worked for some of world's largest corporations. I will provide best quality work on time.
Below is my technical summary:
SPECIALTIES
• Data Solution Architect
• Big Data Architect
• Cloud Architect (AWS, Azure)
• Data Governance
• Data Modelling
• Data Science (Machine Learning and Statistical Modelling) & Data Engineering
• Data Visualisation
• Data Conversion and Integration
• Master Data Management
• DevOps and CI/CD
TECHNICAL SKILLS
• Big Data (Hortonworks and Cloudera) – Spark(PySpark, Scala), Kafka, Hive,Impala, NiFi, HDFS, Sqoop, Ranger, Yarn, Solr, SAM, Schema Registry, SuperSet
• Language: Python, Scala, R, JavaScript
• Data Visualization – Tableau, PowerBI, OBIEE, DOMO
• Plunk & ELK Stack – ElasticSearch, Logstash, Filebeat, Kibana
• AWS – S3, RefShift, DynamoDB, Athena, Kinesis, EMR, Aurora, Glue
• Azure – Data Warehouse, Polybase, SQL Server, HDInsight, SSIS, SSAS
• ETL – Informatica Power Centre, Informatica Big Data Management (BDM), SSIS
• Data Science & Engineering – R / RStudio / SparkR Packages: dplyr, ggplot2, stringr, plyr, carrat, SparkR, NLP, tibble, TensorFlow, curl, Python / PySpark, MLLib
• Libraries: MLLib, NumPy, SciPy, Pandas, Matplotlib, Seaborn, SciKit-Learn
• DBMS / OLAP – Oracle, SQL Server, TeraData, MySQL, Postgres, Essbase, SSAS
• Data Modelling – Kimball, Vault
• Machine Learning / Statistical Modelling - Linear Regression, ?Logistic Regression, Classification and Regr