Data Engineer building scalable data pipelines, cloud data platforms, and analytics systems on AWS, Snowflake, and Databricks.
5+ Years Experience | Data Engineer | Snowflake • AWS • Databricks • Spark • ETL/ELT • Data Modeling • Airflow
Let’s TalkActively Seeking Full-Time Opportunities
I design and build scalable ETL/ELT pipelines to ingest, process, and transform large-scale data from multiple sources into analytics-ready datasets.
I build cloud-native data platforms using Snowflake and AWS, enabling efficient storage, transformation, and analytics of large datasets.
I process large-scale structured and semi-structured data using distributed computing frameworks to support analytics and reporting.
Designed and implemented a scalable metadata-driven ELT pipeline using Snowflake, Amazon S3, and GitHub Actions. Built a reusable stored procedure to dynamically generate SQL based on configuration tables, enabling onboarding of new pipelines without code changes. Implemented Bronze-to-Silver transformations, audit logging for pipeline observability, and CI/CD automation for deployment and execution.
Technologies: Snowflake, SQL (Snowflake Scripting), Amazon S3, GitHub Actions, Snowflake CLI, Data Engineering
Designed and implemented a production-style data warehouse using Snowflake following the Medallion Architecture (Bronze, Silver, Gold). Built secure ingestion from Amazon S3, performed data profiling and transformation, and developed a star schema with fact and dimension tables along with business-ready marts for analytics.
Technologies: Snowflake, SQL, Amazon S3, Data Warehousing, Medallion Architecture
Automated the setup of an Amazon EMR cluster for big data processing using Apache Spark and Hadoop. Used AWS CLI and Python (Boto3) to provision resources, integrate with S3, and optimize for cost and scalability.
Technologies: AWS EMR, Apache Spark, Hadoop, Python (Boto3), AWS CLI, S3
Automates the deployment of Amazon EKS clusters with self-managed node groups using Terraform. Enables scalable, customizable Kubernetes infrastructure with full control over worker nodes.
Technologies: Terraform, AWS EKS, Kubernetes, IAM
A responsive, modern portfolio website showcasing my experience, projects, certifications, and contact information. Built with a focus on clean UI, accessibility, and performance, and deployed with CI/CD on Netlify.
Technologies: HTML, CSS, JavaScript, Netlify, Formspree
Certified Kubernetes Application Developer
SnowPro® Associate: Platform
Academy Accreditation - Databricks Fundamentals
AWS Certified Machine Learning - Specialty
GCP Professional Data Engineer
HashiCorp Certified: Terraform Associate (003)
AWS Certified Solutions Architect - Associate
AWS Certified Developer – Associate
AWS Certified Cloud Practitioner
AWS Certified AI Practitioner - Beta
AWS Cloud Quest: Cloud Practitioner
Microsoft Certified: Security, Compliance, and Identity Fundamentals
GitHub Foundations
TigerGraph for ML
TigerGraph Associate
Apache Cassandra 3 Developer Certification
Apache Cassandra 3 Administrator Certification
Company: Technohaul LLC, Texas, USA
Technologies: Snowflake, AWS (S3, EMR), PySpark, SQL, Apache Airflow, Databricks, Python
Company: Relinetek LLC, Texas, USA
Technologies: Snowflake, AWS S3, SQL, Python, Data Modeling
University: Kent State University, Kent, OH
Focused on distributed systems, data engineering, databases, and cloud computing. Built strong foundations in scalable system design, data processing, and backend engineering.
Company: Accenture, Hyderabad, India
Technologies: Snowflake, AWS S3, SQL, Airflow, Python, Spark, Data Modeling
Company: Accenture, Bengaluru, India
Technologies: SQL, AWS, Data Pipelines, ETL
University: Jawaharlal Nehru Technological University, Kakinada
Built strong foundations in data structures, databases, operating systems, and computer networks, enabling transition into data engineering and cloud systems.
I’m Ganasai Palakurthi, a Data Engineer with experience building scalable data pipelines, cloud data platforms, and analytics-ready data solutions using Snowflake, AWS, Spark, and Databricks.
My experience includes data ingestion, ETL/ELT development, data transformation, and designing data architectures that support large-scale structured and semi-structured datasets. I have worked on building reliable pipelines, optimizing data processing performance, and delivering curated datasets for reporting and analytics.
I focus on developing efficient data workflows, improving data quality, and building scalable systems that support business intelligence and data-driven decision-making. I am also committed to continuous learning and hold multiple industry certifications in cloud, data engineering, and platform technologies.