AswiN-7/README.md

Hi, I'm Aswin 🚀

Data Engineer | AWS | PySpark | Real-time Pipelines | Cloud Automation | GenAI Enthusiast


🚀 About Me

I'm a Data Engineer with over 3 years of experience in building and optimizing cloud-native, scalable data pipelines using modern data stacks. My journey has focused on crafting high-performance ETL/ELT workflows, automating cloud infrastructure, and enabling real-time data access that supports analytics and ML use cases.

I have hands-on experience designing robust systems using:

  • AWS Services like Glue, Lambda, Step Functions, DMS, Kinesis, S3, Redshift, CloudWatch
  • Big Data technologies like Apache Kafka, PySpark, Delta Lake, Airflow, and Ab Initio
  • Languages & Frameworks: Python, SQL, Flask, Shell, LangChain, PyTorch

I am also passionate about AI/ML and LLMs, and actively explore ways to integrate them into data engineering workflows.


📊 Recent Highlights

  • 🏆 3rd Place Winner – Barclays GenAI Hackathon (Regional Level)
  • ⚙️ Built a real-time data streaming pipeline using Kafka, Python, and AWS S3
  • ✨ Contributed to DaFE (Data Forge Engine), a cloud-native, low-code processing platform
  • ✅ Automated AWS DMS, EC2 cost-optimization workflows, and CI/CD config pipelines

🛠️ Tech Stack

Languages:       Python, SQL, Java, Shell
Cloud & DevOps:  AWS (Glue, Lambda, S3, DMS, DynamoDB, Athena, Step Functions, CloudWatch), Jenkins, GitLab, Docker
Data Engineering: PySpark, Airflow, Kafka, Ab Initio, Delta Lake, ETL/ELT, Streaming, Data Governance
Storage:         PostgreSQL, MongoDB
AI/ML Tools:     PyTorch, LangChain, Hugging Face, LLM, NLP

💼 Notable Projects

✨ Real-Time Data Streaming Pipeline

  • Built a real-time ingestion pipeline with Apache Kafka, Python, and AWS S3
  • Automated metadata detection using Glue Crawlers + Athena for serverless querying
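A minimal sketch of the batching and object-naming side of such a pipeline, assuming records arrive as Python dicts and are written to S3 as newline-delimited JSON. The function names, the `events` prefix, and the `year=/month=/day=` layout are illustrative assumptions, not the project's actual code; the layout is chosen because Glue crawlers can register Hive-style `key=value` prefixes as table partitions. Kafka consumption and the S3 upload are left out so the sketch runs without external services.

```python
import json
from datetime import datetime, timezone
from typing import Dict, Iterable, Iterator, List


def batch_records(records: Iterable[Dict], batch_size: int) -> Iterator[List[Dict]]:
    """Group an incoming record stream into fixed-size batches."""
    batch: List[Dict] = []
    for record in records:
        batch.append(record)
        if len(batch) >= batch_size:
            yield batch
            batch = []
    if batch:  # flush the final, possibly partial, batch
        yield batch


def partitioned_key(prefix: str, event_time: datetime, batch_id: int) -> str:
    """Build a Hive-style object key (year=/month=/day=) for one batch."""
    return (
        f"{prefix}/year={event_time:%Y}/month={event_time:%m}/day={event_time:%d}/"
        f"batch-{batch_id:06d}.jsonl"
    )


def to_json_lines(batch: List[Dict]) -> bytes:
    """Serialize a batch as newline-delimited JSON, one record per line."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in batch).encode("utf-8")


if __name__ == "__main__":
    # Hypothetical stand-in for messages read from a Kafka topic.
    events = [{"id": i, "value": i * 10} for i in range(5)]
    ts = datetime(2024, 1, 15, tzinfo=timezone.utc)
    for n, batch in enumerate(batch_records(events, batch_size=2)):
        key = partitioned_key("events", ts, n)
        body = to_json_lines(batch)
        # In a real deployment, body would be uploaded under key (assumption).
        print(key, len(body))
```

Batching before upload keeps the number of small S3 objects down, which generally helps Athena scan performance.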

Tamil QA RAG System

  • Developed an open-domain retrieval-augmented generation (RAG) model for Tamil using fine-tuned RoBERTa + XLM
  • Indexed dense vectors with Milvus and deployed APIs with Flask
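The retrieval step of a RAG system can be sketched in plain Python. This is an illustration under stated assumptions, not the project's implementation: the vectors below are made-up two-dimensional embeddings (real ones would come from the fine-tuned encoder), the brute-force cosine search stands in for the approximate nearest-neighbour search a vector database such as Milvus performs, and `top_k` and `build_prompt` are hypothetical helper names.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(
    query_vec: Sequence[float],
    index: Dict[str, Sequence[float]],
    k: int,
) -> List[Tuple[str, float]]:
    """Return the k passage ids most similar to the query, best first."""
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def build_prompt(question: str, passages: List[str]) -> str:
    """Assemble retrieved passages and the question into a generator prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


if __name__ == "__main__":
    # Toy index: passage id -> embedding (illustrative values only).
    index = {"p1": [1.0, 0.0], "p2": [0.0, 1.0], "p3": [1.0, 1.0]}
    hits = top_k([1.0, 0.1], index, k=2)
    print([doc_id for doc_id, _ in hits])
    print(build_prompt("Example question?", ["passage one", "passage two"]))
```

Exhaustive search like this scales linearly with the number of passages, which is the cost a dedicated vector index is meant to avoid.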

📆 Education

Bachelor of Engineering (Computer Science)
SSN College of Engineering – Chennai, India (2018–2022)
CGPA: 7.79 / 10


👥 Let's Connect


📈 GitHub Stats


🌟 Featured Badges


"Data isn't just numbers; it's a story waiting to be understood. Let's build systems that tell it better."

Pinned

  1. tamilNLP (Public)

    Python 2

  2. School-Auth (Public)

    Forked from mohanram123/School-Auth

    An authentication system for a school management system, built with Node.js and MySQL

    JavaScript

  3. blog-creating-system (Public)

    Java

  4. CountryDatabaseManagement (Public)

    Java

  5. mess-management-system (Public)

    C

  6. tnau-gpa-calculator (Public)

    Subject information and credit scores are taken from the TNAU website

    JavaScript 1