Skip to content
View KayvanShah1's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report KayvanShah1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
KayvanShah1/README.md

Header

Data & ML Engineer building reliable data and ML systems

I design pipelines and models that stay correct under late data, scale, and real-world failure.

Currently working on audit analytics, agentic backends, and production forecasting pipelines.

linkedin github gitlab kaggle stackoverflow

What I’m Known For

  • Designing ingestion and modeling systems for messy, high-volume event data - Production ML and LLM workflows with evaluation, monitoring, and deployment hygiene - Resilient integrations handling rate limits, backfills, schema drift, and retries

Now

  • Building a Google Workspace audit analytics pipeline with overlap-safe ingestion - Developing agentic backend workflows using LLMs - Writing about real-world data failures and system design tradeoffs

Pinned repositories below reflect the work above.

Stack

python numpy pandas plotly folium scikit-learn tensorflow opencv pytorch fastapi selenium javascript mysql postgresql mongodb elasticsearch kibana git docker kubernetes terraform github-actions google-cloud-platform amazon-aws

GitHub Activity

GitHub Streak GitHub Stats

languages

Trophies

Open To

Open to Data Engineering, MLOps, and Platform roles. Best reached via LinkedIn or email.


This README is generated every 24 hours!
Last refresh: 01:48:46 GMT+0000 (Coordinated Universal Time)

Pinned Loading

  1. shbyun080/OneNet shbyun080/OneNet Public

    Official Implementation of OneNet

    Python 22 2

  2. ksanu1998/static_analysis_codegen_llms ksanu1998/static_analysis_codegen_llms Public

    This repository contains code base for project titled Leveraging static analysis for evaluating code-generation models developed during the CSCI 544 Applied Natural Language Processing course, Fall…

    HTML 5 2

  3. VirtuTA VirtuTA Public

    VirtuTA is an AI teaching assistant that delivers quick, accurate responses to student queries directly on Piazza. Powered by agentic workflows, Google Gemini, and Langchain, it automates both conc…

    Jupyter Notebook 10 4

  4. gws-audit-analytics-pipeline gws-audit-analytics-pipeline Public

    A robust data pipeline to fetch, process, and analyze token activity events from the Google Workspace Admin Reports API. This project ensures no data loss across multiple runs and provides insight …

    Python 1

  5. DataForgeOpenAIHub/Steam-Sales-Analysis DataForgeOpenAIHub/Steam-Sales-Analysis Public

    This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven…

    Jupyter Notebook 7

  6. taskaza taskaza Public

    Taskaza – a conversational ticket/task management AI agent

    TypeScript 2 1