eBPF Compiler Optimization
Optimized eBPF bytecode at the LLVM level for JIT vectorization, cutting memset instruction count by ~50% for structs larger than 300 bytes. Benchmarked Tracee, Tetragon, and Sysdig with lmbench and perf to measure CPU-cycle overhead.
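A back-of-envelope sketch of where a ~50% reduction like that comes from: widening the stores emitted for a struct zero-fill roughly halves the instruction count. The widths and sizes below are illustrative arithmetic only, not the actual codegen.

```python
# Illustrative arithmetic only: zeroing a struct with scalar 8-byte stores
# versus wider 16-byte vector stores. Real codegen depends on alignment,
# the target ISA, and the eBPF verifier's constraints.
def store_count(struct_bytes: int, store_width: int) -> int:
    return -(-struct_bytes // store_width)  # ceiling division

scalar = store_count(320, 8)    # 40 stores for a 320-byte struct
vector = store_count(320, 16)   # 20 stores: ~50% fewer instructions
```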
Systems & AI Engineer
I work on systems-level code and AI tooling. Right now that means eBPF programs at the kernel level and LangGraph agents that can actually get things done.
I like knowing how things work under the hood. Most of my time goes into writing eBPF programs that hook into the kernel, CUDA code for GPU workloads, and building AI agents that do more than just chat.
I did my Master's at CU Boulder (3.95 GPA), where I spent time on WebRTC infrastructure and kernel-level packet processing. Before that I was doing ML research at IIT Patna and building full-stack tools at SimPPL and the University of Mumbai.
Right now at HeyNoah, I'm writing LangGraph agents with tool-use, memory, and human-in-the-loop checkpoints for enterprise workflows.
M.S. Computer Science, CU Boulder
GPA: 3.95 / 4.00
Systems Programming, AI Agents, Distributed Computing
HeyNoah - Palo Alto, CA
Independent Research, CU Boulder - Boulder, CO
University of Colorado Boulder - Boulder, CO
SimPPL - Mumbai, India
University of Mumbai - Mumbai, India
IIT Patna - Patna, India
University of Colorado Boulder - Boulder, CO
Design & Analysis of Algorithms, Compiler Construction, Distributed Systems, NLP, Linux System Administration, Data Center Scale Computing, Modern Computing Systems
University of Mumbai - Mumbai, India
Data Structures, Operating Systems, Database Management Systems, Computer Networks, Machine Learning, Distributed Computing, Information Security, Software Engineering, Discrete Mathematics
Custom CUDA kernels for batched vector add/mul/scale, 10x faster than single-threaded CPU on 1M+ element arrays. Used coalesced memory access, pinned host memory, and async streams to maximize throughput.
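A plain-Python model of the indexing scheme behind those kernels, assuming a standard grid-stride loop; the block and grid sizes are illustrative, and the real implementation is CUDA C, not Python:

```python
# Pure-Python model of a CUDA grid-stride loop for batched vector add.
# Hypothetical block/grid sizes; the real kernel runs the loop body once
# per GPU thread with idx = blockIdx.x * blockDim.x + threadIdx.x.
def vec_add(a, b, out, block_dim=256, grid_dim=4):
    n = len(a)
    stride = block_dim * grid_dim          # total threads in the grid
    for block in range(grid_dim):          # parallel across SMs on a GPU
        for thread in range(block_dim):    # parallel within a block
            idx = block * block_dim + thread
            while idx < n:                 # grid-stride loop over the array
                out[idx] = a[idx] + b[idx]
                idx += stride
```

Consecutive threads touch consecutive indices here, which is what makes the real kernel's global-memory accesses coalesced.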
Compiler that takes Python expressions, parses them into a custom AST, and spits out SQL or MongoDB queries. Maps Pandas DataFrames to relational tables with schema inference. 10x faster than routing the same queries through an ORM.
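The core move can be sketched with the standard-library ast module; the function name and the operator coverage here are illustrative, not the project's actual API:

```python
import ast

# Minimal sketch: walk a Python boolean/comparison expression's AST and
# emit a SQL WHERE-clause string. Operator coverage is illustrative only.
_SQL_OPS = {ast.Gt: ">", ast.Lt: "<", ast.Eq: "=",
            ast.GtE: ">=", ast.LtE: "<=", ast.NotEq: "<>"}

def to_sql_predicate(expr: str) -> str:
    tree = ast.parse(expr, mode="eval")
    return _emit(tree.body)

def _emit(node):
    if isinstance(node, ast.BoolOp):               # and / or
        joiner = " AND " if isinstance(node.op, ast.And) else " OR "
        return "(" + joiner.join(_emit(v) for v in node.values) + ")"
    if isinstance(node, ast.Compare):              # chained comparisons
        left, parts = _emit(node.left), []
        for op, comp in zip(node.ops, node.comparators):
            parts.append(f"{left} {_SQL_OPS[type(op)]} {_emit(comp)}")
            left = _emit(comp)
        return " AND ".join(parts)
    if isinstance(node, ast.Name):                 # column reference
        return node.id
    if isinstance(node, ast.Constant):             # literal value
        return repr(node.value) if isinstance(node.value, str) else str(node.value)
    raise ValueError(f"unsupported node: {type(node).__name__}")
```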
MPI + OpenMP sorting benchmark in C++. Partitions data across nodes with MPI, then quicksorts locally with OpenMP threads. Tested scaling from 1 to 16 ranks to see where communication overhead starts to dominate.
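In plain Python, the partitioning scheme looks roughly like this; the bucket exchange stands in for the MPI all-to-all and the per-bucket sort for the OpenMP-threaded quicksort, so this is a sketch of the data flow, not the C++ implementation:

```python
# Sketch of a sample-sort-style partition: pick splitters, route elements
# to destination-rank buckets (models the MPI exchange), then sort each
# bucket locally (models the per-node threaded quicksort).
def distributed_sort(data, n_ranks=4):
    # choose n_ranks-1 splitters from a sorted sample of the input
    sample = sorted(data[:: max(1, len(data) // (n_ranks * 8))])
    step = max(1, len(sample) // n_ranks)
    splitters = sample[step::step][: n_ranks - 1]
    # route each element to its rank's bucket
    buckets = [[] for _ in range(n_ranks)]
    for x in data:
        r = sum(x > s for s in splitters)  # index of the rank whose range holds x
        buckets[r].append(x)
    # each "rank" sorts locally; concatenating in rank order is globally sorted
    for b in buckets:
        b.sort()
    return [x for b in buckets for x in b]
```

The scaling question the benchmark probes is visible here: the routing step is all communication, so as ranks grow its cost eventually dominates the local sorts.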
GNN-based phishing detector for Ethereum in PyTorch. Converts transaction histories into temporal ego-graphs, trains a classifier on them, and hits 98% F-score. The ego-graph trick also sped up training by 20%.
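The ego-graph construction can be sketched as a k-hop traversal over time-filtered edges; the transaction tuple layout and function name are assumed simplifications, and the GNN that consumes the output is omitted:

```python
from collections import defaultdict

# Sketch: pull the k-hop neighborhood around a target address, keeping only
# transactions at or before a cutoff timestamp (the "temporal" part).
# (sender, receiver, timestamp) tuples are a placeholder schema.
def temporal_ego_graph(transactions, center, hops=2, cutoff=float("inf")):
    adj = defaultdict(set)
    for src, dst, ts in transactions:
        if ts <= cutoff:
            adj[src].add(dst)
            adj[dst].add(src)          # treat transfers as undirected for reach
    nodes, frontier = {center}, {center}
    for _ in range(hops):              # expand one hop at a time
        frontier = {n for u in frontier for n in adj[u]} - nodes
        nodes |= frontier
    # induced edge list of the ego-graph, ordered by time
    return sorted(((s, d, t) for s, d, t in transactions
                   if t <= cutoff and s in nodes and d in nodes),
                  key=lambda e: e[2])
```

Training on these small induced subgraphs instead of the full transaction graph is what delivers the speedup: each sample's neighborhood is bounded by the hop count.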
Trained 3 different transformer setups on combined text, audio, and video data to figure out what causes emotional responses. Ran ablation studies across modalities and got to an 85% F-score. Split model components across cloud instances with async message-passing for 20% better throughput.
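The async message-passing layout can be sketched with asyncio queues: stages (e.g. per-modality encoders feeding a fusion head) run as coroutines joined by queues, so one stage can work on batch i+1 while the next handles batch i. The stage functions below are stand-ins for the real model components.

```python
import asyncio

# Each pipeline stage pulls from an inbox queue, applies its function,
# and pushes to an outbox queue. A None sentinel shuts the pipeline down.
async def stage(fn, inbox, outbox):
    while True:
        item = await inbox.get()
        if item is None:               # sentinel: propagate shutdown downstream
            await outbox.put(None)
            return
        await outbox.put(fn(item))

async def run_pipeline(batches, fns):
    queues = [asyncio.Queue() for _ in range(len(fns) + 1)]
    tasks = [asyncio.create_task(stage(fn, queues[i], queues[i + 1]))
             for i, fn in enumerate(fns)]
    for b in batches:                  # feed all batches, then the sentinel
        await queues[0].put(b)
    await queues[0].put(None)
    results = []
    while (out := await queues[-1].get()) is not None:
        results.append(out)
    await asyncio.gather(*tasks)
    return results
```

The throughput gain of such a layout comes from overlap: with k stages, up to k batches are in flight at once instead of one.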