About Me
Greetings! My name is Anuja Uppuluri. I was born and raised in Austin, Texas, and currently live in Pittsburgh. I am a computer scientist and AI & machine learning researcher, among other things.
I'm an undergrad at Carnegie Mellon University studying Computer Information Systems, Artificial Intelligence, and Discrete Math. I graduate in May 2025 🎓📜.
I am passionate about LLM post-training research, interpretability, and AI safety. I founded and lead the Carnegie Mellon AI Safety & Alignment Initiative (300 members strong!). I believe AGI will benefit all of humanity; my goal is to help get there and make sure that path doesn't introduce catastrophic risks.
I am in an a cappella group called Counterpoint, and I love singing (in a group). I also enjoy painting, meditating, listening to & making music, writing, playing chess & Clash Royale, watching films & shows (Severance right now!), reading, and most other enjoyable hobbies people partake in.
I would love to connect with you! Click your preferred form of communication below to add me.
⊹ ࣪ ﹏𓊝﹏🫧⋆。˚﹏⊹ ࣪ ˖
Research
AidanBench | Co-First Author
Accepted to NeurIPS Language Gamification 2024
AidanBench is a novel benchmark for evaluating sustained, open-ended generation in large language models (LLMs).
It uses open-ended question prompts to assess a model's coherence, creativity, contextual attention, and instruction following through embedding-based dissimilarity metrics.
We ran comparative analyses across SOTA models and showed that AidanBench scores correlate strongly with model size and moderately with LMSYS.
The benchmark is non-saturating (it has no score ceiling) and aligns better with real-world, open-ended use cases.
Creating a Cooperative AI Policymaking Platform | Co-First Author
with Humanity Unleashed
Leading research on frameworks that systematically identify and quantify human values across diverse populations, using Bayesian modeling to inform AI-driven policy development.
Developing methodologies to capture the preferences of stakeholders from various demographic groups, so that AI recommendations in each domain reflect a comprehensive range of societal perspectives.
Translating elicited values into actionable policy proposals to enable transparent governance with human opinion oversight in AI decision processes.
The platform I'm building is part of the larger mission of leveraging AI to enhance human cooperation and alignment, working toward responsible governance before more advanced AI systems emerge.
Projects
Multi-Agent RL Trading System
Reinforcement learning framework that trains agents to trade in simulated markets using Proximal Policy Optimization. Implements portfolio management with price impact modeling in a multi-agent environment.
Key Achievements:
- Agent learns stable trading strategies with returns stabilizing after initial exploration
- Price correlation demonstrates sophisticated market impact understanding
- Real-time adaptive behavior to market volatility
Interactive LLM Explainability Dashboard
Interactive dashboard for exploring and visualizing language model internals. Makes complex neural network behaviors interpretable through intuitive visualizations of attention mechanisms and text embeddings.
Key Features:
- Text Embeddings Visualization using SentenceTransformer with t-SNE dimensionality reduction
- BERT Attention Heatmap showing how each token attends to others in the sentence
- Intuitive interactive interface for exploring model internals
Experience
Software Development Engineer Intern
May 2024 – August 2024
Built from scratch an optimized full-stack testing system for the ML models that set prices for all Amazon Basics items. The system now serves as the primary testing framework used by the Base Pricing team for ML model evaluations. Deployed the system to production, reducing manual validation time from 12 minutes to 2 minutes.
Software Engineering Intern
August 2023 – December 2023
Fixed critical memory leaks in satellite controller system code written in Julia, increasing information transmission efficiency by 24%. Reduced overall system downtime and increased satellite communication stability.
Center for Human-Compatible AI
AI Safety and Risk Researcher
August 2023 – December 2023
Wrote algorithms to enhance confidence scoring mechanisms in large-scale language models (15% reliability increase). Implemented rigorous evaluations of the confidence scoring code and built a benchmark prompt dataset for model testing.
Lead Python Developer for AI
June 2023 – August 2023
Led a team of 18 developers in creating a self-learning AI SaaS product that generates 3D animated movies from user-submitted scripts. Reduced the CTO's timeline from 6 months to 2 months. The project was acquired by Disney.
Machine Learning Intern
June 2022 – August 2022
Led the creation of a convolutional neural network model for image classification, integrating BERT encoders and decision trees into the architecture. Final product implemented by large health insurance providers, such as Humana and Aetna.
Discovery Fellowship
Technology Summer Program
Watson AI HS Internship
Aerospace Scholar Program
AI, ML, and Deep Learning Intern
January 2021 – March 2021
Worked with Dr. Taniya Mishra and Dr. Safinah Ali at the MIT Media Lab to develop sentiment analysis models contributing to Affectiva's emotion recognition algorithms, using natural language processing and computer vision.
Co-Founder and Lead Developer
May 2020 – May 2023
Co-founded under Harvard Innovation Labs and led development efforts using Unity for the frontend and C++ for the backend. Developed the platform from the ground up and grew the user base from 0 to more than 127,000 active users. Acquired by Harvard University.