Hello! My name is Ajay Krishna Vajjala.
I am a fourth-year Ph.D. candidate at George Mason University (GMU) studying Computer Science, expected graduation of May 2025. I received my Bachelor's and Master's degrees in Computer Science from the College of Engineering and Computing at George Mason University. I started the Ph.D. program in the Fall of 2021, and I'm currently working under Dr. David S. Rosenblum. My research interests lie in machine learning, with a focus on recommender systems and applied ML in software engineering. I am passionate about driving progress in machine learning, information retrieval, and software engineering research, particularly through real-world applications of LLMs to solve complex challenges, improve developer workflows, and enhance user experiences.
In addition to my academic background, I have industry experience as a research intern at Microsoft Research, where I developed innovative solutions that automated developer workflows, significantly enhancing productivity by leveraging LLMs. I have also published my work in top ML conferences as a lead author and collaborated on various interdisciplinary projects. I’m eager to bring my diverse experience to a research-driven role in the industry, where I can continue advancing machine learning applications to drive real-world impact.
Seeking full-time opportunities as a Research Scientist, Applied Scientist, or Researcher!
August 2024 - Presented my internship work at the HCAIX Taks Series at Microsoft Research (MSR)!
April 2024 - Successfully defended my PhD Proposal!
April 2024 - Presented my paper at the SIAM International Conference on Data Mining (SDM) in Houston, Texas!
April 2024 - Invited as a speaker at the GMU Software Engineering Seminar series, where I gave a talk on my latest research!
March 2024 - My paper, titled "Analyzing the Impact of Domain Similarity: A New Perspective in Cross-Domain Recommendation," has been accepted to the International Joint Conference on Neural Networks (IJCNN 2024) !
February 2024 - Accepted an offer to join Microsoft Research (MSR) as a Research Intern for the summer of 2024!
December 2023 - My paper, titled "Vietoris-Rips Complex: A New Direction for Cross-Domain Cold-Start Recommendation," has been accepted to the SIAM International Conference on Data Mining (SDM 2024)!
December 2022 - Completed M.S. in Computer Science from GMU!
September 2022 - Attended the ACM Conference on Recommender Systems (RecSys), and served as a student volunteer!
April 2022 - Passed my written and oral PhD Comprehensive Exams!
January 2022 - Accepted an offer to start as a Graduate Research Assistant at GMU!
August 2021 - Accepted an offer to start as a Graduate Teaching Assistant at GMU!
May 2021 - Joined an interdisciplinary research team at the Center for Adaptive Systems of Brain-Body Interactions (CASBBI)!
February 2021 - Admitted into the Computer Science PhD Program at George Mason University!
Demeber 2020 - Graduated with a B.S in Computer Science from George Mason University!
2024
Under Review
D. Meher, A. Krishna Vajjala, and D. Rosenblum. "Understanding User Behavior Consistency in Cross-Domain Recommendation: An LLM-Based Approach" (Under Review)
2024
Under Review
A. Krishna Vajjala, Ar. Krishna Vajjala, C. Badea, C. Bird, R. Deline, J.Entenmann, N. Forsgren, A. Hramadski, S. Sanyal, O. Surmachev, and T. Zimmermann. "Enhancing Differential Testing: LLM-Powered Automation in Release Engineering" (Under Review)
2024
Under Review
Ar. Krishna Vajjala, A. Krishna Vajjala, Y.Yan, S. Pothagoni, D. Poshyvanyk, and K. Moran. "FRAME: Enhancing Multimodal GUI Embeddings with Structural Information" (Under Review)
A. Krishna Vajjala, Ar. Krishna Vajjala, Z. Zhu, and D. Rosenblum. "Analyzing the Impact of Domain Similarity: A New Perspective in Cross-Domain Recommendation." In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN 2024), Yokohama, Japan, June 30th to July 5th, 2024.
A. Krishna Vajjala, D. Meher, S. Pothagoni, Z. Zhu, and D. Rosenblum. "Vietoris-Rips Complex: A New Direction for Cross-Domain Cold-Start Recommendation." In Proceedings of the SIAM International Conference on Data Mining (SDM 2024), Houston, TX, U.S. 18th-20th April, 2024. [Acceptance Rate: 29.2%]
June 2024 - August 2024
During my time as a Research Intern on the SAINTES team at Microsoft Research (MSR), I implemented machine learning solutions to streamline release engineering workflows. I collaborated with the IDNA product team to explore AI-driven productivity enhancements for engineers and conducted interviews with on-call engineers to identify inefficiencies. I developed a novel model using Large Language Models (LLMs) to automatically label differences between test and production environments as "noise" or "critical," improving efficiency.
I fine-tuned GPT-3.5 and GPT-4 using Azure OpenAI Studio, achieving 98% accuracy in offline evaluations and consistently over 90% accuracy in live on-call sessions. This reduced labeling time from 10 hours to just 15 minutes, significantly enhancing developer productivity. I presented the results to the Corporate Vice President (CVP) and senior leadership in IDNA, and completed a successful tech transfer, where I transferred my model into the product team’s production workflow.
May 2022 - Present
I am working on a project to enhance cross-domain recommendation capabilities using state-of-the-art language models. By introducing zero-shot and few-shot prompting techniques, I optimized the recommendation process across various domains. The project involves fine-tuning open-source models like LlaMA-2 with parameter sizes ranging from 7B to 70B. This process was conducted using parameter-efficient fine-tuning (PEFT) methods. A critical component of the project was the use of QLoRA for fine-tuning LLMs on dual-domain data, improving the models' recommendation performance. The fine-tuning process was powered by multiple A100 GPUs, ensuring efficient handling of large-scale data and model parameters. I utilized Weights and Biases for visualizing the fine-tuning process of QLoRA, allowing for a detailed and transparent view of the model's performance enhancements. This comprehensive approach demonstrated the potential of LLMs in revolutionizing the field of recommender systems through advanced transfer learning techniques.
I conducted reasearch that integrated the Rips Complex from computational geometry with deep learning techniques. The primary goal was to effectively transfer user preferences across varying domains, leading to the creation of novel user profiles tailored for personalized recommendations. The approach was particularly groundbreaking in addressing extreme cold-start scenarios, registering a substantial performance boost of over 20% compared to existing leading methods. This project involved meticulous fine-tuning using 5-fold cross-validation to enhance the model's efficacy on novel data. Technologically, it leaned heavily on Python and Tensorflow, with all complex computations being adeptly handled by the powerful Nvidia A100 GPU.
I leveraged Python, GloVe pre-trained embeddings, and BERT techniques from NLP to develop sophisticated domain similarity metrics. This led to the formulation of baseline cross-domain recommendation algorithms, taking advantage of the open-source Recbole-CDR library. The algorithms were rigorously assessed across a diverse array of 18 domain combinations, benchmarked against three industry-leading cross-domain algorithms. Intriguingly, the findings revealed that the recommendation efficacy was largely invariant to the specific domain combinations, a conclusion drawn using a paired t-test. For optimum efficiency, all computational processes and experiments were powered by the high-performance Nvidia A100 GPU.
I utilized Python, TensorFlow, and the Nvidia A100 GPU to develop a cross-domain recommender system using a conditional GAN. The project centered around the synthetic generation of target domain item embeddings, which were seamlessly infused with source domain data. Furthermore, these synthesized embeddings were combined with pre-existing user embeddings, providing enhanced personalized recommendations. As a testament to the robustness of the approach, we conducted extensive tests using various top-tier cross-domain recommendation algorithms. The results were promising: our model outperformed the competition, registering a preliminary performance surge of 5% over conventional benchmarks.
May 2021 - May 2022
As an NSF National Research Trainee Fellow at the Center of Adaptive Systems of Brain and Body Interaction, I led an interdisciplinary team to develop a web application aimed at helping incarcerated individuals access reentry services. I managed the development using React for the front-end, Node/Express for the back-end, and MySQL for the database. Additionally, I initiated key partnerships with the Director of DC Jail and the Vice President of American Prison Data Systems (APDS) to trial the application on APDS tablets, facilitating real-world implementation.
Aug 2021 - May 2022
Taught 60 person labs twice a week for two semesters, while creating programming assignments for the students to work on. Held office hours every week to help students understand programming concepts, and worked on grading labs and programming assignments for over 100 students.
Jan 2020 - Dec 2020
Working as a teaching assistant for Data Mining Course at GMU. Assisted students on machine learning algorithms, and help them understand machine learning and data science concepts (Classification and Clustering Algorithms). Help students work with KNN, Neural Networks, K-Means clustering, Decision Trees, and held office hours every week.
May 2019 - Aug 2019
Worked on application for hospitals to keep track of patient and staff data in hospitals. I developed API’s to retrieve patient information, doctor information, and track call logs. I used Node JS for developing API’s, and React JS for frontend development. The data for the application was stored in AWS S3, and it was deployed on an EC2 Instance. In addition, a CI/CD pipeline was set in place for developing the web application.
Aug 2021 - Present
George Mason University
Cousework: Theory of Computation, Graph Algorithms, Research Methods in CS
Jan 2021 - Dec 2021
George Mason University
Cousework: Mining Massive Datasets with Map Reduce, Component-Based Software Development, Artificial Intelligence
Aug 2017 - Dec 2020
George Mason University
Cousework: Operating Systems, Analysis Of Algorithms, Data Mining, Data Structures, Object Oriented Programming
Address
Virginia, USA
akrish@gmu.edu