👤 About Me
Hello! I'm Pranav Agarwal, a pursuer of knowledge and innovation within the realm of technology. My journey began in the lively streets of India, where, as a young boy, I was fascinated by the magic of computers by spirited competitions in car racing games with my elder brother, sparking a passion for technology that continues to drive me forward.
At VIT University, my undergraduate studies in Computer Science allowed me to dive deeper into the mechanics of programming and data structures. It was here that I developed a keen interest in solving complex problems through code, leading to my first project— a car game built using C++.
My professional journey has been equally diverse, spanning several industries from game development, medicine to aerospace, each role enriching my expertise and broadening my perspective on how technology can be applied to different fields.
Currently pursuing a Master’s in Data Science at UC Irvine, I am on a quest to harness the power of machine learning to transform vast amounts of data into actionable insights. My goal is to innovate in ways that prioritize ethical considerations and privacy, ensuring that advancements in AI and machine learning are used responsibly.
Away from the keyboard, my passions are varied and vibrant. I find rhythm and poetry in hip-hop, the strategic depth and speed in Formula 1 racing, and the exhilarating competition of soccer. Each of these interests adds a layer of balance and joy to my life, fueling my work with renewed energy and inspiration.
🎓 Education
Bachelor of Technology in Computer Science
I earned my B.Tech in Computer Science and Engineering at Vellore Institute of Technology (VIT), India. My academic journey here was marked by engagement with core subjects like data structures, database management, and natural language processing. As an active member of ACM-VIT, I contributed to several projects, notably one focused on the automatic segregation of wet and dry waste using computer vision. This practical application of classroom learning fueled my interest in real-world technological solutions and prepared me for advanced studies.
Master of Data Science
I'm pursuing a Master of Data Science at the University of California, Irvine (UCI), where I achieved a GPA of 3.9. At UCI, I specialized in machine learning, artificial intelligence, and deep learning. My projects ranged from theoretical research to practical implementations, including an in-depth evaluation of foundational models for pathology in survival prediction. This program has honed my analytical skills to tackle complex data-driven challenges in the tech industry.
🔧 Skills & Certifications
Throughout my academic and professional journey, I have cultivated a diverse and robust skill set that spans various aspects of technology and data science:
- Programming Languages: Proficient in Python, C++, R, and SQL.
- Frameworks and Libraries: Experienced with PyTorch, TensorFlow, spaCy, Keras, Seaborn, and Pandas.
- Web Development: Skilled in using HTML, CSS, JavaScript, and Django for web applications.
- Cloud Technologies: Competent in managing and architecting solutions on AWS, including the use of EC2, S3, and RDS.
- Containerization and Orchestration: Knowledgeable in Docker and Kubernetes for developing scalable applications.
- NLP: Capable of building and deploying machine learning models, leveraging libraries like Hugging Face and LangChain.
- Database Management: Experienced in using SQL databases as well as NoSQL databases like MongoDB and Neo4J.
- Big Data Tools: Familiar with Apache Spark for large-scale data processing.
- Data Visualization: Proficient in creating insightful visualizations with Tableau and Python libraries like Matplotlib.
My certifications further bolster my technical capabilities:
- AWS Solutions Architect Associate: Certified in designing distributed systems on Amazon Web Services.
View Certificate - Applied Machine Learning: Certified by the University of Michigan on Coursera, focusing on practical machine learning implementation.
View Certificate
🔗 Open Source Contributions
Mozilla
I am an active contributor to Mozilla, focusing on projects that enhance web technologies and user privacy. My contributions help improve the performance and security features of Mozilla products, supporting a free and open web.
FOSSASIA
My work with FOSSASIA includes contributing to software that promotes the use of open source technology in Asia. These contributions span across various projects aimed at enhancing accessibility, education, and open source adoption.
OpnTec
At OpnTec, I contribute to the development of open source tools and technologies, enhancing innovation and collaboration within the tech community. My efforts focus on improving software solutions that facilitate open communication and technology sharing among developers and engineers worldwide.
🛠 Experience
Machine Learning Infrastructure Intern | Safran
July 2024 - Nov 2024
I accelerated development time by 18% by collaborating in the design and development of cloud-based data applications, specializing in machine learning models for predictive health management systems in aerospace equipment. This role involved intricate teamwork and technical skills to boost efficiency in developing solutions that predict equipment failures before they occur.
Further, I increased prediction accuracy by 25% and reduced downtime by 20% by engineering and integrating software solutions to enhance prototype systems, verifying their functional performance, and optimizing predictive algorithms within an IoT framework on AWS using SageMaker and Amazon ECR.
I also improved system reliability to 97% by working within a dynamic, cross-functional team to merge various technologies. This ensured the deployment of cloud-hosted machine learning solutions tailored for real-time health monitoring, enhancing both the effectiveness and reliability of our aerospace equipment's health management systems.
Key Skills: Machine Learning, AWS, SageMaker, Amazon ECR, IoT, Software Engineering, Data Analytics.
Machine Learning Student Researcher | UCI AI Center
June 2024 - Present
At UCI's AI Center in Dr. Jana Lipkova’s OctoPath lab, I spearheaded the comprehensive evaluation of patch-level and whole-slide models such as ResNet50, UNI, CONCH, REMEDIS, and HIPT. My work, which utilized datasets from TCGA, CPTAC, and private cohorts, improved model evaluation efficiency by 11%.
I also increased accuracy to 94% for survival prediction targeting brain and lung cancers, delivering insights into the effectiveness of foundational models in clinical survival prediction.
Key Skills: Machine Learning, Data Analysis, Python, TensorFlow, Statistical Modeling.
Natural Language Student Researcher | UCI INCHES Lab
May 2024 - Sep 2024
In Dr. Angela Lukowski’s lab at UCI, I developed a program using Python and spaCy that automatically parsed narrative data into propositional phrases, reducing the manual workload from two weeks to a single day.
This innovation enhanced the efficiency of analyzing event memory studies by 77%, adapting to varied datasets with flexible rule implementation.
Key Skills: Natural Language Processing, Python, spaCy, Data Parsing, Automation.
Cloud Engineer | Airbus
July 2021 - August 2023
As a Cloud Engineer at Airbus, I played a pivotal role in migrating 65% of departmental systems to AWS, enhancing operational efficiency and security.
My strategic implementations reduced security alerts by 20% and achieved annual cost savings of $70,000 by integrating a recommendation engine that optimized cloud resource allocation.
Additionally, I developed a comprehensive dashboard with Amazon QuickSight, providing actionable insights for strategic planning and project management.
Key Skills: Cloud Computing, AWS, Security Management, Data Analytics, Systems Engineering.
NLP Software Engineer Intern | Novartis
January 2021 - June 2021
During my internship at Novartis, I engineered an NLP-based chatbot using Azure Bot Framework that reduced response times by 1.7 hours, significantly improving customer service efficiency.
The implementation of a chatbot analyzer led to a 33% surge in usage by providing actionable insights for performance enhancement and a 23% increase in positive customer feedback through A/B testing of intents based on user feedback.
Key Skills: Natural Language Processing, Python, Machine Learning, Azure Bot Framework, Data Analysis.
Software Developer Summer Intern (AR/VR) | PlayShifu
May 2019 - June 2019
As a Software Developer Intern specializing in AR/VR at PlayShifu in Bangalore, India, I played a pivotal role in the software development of Marker Less Augmented Reality (AR). My contributions led to the creation of a pioneering AR prototype that significantly reduced the time required for AR object development by 70%. This achievement not only streamlined production processes but also set new benchmarks for efficiency within the team.
Furthermore, I orchestrated the design and initial coding of a critical feature for PlayShifu's flagship iOS product. This key development was instrumental in driving the product's annual sales to reach $10 million. My efforts involved extensive use of Unity for optimizing performance, ensuring seamless gameplay experiences on less powerful devices. These optimizations broadened the user base by 16%, making high-quality AR experiences accessible to a wider audience.
Key Skills: Augmented Reality, Unity, iOS Development, Performance Optimization, Software Development.
📁 Projects
Lane Detection for Autonomous Driving Using Attention
This project implemented advanced lane detection techniques using PyTorch and OpenCV, crucial for improving the navigation systems of autonomous vehicles. The approach involved comparing a standard UNet model with a custom variant featuring residual blocks and attention mechanisms for enhanced lane marking detection accuracy that take into account the current data stream along with past knowledge to make predictions. The attention mechanism helps in the model focusing the relevant details of the image. The TUSimple dataset was used, showcasing multi-lane highway scenarios under varying conditions.
Key steps included preprocessing with grayscale conversion, Gaussian blur, and Canny edge detection, followed by random transformations to augment the dataset. The custom UNet model excelled at detecting lanes even in obstructed views, illustrating the advantages of attention-focused segmentation.
Technologies Used: Python, PyTorch, OpenCV, UNet, Attention Mechanisms
View on GitHub | Read Article on Medium
Credit Reporting Trends Analysis
This project leveraged Python, Pandas, Seaborn, and NumPy to process and visualize data trends within credit reporting. Focusing on predictive analytics, it identified key trends affecting credit scores and market conditions, using comprehensive datasets from Equifax, Experian, and TransUnion.
By analyzing historical data, the project forecasted significant trends and their impacts on future credit activities, providing valuable insights for financial institutions and policymakers. This analysis was instrumental in developing strategies that align with evolving market dynamics.
Technologies Used: Python, Pandas, Seaborn, NumPy
Forest Fire Detection Using Classifiers and Transfer Learning
This project aimed to enhance forest fire detection capabilities through the use of sophisticated machine learning classifiers and transfer learning techniques. By integrating SVMs, KNN, and Naive Bayes with pre-trained models such as Inception and VGG19, I significantly improved the accuracy and speed of detection. The application of these advanced techniques allowed for early identification of fire outbreaks, potentially saving landscapes and lives by facilitating quicker response times.
The project also involved extensive data gathering and processing, where I implemented image recognition and processing technologies to identify fire characteristics from satellite imagery. The success of this project not only demonstrated the effectiveness of combining multiple machine learning strategies but also set a precedent for future environmental monitoring technologies.
Key Skills: Machine Learning, SVM, KNN, Naive Bayes, Transfer Learning, Image Processing, Python, Inception, VGG19.
Duplicate Detection in Job Postings using NLP and Milvus
The goal of this project was to solve the prevalent issue of duplicate job postings across multiple platforms. Using NLP techniques, I developed a system that could analyze and understand the semantic content of job descriptions. By implementing a vector search powered by Milvus, the system effectively identified and flagged duplicate postings, thereby enhancing the quality and trustworthiness of job boards.
Further development included optimizing the Sentence Transformers for better context capture and refining the cosine similarity calculations to improve the precision of duplicate detection. This refinement allowed the system to scale efficiently, adapting to new job postings and maintaining high accuracy over time. The project exemplifies how targeted AI solutions can address specific industry challenges, improving operational efficiency and user experience.
Key Skills: NLP, Milvus, Sentence Transformers, Python, Cosine Similarity, Vector Search.
Bombay Stock Exchange Equity Analysis
This project developed a comprehensive web application designed to monitor and analyze the daily performance of equities on the Bombay Stock Exchange. Utilizing real-time data scraping techniques, the system provided updated stock performance metrics that were crucial for day traders and long-term investors alike. The incorporation of machine learning algorithms allowed for predictive analysis, offering forecasts that helped users make informed investment decisions.
The backend of the application was powered by sophisticated data cleaning and analysis routines, ensuring data integrity and relevance. By employing advanced data visualization tools, the platform presented complex data in an accessible format, enabling users to quickly grasp market trends and variances. This project showcased the potential of integrating financial data with modern technology to create powerful analytical tools that empower users with actionable insights.
Key Skills: Data Scraping, Data Cleaning, Machine Learning, Python, JavaScript, Predictive Analytics, Data Visualization.
Chat-Application
The Chat-Application is a dynamic web-based platform designed to facilitate real-time communication across multiple chat rooms. Utilizing Node.js and Socket.IO, this application supports instantaneous messaging and location sharing, enhancing user interaction and engagement. Hosted on Heroku, it provides a robust and accessible environment for users to join different chat rooms, send messages, and share their locations seamlessly.
Technically, the application leverages the power of WebSocket technology through Socket.IO, which enables real-time, bi-directional communication between web clients and servers. This technology ensures that messages and location data are transmitted instantly without the need for refreshing the web page, offering an interactive experience akin to commercial messaging platforms. The backend, developed with Node.js, manages the application’s operations efficiently, handling multiple user connections, session management, and data transmission with high reliability and low latency.
Key Skills: Node.js, Socket.IO, Real-Time Communication, WebSockets, Heroku, Full-Stack Development.
Expensify-app
The Expensify-app is a web application designed to help users track their daily savings and expenditures with ease and precision. Developed using React-JS, this application offers a dynamic and responsive user interface that simplifies financial management. Utilizing FireBase for backend database services ensures that all data is securely managed and scalable.
Technically, the application is bolstered by Node.js and Express which serve up a robust server environment, facilitating efficient data handling and application logic operations. The integration of these technologies allows the Expensify-app to offer real-time updates and seamless user experiences. The project's setup involves steps like cloning the repository, building the development environment, and running a local server, demonstrating a comprehensive understanding of modern web development practices.
Key Skills: React-JS, FireBase, Node.js, Express, Web Development, Real-Time Data Handling.
Node.js Task Management Application
The Node.js Task Management Application is a robust backend system designed to manage user tasks efficiently. Developed using Node.js and Express, this application leverages MongoDB for database management, ensuring scalable and persistent storage of user data. It features comprehensive user and task management capabilities, from user creation and authentication to task allocation and updates.
User authentication is handled via JWT tokens, which secure the application by ensuring that users can only access their specific details and tasks, thereby protecting user data from unauthorized access. The system's architecture supports complex user interactions and is capable of handling multiple user sessions simultaneously, thanks to its efficient use of asynchronous JavaScript operations and middleware for session management.
Key Skills: Node.js, Express, MongoDB, JWT, REST API, Heroku, Security Implementation, Asynchronous JavaScript.
Weather-Application
The Weather-Application is a web-based platform designed to provide real-time weather data for any specified location. This application is powered by a Node.js backend and utilizes Express to manage server operations, ensuring smooth data handling and service delivery. It leverages Handlebars for templating web pages, creating a dynamic and user-friendly interface that adjusts according to the weather data received.
The core functionality of the application involves integrating Mapbox to retrieve geographic coordinates based on location names and WeatherStack to fetch detailed weather information using those coordinates. This dual-API setup allows for accurate and up-to-date weather reports, enhancing user experience by providing essential weather details like temperature, humidity, and wind speed. Hosted on Heroku, the application demonstrates a scalable solution for accessing and displaying weather data through a web interface.
The application also serves as a web adaptation of a command-line weather tool, showcasing my ability to transition functionalities between different platforms and interfaces. This adaptation not only broadens the accessibility of the tool but also illustrates my skills in full-stack development.
🌟 Extracurricular Activities
Winner of VITHACK'19
Awarded $1500 for the best project and $300 for the best website at VITHACK'19, a competitive hackathon focused on fintech innovations. My project featured a cutting-edge application that utilized computer vision to detect electronic devices in a home environment. By simply scanning the room with a smartphone camera, the system identified present electronics, enabling the fintech company to tailor personalized advertisements and product suggestions based on actual user possessions.
Winner of WTM GDG VIT'19
Developed a pivotal women’s safety application for WTM GDG VIT'19, which won accolades for its innovative approach to enhancing women's security in urban environments. The app collected data on past incidents of dangers faced by women and used this data to create a dynamic heatmap. It then provided intelligent routing suggestions to help women navigate through areas with the lowest risk, significantly boosting their safety in potentially hazardous locales.
Code for Good
Participated in Code for Good, an initiative dedicated to bringing basic computer science education to underprivileged schools. I engaged with students who lacked access to computers, teaching them fundamental computing skills and igniting an interest in technology that could potentially transform their futures.
Soccer
An enthusiastic participant in local soccer leagues, actively engaging in regional tournaments. Soccer not only enhances my physical fitness but also fosters a sense of teamwork and strategic thinking, skills that are invaluable both on and off the field.