Hello, my name is

Kavita Yadav

Research Technology & Innovations Engineer

at Thales with proven experience in building real world data products using Reasearch, Big Data & Machine learning skills!!

My current ventures


I am a meticulous Research Technology & Innovations Engineer at Thales with more than 6 years of experience. I hold Master's degree in Computer Science with a specialization in Information Security from the PEC University of Technology, Chandigarh. During my carrier I have worked on various tools and technologies in field of Big Data and Machine Learning. I have also got the exposure to work as full stack developer in Thales Innovation Hub. As full stack developer, I have developed and delivered production-level code for a big data dashboard analytics solution, enabling real-time data visualization and driving informed decision-making across the organization. I have contributed to build prototypes to solve customer centric problems.

I have a extensive background in performing EDA, Feature-selection, Model Selection, Generation, Validation across both machine learning and deep learning along with fundamental understanding of Inferential/Descriptive/Predictive statistics. I use python in my day to day work and highly skilled in using pandas, sklearn, numpy, seaborn, matplotlib, tensorflow, mlxtend, statsmodels, math, jupyter, logging, etc.

I have hands on experience in various language, tools & technologies like MSSQL, Clikchouse, PostgreSQL Vue.js, C, C#, C++, Typecript/JavaScript, Pug/Jade, XAML, HTML, CSS, HDFS, Spark, Visual Studio, IntelliJ, Eclipse, Netbeans, bitbucket, Anaconda, jupyter notebook, WinMerge, QDire, DBeaver, Linux, Windows, Vxworks-Tornado, Embedded C++, Borland C++, Agile Methodologies, Git/Jira/Bit bitbucket, SVN, Wireshark, NMAP, Netsparker, etc. I have good understanding of Computer vision, Natural Language Processing, CNN, RNN and other deep learning architectures.

I am passionate about Artificial Intelligence technologies and I am Professionally certified in Google Data Analytics and IBM AI Engineering.

Machine Learning

Built predictive models to ease the future business problems in real world.

Data Science

Use large scale data in a scientific way to provide the solution for real world data problems.

Research

Use systematic study approach to conclude the project tools & technologies requirements.

Deep Learning

Use deep neural network to unlock the intelligence on unstructured data.

My Projects

Here are some of my projects i have been working on to enhance my skills in free time....

List of key projects which I have worked on, as a RTI Engineer in Thales:
  1. Entries Predictions - This project is about predictive modeling of entry data, which helps in forecasting the number of passengers entering the MTR station. We have tried to make predictions at different levels using different machine learning models. • Short Term that predict next 3 hours using past 2 years data to train LSTM. Mid Term that predict next 7 days using past 15 days entries data to train STLFArima. Overall Trend that predict next 3 hours using past 3 months entries, mid term prediction, edges data to train Keras NN model. • We also applied the same mid term and overall trend solution on waiting time data that help to predict waiting time of passengers at platform to catch the train at MTR stations.
  2. Exit Prediction Analysis - There are two algorithm created by Thales France & Canada to find out the destinations of passengers entering the station. Thales France’s algorithm support metro data and Thales Canada’s algorithm supports buses data. To perform the analysis, Biscay’s metro and bus trips data & GTFS timetable data were used and modified the above algorithm to examine their performance individually and later merged the results of both to see the impact. Also modified Thales France’s algorithm to adapt the bus data.
  3. Alarm Prediction – It is the FP-Growth model, which helps in detecting the avalanche of alarms occurring in the database and performs analysis on those alarm patterns. Also, applied association rules to predict alarms.
  4. Passenger Distribution - It is a Diffusion Metrics model, which helps in predicting the distribution of number of passengers in the MTR network. It predicts where and when the passenger entering the station will be.
  5. Anomalies Detection- This algorithm is designed to detect anomalies. If the number of passengers entering the station is more than the expected space capacity of the platform, it will raise an alert which will help the station operators to take necessary steps for crowd management.
  6. Naia Biscay Passenger Journey Analytics- Driven by the trend for improved passenger experience required by mass transit system operators, Thales has designed “Naia” as a Big data Analytics platform to help them to understand how passengers travel through their network in order to improve the quality of service offered.
  7. Passenger Flow Simulator Analysis- This is a simulator created by Thales France to visualize the flow of passengers in the MTR network. To perform the analysis, Hong Kong MTR trip data and timetable data of multiple lines were used to check the compatibility of the simulator to data from other countries.
  8. BEM Fare Allocation System (Bangkok) - It is Fare Allocation System for Bangkok Expressway & Metro company which maintain the blue line. This system used to manage ticketing data of BEM and automatic generation of financial reports.
  9. Naia Hong Kong Passenger Flow Analytics - It is generating 3 KPIs i.e Platform Crowding, Train Occupancy & Waiting Time and generating route followed by passenger in MTR network from origin to destination.
  10. BKKBLE - Blue Line Extension (Bangkok)
  11. BKKBLU- Blue Line Upgrade (Bangkok)
  12. SZL4- Shenzhen Line 4(Red Line)
  13. Delta.io vs Clickhouse Data Pipeline Comparison: Conducted a performance comparison of two different database technologies for extracting, transforming and loading large dataset.
  14. Auto-Tool to update IO Configurations in Excel: Developed an automated tool using python to modify macro-enabled excel files by comparing them with IO list configuration files. This tool streamlined the System Engineering process of updating Input/Output configuration for various equipment in MTR stations.
  15. NLPtoSQL - Build a model to convert Natural Language Queries to SQL queries using LLMs and Hugging Face.
  16. NLPtoDoc - Build a model to summarize given document based on question asked by user using LLMs and Hugging Face.

My Certifications

Here are some of the latest certifications i have been certified on.

My Skills

A RTI Engineer needs enormous range of skills and knowledge to improve the decision-making process. Below are some skills that I have.
85

Python

80

Machine Learning

75

Deep Learning

90

Data Visualisation

85

SQL

75

Statistics

80

Data Structures and Algorithm

70

DevOps

90

Linux

Connect with me socially

Drop me a Line