Long H. Nguyen

Data Scientist, Engineer

Silicon Valley, CA, USA
US Green Card holder

Work Experience

Data enthusiast, problem solver, Kaggle master with a business mindset.
Dr. Long NGUYEN is currently Director - Data Science at Visa Inc. He has more than nine years of experience in machine learning and data science, plus two years as a software engineer.
He has a strong interest in applying machine learning to real industry problems with big data. He has worked on several industrial projects with significant funding for large multinational corporations, including banks, telcos, consulting firms, and energy corporations. He has also published scientific papers in reputable conferences and journals. His research interests include time series mining, data stream mining, big data, and condition-based monitoring.

Visa

Director - Data Science

June 2019 - Present
    Design and deploy highly innovative ML models from Visa's unique global data. Analyze and extract key insights for a variety of applications, issuers and merchants. Promote new methodology and best practices in the field of data analytics.

Visa

Product Analytics Manager

August 2016 - June 2019
  1. Develop Data Analytics solutions to solve complex business problems.
  2. Work with large data sets using quantitative techniques and build complex statistical/machine learning models that learn from big data.
  3. Hands-on experience with SAS, Hadoop, Hive, R, Python, and Tableau.

McLaren Applied Technologies

Sr. Data Scientist

August 2015 - August 2016

Worked on the following projects:

  1. Predictive Health System
    - Developed intelligent models for monitoring the asset health of mining trucks.
  2. Life Insights System
    - Developed systems for monitoring people's health and improving their performance in daily life.
  3. Scheduling for Manufacturing
    - Performed scheduling under several complex constraints, including minimizing changeover time, minimizing disposed products, and accounting for depletion rate.

Institute for Infocomm Research (I²R)

Data Scientist

August 2013 - August 2015

Worked on seven industrial projects with significant funding for large multinational corporations.

Selected projects and roles:

  1. Sleep Quality Measurement – Principal Investigator
    - Developed multi-layer models that use accelerometer and HRV data to accurately detect wake and different sleep stages.
    - Methodology: Signal processing, imbalanced learning, Matlab, Java.
  2. Human Gait Analysis – Lead Scientist
    - Classified normal and abnormal gaits in real time using accelerometer data.
    - Applications: avoiding injury in athletes and monitoring recovery in rehabilitation.
    - Methodology: Anomaly detection, one-class classification, Java, Matlab.
  3. Wattalyzer – Member
    - Developed an integrated solution for smart grid condition monitoring through advanced sensing and real-time analytics.
    - Deployed active learning to significantly reduce expensive labeling cost. The algorithm was published in IEEE Prognostics and Health Management (PHM), 2015. (link), (pdf)
    - Methodology: Data stream analytics, Java, R, MySQL.
  4. Churn Prediction for Telco – Member
    - Developed a novel technique to process terabytes of customer data for churn prediction.
    - Delivered twice the accuracy improvement the customer expected.
    - Methodology: GBM, boosted decision trees, R, ORE (Oracle).
  5. Location Profiling for Mobile Phone Data – Member
    - Developed simple yet elegant algorithms for detecting places of interest using terabytes of call detail data.
    - The algorithm was later published in IEEE Mobile Data Management, 2014. (link), (pdf)
    - Methodology: Frequent pattern analysis, Java, multithreading.

Nanyang Technological University

Researcher

Jan 2013 - Aug 2013

Researched differentially private publication of data streams.

Nanyang Technological University

Teaching Assistant

Jan 2012 - Jan 2013

Undergraduate courses: Discrete Mathematics, Algorithms & Computing.

Airbus Group Innovations, Singapore

Intern Data Analyst

Jan 2010 - Jan 2012

Developed multivariate statistical approaches for helicopter health monitoring.

Ho Chi Minh City International University

Researcher & Lecturer

Jan 2008 - Jan 2009

Undergraduate courses: Computer Graphics, Data Structures & Algorithms, Web Programming.

Skills

Programming languages: R, Python, Java, Matlab – proficient; C++, Hadoop, Spark – competent.
Software: MySQL, Weka, RapidMiner, Windows, Unix/Linux, Mac.
Human languages: Vietnamese – native; English – fluent.

Awards

Visa’s Go-Share program

Visa’s Go-Share program to Dubai, which lets high-performing staff exchange experience and skill sets.

2017

LookSeeWellington career trip

One of the top 93 candidates from more than 48,000 technical applicants (link), (photo).

2017

Kaggle Inc.

Kaggle Master, with a highest global rank of 294th out of 500k.

Feb 2016

Top performer candidate

Institute for Infocomm Research (I²R)

April 2015

Graduate research scholarship

EADS Company (~180,000 SGD)

January 2009

Education

Nanyang Technological University (NTU), Singapore

Computer Science

Doctor of Philosophy

Jan 2009 - Dec 2012

Ho Chi Minh City University of Technology, Vietnam

Computer Science

Bachelor (Honors)

Sep 2002 - Jan 2007

Selected Publications

A survey on data stream clustering and classification.

Knowledge and Information Systems

2015

Closed motifs for streaming time series classification.

Knowledge and Information Systems

2014

Concurrent Semi-supervised Learning with Active Learning of Data Streams.

Transactions on Large-Scale Data- and Knowledge-Centered Systems

2013

Heterogeneous ensemble for feature drifts in data streams.

Advances in Knowledge Discovery and Data Mining

2012

Activities

Took part in several volunteer activities to deliver food and gifts, and to educate orphans.
Reading, traveling, and photography.

References

"While working for our institute, Dr. Long put his comprehensive and versatile know-how into practice in a targeted way to implement projects effectively. He possesses excellent expertise in data engineering, advanced statistics, and predictive modelling techniques to build, maintain, and improve multiple real-time decision systems.

At all times, I have found Dr. Long to be dependable, reliable, hard-working, courteous, and responsible. I still remember that he was willing to work overtime when project deadlines required it. He also volunteered to work on an extra project when his colleagues in that project required his expertise and support.

Dr. Long has shown a strong interest in applying machine learning to real industry problems with big data. In his two years in our department, he successfully worked on seven industrial projects with significant funding for large multinational corporations such as telcos, consulting firms and energy corporations. He was promoted to lead data scientist of the joint lab between I²R and McLaren Applied Technologies.

Dr. Long is a very good team player and had excellent relations with colleagues. He also showed very good team leadership and project management skills. He not only managed his team well by balancing workload for team members based on their skill sets, but also ensured delivery of projects on-time."
— Dr. Shonali Krishnaswamy, Head of Data Analytics Department, I²R, A*STAR, Singapore (letter)
"Long displayed a tremendous amount of dedication to his work. He demonstrated a strong can-do attitude and versatility in picking up data mining as well as in working with me on an industrial project on health monitoring. His programming skill was excellent too, as he was able to quickly understand open-source code and modify it for the project."
— David Woon, Head of Operations at Airbus Group Innovations, Singapore (letter)