Toggle navigation
JMichael Blog
Home
About
Tags
Video
Tags
keep hungry keep foolish
R
Regression
Project
Experiment Design
Block Design
Product
Python
Data Mining
Data Exploration
Classification
Machine Learning
Data Science
Bigdata
Jupyter
Excel
Tableau
Demographic analysis
Interactive Dashboard
Sports Analytics
Presentation
Plotly
Modelling
Fraud Detection
Big Data
Supervise Learning
Classfication
Linux
Jupyter Server
Selenium
Numpy
Pandas
Text analytics
Google Sheet
cohort analysis
marketplace
Matlab
SQL
Java
Latex
C++
Jupyter Lab
Interface
NLP
Text Analytics
Resume
Workshop
Panel Talk
Sleep Quality
Text Analysis
Market Analysis
Data Open
Data Scien
Energy Consumption Calculation
Tiktok Trust and Safety
REGEX
AB Testing
Experiment
Analytics
Product Feature
Cheat Sheet
Statistics
R
TOP -- Data Science Knowledge Complete Summary
This is a continuously updated data-science concept post edit and organized by me to summarize and sort out cs, stats, machine learning and other related topics all in one from past learning.
Validation of XML Files using R
How does R enable automatic XML files validation and error log collection
Handling Big Data in R
A step by step guidance for handling large dataset 30GB in R
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Text Analytics and Neural Network
Predict the influential factors in college based on students' comments on college life.
Sleep Quality Analysis using R
Follow Up previous Python project by re-writting in R.
Natural Language Processing for Entity(name, place, etc.) Extraction using R
Apply NLP techniques in R to Annotate people and places in text files and extract them into a clean table.
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
HopSkipDrive Driver Marketplace Analysis
A marketplace analysis for 27k data of suppliers & customers, including cohort analysis, concentration, take rate, conversation rate, power usrs etc. using Excel and R.
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Textbook Resources for Data Science (Copyright Owned by the Authors)
Textbooks from Pubic Internet
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Experiment Design -- The Effects of Emotion and Alcohol Consumption on Short-Term Memory
A Randomized Complete Block Design (RCBD) is chosen for the purpose of our research. Time spent in playing memory game serves as the variable of interest. Shorter time to finish a memory game indicates a better memory ability of the participant.
Regression Analysis on Happiness Level Project
The first regresion analysis for happiness level and other dependent variables on a survey data.
Regression
Regression Analysis on Happiness Level Project
The first regresion analysis for happiness level and other dependent variables on a survey data.
Project
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Text Analytics and Neural Network
Predict the influential factors in college based on students' comments on college life.
Natural Language Processing for Entity(name, place, etc.) Extraction using R
Apply NLP techniques in R to Annotate people and places in text files and extract them into a clean table.
Machine Learning Application on Heart Disease Prediction
Preventing heart disease is important. Good data-driven systems for predicting heart disease can improve the entire research and prevention process, making sure that more people can live healthy lives.
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Experiment Design -- The Effects of Emotion and Alcohol Consumption on Short-Term Memory
A Randomized Complete Block Design (RCBD) is chosen for the purpose of our research. Time spent in playing memory game serves as the variable of interest. Shorter time to finish a memory game indicates a better memory ability of the participant.
Regression Analysis on Happiness Level Project
The first regresion analysis for happiness level and other dependent variables on a survey data.
Experiment Design
Experiment Design -- The Effects of Emotion and Alcohol Consumption on Short-Term Memory
A Randomized Complete Block Design (RCBD) is chosen for the purpose of our research. Time spent in playing memory game serves as the variable of interest. Shorter time to finish a memory game indicates a better memory ability of the participant.
Block Design
Experiment Design -- The Effects of Emotion and Alcohol Consumption on Short-Term Memory
A Randomized Complete Block Design (RCBD) is chosen for the purpose of our research. Time spent in playing memory game serves as the variable of interest. Shorter time to finish a memory game indicates a better memory ability of the participant.
Product
Experiment Design -- The Effects of Emotion and Alcohol Consumption on Short-Term Memory
A Randomized Complete Block Design (RCBD) is chosen for the purpose of our research. Time spent in playing memory game serves as the variable of interest. Shorter time to finish a memory game indicates a better memory ability of the participant.
Python
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Python String Pattern Process
Using Python to reshape String col. to long String, and conduct pattern match
Energy Consumption Calculation Based on Differnt Time and Price (Tarrif) Using Python
Python Calculation and Time Manipulation
Validation of XML Files using R
How does R enable automatic XML files validation and error log collection
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Sleep Quality Analysis using Python
final project demo for python data analysis course fall 19 at UCLA.
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
Machine Learning Application on Heart Disease Prediction
Preventing heart disease is important. Good data-driven systems for predicting heart disease can improve the entire research and prevention process, making sure that more people can live healthy lives.
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
SPE JupyterHub & Python on remote Linux/Unix servers
A Presenation to 45 related/interested fellows at Sony Pictures 19 summer -- Architecture for R , Python and Julia environments for Corporate Data Science Project Initiatives.
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Textbook Resources for Data Science (Copyright Owned by the Authors)
Textbooks from Pubic Internet
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Data Mining
Data Science Concepts and Notes (WIP)
Data Science, Statistics and Data Mining Notes (UCLA CS145)
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Data Exploration
Sleep Quality Analysis using R
Follow Up previous Python project by re-writting in R.
Sleep Quality Analysis using Python
final project demo for python data analysis course fall 19 at UCLA.
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Classification
Machine Learning Application on Heart Disease Prediction
Preventing heart disease is important. Good data-driven systems for predicting heart disease can improve the entire research and prevention process, making sure that more people can live healthy lives.
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Machine Learning
Machine Learning Application on Heart Disease Prediction
Preventing heart disease is important. Good data-driven systems for predicting heart disease can improve the entire research and prevention process, making sure that more people can live healthy lives.
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Data Mining & Machine Learning applied in Predictive Analysis
Exploring out the most influential variables in predicting the affordability among 79 potentially variables and the most effective model by applying different classification methods including Logistic Regression, K-Nearest Neighbors Method, and Random Forest
Data Science
TOP -- Data Science Knowledge Complete Summary
This is a continuously updated data-science concept post edit and organized by me to summarize and sort out cs, stats, machine learning and other related topics all in one from past learning.
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Validation of XML Files using R
How does R enable automatic XML files validation and error log collection
Handling Big Data in R
A step by step guidance for handling large dataset 30GB in R
Textbook Resources for Data Science (Copyright Owned by the Authors)
Textbooks from Pubic Internet
Bigdata
Validation of XML Files using R
How does R enable automatic XML files validation and error log collection
Handling Big Data in R
A step by step guidance for handling large dataset 30GB in R
Textbook Resources for Data Science (Copyright Owned by the Authors)
Textbooks from Pubic Internet
Jupyter
Validation of XML Files using R
How does R enable automatic XML files validation and error log collection
Handling Big Data in R
A step by step guidance for handling large dataset 30GB in R
Sleep Quality Analysis using R
Follow Up previous Python project by re-writting in R.
Sleep Quality Analysis using Python
final project demo for python data analysis course fall 19 at UCLA.
Textbook Resources for Data Science (Copyright Owned by the Authors)
Textbooks from Pubic Internet
Excel
HopSkipDrive Driver Marketplace Analysis
A marketplace analysis for 27k data of suppliers & customers, including cohort analysis, concentration, take rate, conversation rate, power usrs etc. using Excel and R.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Tableau
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Demographic analysis
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Interactive Dashboard
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Demographic Analysis of People in City of Seattle
Tableau Desktop could be a powerful tool to study cencus statistically and display plots that demonstrate business insight and any other interesting findings.
Sports Analytics
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Presentation
UCLA Data Fest 2019 -- Sports Analytics for Athlete's Fatigue Levels
Effects of Acute and Chronic Fatigue on a Rugby Player’s Performance and Advice for Coaches.
Plotly
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
Modelling
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
Fraud Detection
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
Big Data
Zipline Unmmaned Aerial Vehicle Data Exploration & Analysis.
An unstructured, independent exploratory data analysis & visulization assignment using 450+ flight datasets csv files to discern details and find patterns, business insights, engineering risks or anomalies.
Supervise Learning
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Classfication
EY NextWave Data Science Challenge 2019
Local/Regional finalist, ranked top 10 in US, and regional finalist in China over 2936 participants.
Linux
SPE JupyterHub & Python on remote Linux/Unix servers
A Presenation to 45 related/interested fellows at Sony Pictures 19 summer -- Architecture for R , Python and Julia environments for Corporate Data Science Project Initiatives.
Jupyter Server
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
SPE JupyterHub & Python on remote Linux/Unix servers
A Presenation to 45 related/interested fellows at Sony Pictures 19 summer -- Architecture for R , Python and Julia environments for Corporate Data Science Project Initiatives.
Selenium
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
Numpy
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
Pandas
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
Text analytics
Web-browser Automation with Selenium
With Selenium, Python can be enabled to let users enter, search, scrape down and manipulate information from any source simply in one piece of scripts, with one click to run code and get your result.
Google Sheet
HopSkipDrive Driver Marketplace Analysis
A marketplace analysis for 27k data of suppliers & customers, including cohort analysis, concentration, take rate, conversation rate, power usrs etc. using Excel and R.
cohort analysis
HopSkipDrive Driver Marketplace Analysis
A marketplace analysis for 27k data of suppliers & customers, including cohort analysis, concentration, take rate, conversation rate, power usrs etc. using Excel and R.
marketplace
HopSkipDrive Driver Marketplace Analysis
A marketplace analysis for 27k data of suppliers & customers, including cohort analysis, concentration, take rate, conversation rate, power usrs etc. using Excel and R.
Matlab
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
SQL
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
Java
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
Latex
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
C++
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
Jupyter Lab
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
Interface
Jupyter Lab Customization with Python, Jave, C++, R, Matlab Environments and SQL, Diagram, Markdown Interface
A step-by-step guidance to customize your Jupyter project
NLP
Text Analytics and Neural Network
Predict the influential factors in college based on students' comments on college life.
Natural Language Processing for Entity(name, place, etc.) Extraction using R
Apply NLP techniques in R to Annotate people and places in text files and extract them into a clean table.
Text Analytics
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Natural Language Processing for Entity(name, place, etc.) Extraction using R
Apply NLP techniques in R to Annotate people and places in text files and extract them into a clean table.
Resume
Resume Workshop -- Data Science/Analytics
CSSA Panel.
Workshop
Resume Workshop -- Data Science/Analytics
CSSA Panel.
Panel Talk
Resume Workshop -- Data Science/Analytics
CSSA Panel.
Sleep Quality
Sleep Quality Analysis using R
Follow Up previous Python project by re-writting in R.
Sleep Quality Analysis using Python
final project demo for python data analysis course fall 19 at UCLA.
Text Analysis
Text Analytics and Neural Network
Predict the influential factors in college based on students' comments on college life.
Market Analysis
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Data Open
Citadel West Coast Regional Data Open 2020
Bikeshare Market Analysis in NYC at Citadel Data Open
Data Scien
Python String Pattern Process
Using Python to reshape String col. to long String, and conduct pattern match
Energy Consumption Calculation Based on Differnt Time and Price (Tarrif) Using Python
Python Calculation and Time Manipulation
Energy Consumption Calculation
Energy Consumption Calculation Based on Differnt Time and Price (Tarrif) Using Python
Python Calculation and Time Manipulation
Tiktok Trust and Safety
Python String Pattern Process
Using Python to reshape String col. to long String, and conduct pattern match
REGEX
Python String Pattern Process
Using Python to reshape String col. to long String, and conduct pattern match
AB Testing
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Experiment
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Analytics
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Product Feature
My Notes for Online Experiment -- A/B Testings
Practical Applications and Concepts of AB testings.
Cheat Sheet
TOP -- Data Science Knowledge Complete Summary
This is a continuously updated data-science concept post edit and organized by me to summarize and sort out cs, stats, machine learning and other related topics all in one from past learning.
Statistics
TOP -- Data Science Knowledge Complete Summary
This is a continuously updated data-science concept post edit and organized by me to summarize and sort out cs, stats, machine learning and other related topics all in one from past learning.