JMichael Blog

Thinking will not overcome fear but action will.

Data Science Concepts and Notes (WIP)

Data Science, Statistics and Data Mining Notes (UCLA CS145)

Data Science: a multi-disciplinary field that brings together concepts from computer science, statistics/machine learning, and data analysis to understand and extract insights from the ever-increasin...

TOP -- Data Science Knowledge Complete Summary

This is a continuously updated data science concept post, edited and organized by me, that summarizes and sorts out CS, statistics, machine learning, and other related topics from past learning in one place.

Data Science: a multi-disciplinary field that brings together concepts from computer science, statistics/machine learning, and data analysis to understand and extract insights from the ever-increasin...

My Notes for Online Experiments -- A/B Testing

Practical applications and concepts of A/B testing.

AB Testing Workflow: know the business goal; define goal metrics; unit of diversion (randomization unit) - views, users, or cookies; population - to run experiment only on the population t...
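
As a rough illustration of the "unit of diversion" idea, the sketch below assigns each unit (for example a user ID or cookie ID) to a group by hashing it, so the same unit always lands in the same group. The function name `assign_bucket`, the experiment label, and the 50/50 split are assumptions made for this example and are not taken from the post.

```python
import hashlib


def assign_bucket(unit_id: str, experiment: str, treatment_share: float = 0.5) -> str:
    """Deterministically map a diversion unit to 'treatment' or 'control'.

    Hypothetical helper: the name, the salt format, and the default split
    are illustrative assumptions, not the post's method.
    """
    # Salt the hash with the experiment name so different experiments
    # split the same units independently of each other.
    digest = hashlib.sha256(f"{experiment}:{unit_id}".encode("utf-8")).hexdigest()
    # Interpret the first 8 hex characters as an integer in [0, 2**32)
    # and scale it to a position in [0, 1).
    position = int(digest[:8], 16) / 2**32
    return "treatment" if position < treatment_share else "control"


if __name__ == "__main__":
    for uid in ["user-1", "user-2", "user-3"]:
        print(uid, assign_bucket(uid, "homepage-test"))
```

Hashing instead of drawing a random number at request time keeps the assignment stable across visits, which matters when the unit of diversion is a user or a cookie rather than a single page view.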

Python String Pattern Process

Using Python to reshape a string column into one long string and run pattern matching

Project Overview: We have a list of keywords in an Excel sheet. We should check all the video titles and see if the keywords ever appear in the titles. To enable this, we load the keywords fr...
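
A minimal sketch of the keyword check described above, assuming the keywords and titles are already available as plain Python lists (loading them from the Excel sheet is left out). The function name `find_keywords` and the sample data are made up for illustration, and the matching rule (case-insensitive, whole words only) is an assumption that may differ from the post.

```python
import re


def find_keywords(titles, keywords):
    """Return {title: [keywords that appear in it]}.

    Hypothetical helper: matching is case-insensitive and whole-word,
    which is an assumption for this sketch.
    """
    # Lookarounds instead of \b so keywords that start or end with a
    # non-word character (for example "c++") still match correctly.
    patterns = {
        kw: re.compile(r"(?<!\w)" + re.escape(kw) + r"(?!\w)", re.IGNORECASE)
        for kw in keywords
    }
    return {
        title: [kw for kw, pattern in patterns.items() if pattern.search(title)]
        for title in titles
    }


if __name__ == "__main__":
    sample_titles = ["Learn Python in 10 Minutes", "Cooking pasta at home"]
    sample_keywords = ["python", "pasta", "java"]
    for title, hits in find_keywords(sample_titles, sample_keywords).items():
        print(title, "->", hits)
```

Compiling each pattern once and reusing it across all titles avoids rebuilding the regular expression for every title and keyword pair.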

Energy Consumption Calculation Based on Different Time and Price (Tariff) Using Python

Python Calculation and Time Manipulation

Project Overview. Objective: Your assignment is to clean data obtained from an electrical meter and calculate the total of a utility bill. Tariff Description: There are three components for this spec...
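
To make the idea of pricing consumption by time of day concrete, here is a small sketch of a time-of-use energy charge. It covers only a time-varying charge; the actual tariff components are not shown in the excerpt above, so the peak window, the rates, and the name `energy_charge` are all hypothetical values chosen for this example.

```python
from datetime import datetime

# Hypothetical tariff: these hours and rates are illustrative assumptions,
# not the tariff described in the post.
PEAK_HOURS = range(16, 21)                 # 16:00 through 20:59
RATES = {"peak": 0.30, "off_peak": 0.10}   # price per kWh


def energy_charge(readings):
    """Sum the cost of (timestamp, kWh) meter readings under the rates above."""
    total = 0.0
    for timestamp, kwh in readings:
        # Pick the rate from the hour in which the reading was recorded.
        period = "peak" if timestamp.hour in PEAK_HOURS else "off_peak"
        total += kwh * RATES[period]
    return total


if __name__ == "__main__":
    sample = [
        (datetime(2020, 1, 1, 17, 0), 2.0),  # falls in the peak window
        (datetime(2020, 1, 1, 3, 0), 5.0),   # falls in the off-peak window
    ]
    print(round(energy_charge(sample), 2))
```

Keeping the rate table separate from the calculation makes it easier to swap in a real tariff once its components are known.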

Validation of XML Files using R

How R enables automatic XML file validation and error log collection

Introduction: R can be a useful language not only for statistical analysis but also for completing functional jobs quickly with little human intervention. The XML validation is a pro...

Handling Big Data in R

A step-by-step guide to handling a large (30GB) dataset in R

Introduction: Very few posts on the internet systematically discuss how to handle relatively large datasets in R, especially when the data size goes even larger, i.e. 32GB with 5...

Citadel West Coast Regional Data Open 2020

Bikeshare Market Analysis in NYC at Citadel Data Open

Overview: This is the project we worked on at the Citadel West Coast Regional Data Open 2020. The topic is Exploring the Market for Bikeshare in NY. The problems we are trying to solve are whic...

Text Analytics and Neural Network

Predict the influential factors in college based on students' comments on college life.

Sleep Quality Analysis using R

Follow-up to the previous Python project, rewritten in R.

Background Information 1.1 Data source: National Sleep Foundation (www.sleepfoundation.org), 2015 Sleep in America Poll, whose answers we assume reflect the true situation. 1.2 Data Collection...

Sleep Quality Analysis using Python

Final project demo for the Python data analysis course, Fall 2019, at UCLA.

1. Background Information 1.1 Data source: National Sleep Foundation (www.sleepfoundation.org), 2015 Sleep in America Poll. 1.2 Data Collection & General Information: The National Sleep Found...

Resume Workshop -- Data Science/Analytics

CSSA Panel.

Structure. Name / Contact: Email, Phone No., (Address,) LinkedIn, Website, GitHub, Projects. Education: School, Major/Minor, GPA, Duration, Related Coursework, (School Involvements). Skil...