This is almost the 2nd year of my new job. Instead of teaching people R and machine learning algorithms, I started to build my own models and put them into production. However, I found it hard to revisit my old projects: different versions of data, different versions of models, and tons of PPT and […]
This project was inspired by a recent acquisition: Bass Pro’s move to acquire Cabela’s. I would like to look at the revenues and market share of Cabela’s and one of its competitors, Dick’s Sporting Goods, prior to the acquisition and see if there are any features/signals that can be seen in the last few months […]
I currently work as a data scientist at NYC Data Science Academy. Last night, we had a fantastic open house party to introduce our 12-Week Data Science bootcamp. It’s a good chance to learn what data science is and what you will learn in the bootcamp, and to make friends. To me, it’s a good chance to […]
It’s important to keep yourself up to date in the data science field. So I did some research and put together a book list on some topics I am interested in.
The Visual Display of Quantitative Information: I have bought the book already and need to dig into the topic. About the book: Theory and practice in the design of data graphics, 250 illustrations of the best (and a few of the worst) statistical graphics, with detailed analysis of how to display data for precise, effective, quick analysis.
Now You See It: Luckily, this book is here in my office! It may be good for business analyses.
Storytelling with Data: I got this book during my last job. It covers not just how to make pretty and efficient graphs, but also how to present the results of your data analyses. Since I want to develop a related course on data visualization and presentation, I may need to read this book in full.
Hands-On Programming with R: Very basic R. Since I’m teaching R at NYCDSA now, this book helps me a lot. I am planning to go over this book very quickly.
Why Airbnb? Visiting NYC? Airbnb is a good choice for booking unique accommodations. I have used Airbnb.com for almost 3 years; the website helps me spend my vacations like a local and gain some fantastic experiences! To better explore its rental listings across New York City, I designed this app to answer some questions: How many of the […]
Amy Ma January 30, 2016 Each movie has its own poster. Even in today’s always-online climate, the movie poster remains a powerful form of advertising. Every movie poster has its own color scheme, based on the movie’s type, content, and tone. The best movie posters should catch people’s eyes. So what kinds of colors are more […]
For those 4 classification problems, we followed the same process described in Part I. Instead of going over the details, we only show the model selection results for the four classification problems, as follows: Table 1. Model Selection Results by Data Sets Wdbc Ionosphere Hypothyroid Gradient Boosting(n.trees, depth, n.minobsinnode) 500,9,10 […]
Which supervised learning algorithm is the best? For people who are just starting their machine learning journey, this question always comes to mind. To answer this question, we used 5 different data sets (one for a regression problem, and the other 4 for binary classification problems) to test 6 supervised learning algorithms: SVMs (linear and […]
Introduction The purpose of this report is to identify the relationships in the supplier base at Johnson & Johnson, especially the business arrangements with minority- and women-owned companies. I chose the three pharmaceutical companies’ data, which contains 28 variables from eight departments’ purchase records (n=725,414). To prepare for the data analysis, this paper combined the […]
According to Glassdoor’s report, data scientists have the best jobs in the U.S. in 2016, with a median base salary of $116,840 (national average salary of $118,709) and 1,736 job openings. On the other hand, indeed.com says the average base salary of data scientists is $123,000. Which figure is more reliable? As a foreigner, […]