class: center, middle ## IMSE 586 ## Big Data Analytics and Visualization
### Course Introduction
### Instructor: Fred Feng --- class: split-60 .column[ ## About the instructor
Fred Feng .red[fredfeng]@umich.edu Assistant Professor @ IMSE Research group: [fenggroup.org](https://fenggroup.org/) ] .column[
![:scale 40%](./images/feng.png) ] --- class: center, middle # Course website .huge[
] --- # Some general study tips - Type along during the coding sessions - Learning by doing .center[![:scale 40%](./images/ridebike.jpg)] -- - Pause/replay the video as needed --- # Some general study tips - The homework are meant to be an essential part of your learning experience. Start early! --
- Ask for help - Email us
- Come to the Zoom office hours --
- Back up your work (homework, project) - [Dropbox at UM](https://its.umich.edu/communication/collaboration/dropbox) --- # Why programming in data analytics?
- Flexibility & customization - Reproducible research - Collaborative work - Version control --- # Why Python? .center[![:scale 60%](./images/python-logo.png)] - Open source, free, and widely available - Massive open-source community --- # Why Python? - ### Extensive libraries - [NumPy](https://numpy.org/) - [pandas](https://pandas.pydata.org/) - [Matplotlib](https://matplotlib.org/) - [Seaborn](https://seaborn.pydata.org/index.html) - [statsmodels](https://www.statsmodels.org/) - [scikit-learn](https://scikit-learn.org/) - and many more --- .center[![:scale 100%](./images/pythonvislandscape.png)] --- class: middle, center .center[![:scale 80%](./images/xkcd-python.png)]
Image source
--- # Why Python? - ### Beginner friendly - High-level programming language - Intuitive syntax --- # Why Python? - ### Widely popular - [IEEE: Top Programming Languages 2022](https://spectrum.ieee.org/top-programming-languages-2022) - [Widely used in industry](https://scikit-learn.org/stable/testimonials/testimonials.html) - [Job qualifications](https://www.google.com/search?q=data+science+job+posting) --- # JupyterLab To run Python, we will use a browser-based application called [JupyterLab](https://jupyter.org/). --- # Install Python / JupyterLab
We will use Anaconda, a common Python distribution bundled with other popular tools.
You can follow the installation instruction at
--- # Local recourses of data science
* [Michigan Institute for Data Science (MIDAS)](https://midas.umich.edu/) --- # Assignments - [Software installation](https://www.imse440.org/software-setup.html) - Watch & follow along the "Python basics" video - Read the syllabus in its entirety - Course survey (link on Canvas) - Suggested readings - [Big Data Analytics: What It Is, How It Works, Benefits, And Challenges](https://www.tableau.com/learn/articles/big-data-analytics) - [The JupyterLab Interface](https://jupyterlab.readthedocs.io/en/stable/user/interface.html) - [The scientific paper's obsolete-here's what's next](https://www.theatlantic.com/science/archive/2018/04/the-scientific-paper-is-obsolete/556676/)