class: center, middle ## IMSE 586 ## Big Data Analytics and Visualization
### Logistic regression
### Instructor: Fred Feng --- class: middle, center #Classification data:image/s3,"s3://crabby-images/14702/14702bff97ec494f2a39fa9c340ce2c7a33da981" alt=":scale 39.6%" data:image/s3,"s3://crabby-images/89d75/89d75076edd45f758e04d901c21bacc2b885777b" alt=":scale 40%" --- # Will a credit card customer default? ``` import pandas as pd df = pd.read_csv('./data/default.csv') df.head() ``` .center[data:image/s3,"s3://crabby-images/d62f0/d62f0df729a3b4d44f5b243f098b91cabee151de" alt=":scale 70%"] --- $$\text{default_binary}= \begin{cases} 1, & \text{if } \text{default = Yes;} \\\ 0, & \text{if } \text{default = No.} \end{cases} $$ -- ``` ( so.Plot(df, x='balance', y='default_binary') .add(so.Dot()) ) ``` .center[data:image/s3,"s3://crabby-images/09eab/09eab8d16cb18125e9d3b7f59c9175434f35a837" alt=":scale 100%"] --- # Logistic regression $$p(x)=\frac{e^{\beta_0+\beta_1x}}{1+e^{\beta_0+\beta_1x}}$$ p(x): probability of default given that the balance is x. --
.center[data:image/s3,"s3://crabby-images/0a464/0a464ae805f0faedf4373f2d917a029cbd6a2420" alt=":scale 100%"] --- # Logistic regression $$ \begin{aligned} p(x)&=\frac{e^{\beta_0+\beta_1x}}{1+e^{\beta_0+\beta_1x}} \\\ \\\ \frac{p(x)}{1-p(x)}&=e^{\beta_0+\beta_1x} \\\ \\\ \ln{\frac{p(x)}{1-p(x)}}&=\beta_0+\beta_1x \\\ \end{aligned} $$ The log-odds (or logit) is a linear function of x. --- # Interpretation of the slope parameter $$\frac{p(x)}{1-p(x)}=e^{\beta_0+\beta_1x}$$ When we increase x by 1, the odds $$ \small \begin{aligned} \frac{p(x+1)}{1-p(x+1)}=e^{\beta_0+\beta_1(x+1)} &=e^{\beta_1}e^{\beta_0+\beta_1x} =e^{\beta_1}\frac{p(x)}{1-p(x)} \end{aligned} $$ increase by a ratio of $$e^{\beta_1}$$ --- $$\ln{\frac{p(x)}{1-p(x)}}=\beta_0+\beta_1x$$ When we increase x by 1, the log-odds $$ \small \begin{aligned} \ln{\frac{p(x+1)}{1-p(x+1)}}&=\beta_0+\beta_1(x+1) \\\ &=\beta_0+\beta_1x+\beta_1\\\ &=\ln{\frac{p(x)}{1-p(x)}}+\beta_1 \end{aligned} $$ increase by $$\beta_1$$