Machine Learning
Understand algorithms like regression, classification, clustering, and how they power predictions and recommendations.
Box Plot of Wine Quality
Create a box plot showing alcohol distribution by wine quality.
Calculate Accuracy Score
Calculate accuracy for a wine quality classifier (good/bad).
Calculate Basic Statistics
Calculate and print basic statistics (mean, std, min, max) for numerical columns in the Iris dataset.
Calculate Correlation Matrix
Calculate and display the correlation matrix for the student performance dataset.
Calculate Mean Absolute Error
Calculate MAE for a linear regression model predicting house prices.
Calculate Price per Sqft
Add a new column 'price_per_sqft' to the house prices dataset and display the result.
Classification Report
Generate a classification report for the student activities classifier.
Classification Report
Generate a classification report for the Iris species prediction model.
Confusion Matrix
Create a confusion matrix for the Iris species prediction model.
Confusion Matrix Basics
Create a confusion matrix for the student extra activities classifier.
Count Missing Values
Count and print the number of missing values in each column of the Titanic dataset.
Count Unique Values
Count and print the number of unique values in each column of the Iris dataset.
Create a Bar Chart
Create a bar chart showing average price by number of bedrooms.
Create a Simple Scatter Plot
Create a scatter plot of size_sqft vs price from the house prices dataset.
Create Dummy Variables
Convert the 'Pclass' column in Titanic dataset to dummy variables and print the result.
Cross-Validation Score
Calculate and print the cross-validation score for a logistic regression model on the Titanic dataset.
Decision Tree Basics
Train a decision tree to predict if students did extra activities based on their scores.
Decision Tree Classifier
Train a decision tree classifier on the Iris dataset and print its depth.
Display Dataset Shape
Load the Titanic dataset and print its shape (rows, columns).
Encode Categorical Variable
Convert the 'Sex' column in Titanic dataset to numerical values (male=0, female=1) and print the result.
Feature Importance
Display feature importances from a random forest model trained on the Iris dataset.
Filter Data by Condition
Filter the Titanic dataset to show only female passengers and print the result.
Filter Records by Condition
Display houses with more than 3 bedrooms from the dataset.
Grid Search for Hyperparameter Tuning
Perform grid search to find the best K for KNN classifier on the Iris dataset.
Group and Aggregate Data
Calculate the average final score by extra_activities status from the student dataset.
Handle Missing Values
Fill missing age values in Titanic dataset with the mean age and print the updated Age column.
KNN Classifier Basics
Train a KNN classifier to predict wine quality (threshold: >=7 is good) using alcohol and sulphates.
Linear Regression
Train a linear regression model to predict diabetes progression and print the R2 score.
Load and Display Specific Columns
Load the house prices dataset and display only 'size_sqft' and 'price' columns.
Normalize Numerical Data
Normalize the 'Age' column in Titanic dataset using min-max scaling and print the result.
Plot a Histogram
Plot a histogram of final scores from the student dataset with 3 bins.
Plot Multiple Features
Create scatter plots of hours_studied vs final_score and previous_score vs final_score side by side.
Precision and Recall
Calculate precision and recall for the wine quality classifier.
Predict Iris Species
Train a KNN classifier to predict iris species and print the accuracy.
Predict Student Scores
Use linear regression to predict final_score based on hours_studied.
Random Forest Classifier
Train a random forest classifier on the Titanic dataset to predict survival.
Simple Linear Regression
Train a linear regression model to predict house price based on size.
Train a Logistic Regression Model
Train a logistic regression model to predict survival on the Titanic dataset.
Train-Test Split
Split the Iris dataset into 70% training and 30% test sets and print their shapes.
Train-Test Split Practice
Split the wine dataset into 80% train and 20% test sets and print their shapes.