⚡ ML Preprocessing CLI

A beginner-friendly Command-Line Interface (CLI) tool to automate essential data preprocessing tasks in machine learning workflows — from missing value handling to encoding and feature scaling.

🎯 Features

📊 Dataset summary and null value statistics
🧹 Missing value handling (drop or impute)
🧠 Categorical encoding (Label Encoding, One-Hot Encoding)
📏 Feature scaling (Standard or Min-Max)
📥 Export cleaned dataset to CSV
🧪 Interactive terminal menu — no notebook required!

🧰 Installation

Make sure you have Python 3.10+ installed.

git clone https://github.com/mohakamitpatel/ML_Preprocessing_CLI.git
cd ML_Preprocessing_CLI
pip install -r requirements.txt

🗂️ Project Structure

📂 ML_Preprocessing_CLI
├── data/ → Sample datasets (optional)
├── data_input.py → Reads the dataset
├── data_description.py → Summarizes the dataset
├── imputation.py → Missing value imputation
├── categorical.py → Encoding categorical features
├── feature_scaling.py → Feature scaling logic
├── download.py → Saves the cleaned dataset
├── main.py → CLI entry-point
└── requirements.txt → Required Python libraries

💻 How to Use

Run the CLI from the terminal:

python main.py

You will be guided through a step-by-step menu to:

Upload dataset
Handle missing values
Encode categorical variables
Scale features
Download processed file

🔍 Example Usage

$ python main.py
📁 Please enter the path to your CSV file: data/sample.csv

✅ Dataset Loaded!
Choose an option:
[1] Describe Data
[2] Handle Missing Values
[3] Encode Categorical Features
[4] Scale Features
[5] Export Cleaned Data

📦 Dependencies

pandas
numpy
scikit-learn
tabulate
termcolor

Install via:

pip install -r requirements.txt

🎥 Demo Preview

💡 Future Ideas

Add support for command-line arguments
Integrate GUI using Tkinter or Streamlit
Auto EDA with visualizations
Upload directly from URL or cloud storage

🤝 Contributing

Pull requests are welcome!
If you have suggestions or find bugs, feel free to fork and improve.

✌️ Peace out. Built with code, not caffeine — Mohak

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

⚡ ML Preprocessing CLI

🎯 Features

🧰 Installation

🗂️ Project Structure

💻 How to Use

🔍 Example Usage

📦 Dependencies

🎥 Demo Preview

💡 Future Ideas

🤝 Contributing

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
Image.png		Image.png
README.md		README.md
categorical.py		categorical.py
data_description.py		data_description.py
data_input.py		data_input.py
download.py		download.py
feature_scaling.py		feature_scaling.py
imputation.py		imputation.py
main.py		main.py
requirements.txt		requirements.txt

mohakamitpatel/ML_Preprocessing_CLI

Folders and files

Latest commit

History

Repository files navigation

⚡ ML Preprocessing CLI

🎯 Features

🧰 Installation

🗂️ Project Structure

💻 How to Use

🔍 Example Usage

📦 Dependencies

🎥 Demo Preview

💡 Future Ideas

🤝 Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages