mirror of
https://github.com/thisisjaymehta/CAP-Dataset-CSV-Converter.git
synced 2026-06-08 07:09:26 +00:00
Convert CAP Dataset from EDF to CSV and reduce its size to 10% of original data
- Python 100%
| .gitignore | ||
| files.py | ||
| LICENSE | ||
| minifier.py | ||
| README.md | ||
| requirements.txt | ||
CAP-Dataset-CSV-Converter
Convert CAP Dataset from EDF to CSV and reduce its size to 10% of original data
Steps:
- Download dataset from https://physionet.org/content/capslpdb/1.0.0/ ( ~ 40.1 GB ) .
- Extract ZIP
- Put
minifier.py,files.pyandrequirements.txtin dataset folder (inside/cap-sleep-dataset-1.0.0/) - Install requirements (
pip install -r requirements.txt) - Run
minifier.py - Wait for 1000 years.
Notes:
- Make sure your free diskspace is more than 100GB
- Edit
files.pyif you want to work with only part of dataset. By default it will convert all edf files - It will take looong time to process. So if you decide to leave computer running, make sure your computer don't go to sleep automatically after some time.
- This code will strip csv to 10% of original edf data. For eg,
brux1.edfconverted tobrux1.csvhave 7342592 rows. But I minify data to include first 734259 rows only.