Jump to content

SoK/2024/StatusReport/Israel Galadima

From KDE Community Wiki

Download/Import of datasets from kaggle.com

Project Abstract

LabPlot is a data visualization and analysis software. For testing and study purposes, LabPlot provides easy access to multiple collections of publicly available datasets. In addition to the already available collections, the support for https://www.kaggle.com is desired which seems to be the central place nowadays for datasets used in the data science community. The purpose of this project is to extend LabPlot to download and import datasets from https://www.kaggle.com.

Weekly Progress

  1. Week 1 - 5
    1. Setup development environment
    2. Submitted a merge request [1] to introduce a small delay before reacting to the user typing in the datasets search box. This was to reduce the frequency of requests made to https://www.kaggle.com.
    3. Submitted a merge request [2] to introduce a small delay before reacting to the user typing in line edits in various dock widgets. This was to reduce the frequency of entries added to the application undo stack.
    4. Submitted a draft merge request [3] to refactor ImportDatasetWidget to accommodate new logic for downloading and importing datasets from https://www.kaggle.com.
  2. Week 6 - 10
    1. Pivoted to creating a separate widget and dialog for downloading and importing datasets from https://www.kaggle.com. Closed previous draft merge request [4].
    2. Submitted merge requests [5] [6] to fix issues in ImportDatasetDialog.
    3. Submitted draft merge request [7] to add ImportKaggleDialog and ImportKaggleWidget for downloading and importing datasets from https://www.kaggle.com.
    4. Updated draft merge request [8] to include line edit for collecting path to kaggle cli tool and added message box to warn the user when kaggle cli tool couldn't be run.
  3. Week 10 - 12
    1. Requested additional weeks to get new feedback from my mentor and do the final polishing of the new dialog. Final merge request [9]

Blog Posts

  1. Blog Post 01 [10]