ML Codes Exercises

This is my spot for sharing my Jupyter notebooks, a bunch of examples, and a variety of data sets. Each notebook includes notes that will help you understand the concepts better.

NOTE: You can find the data sets in the sections below.


This code contains a simple dataset representing gene expression levels.

Gene expression levels are calculated as the ratio between the expression of the target gene (i.e., the gene of interest) and the expression of one or more reference genes (often housekeeping genes).
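
As a rough illustration of the ratio described above, here is a minimal sketch. The function name `relative_expression` and the input values are hypothetical, and combining several reference genes with a geometric mean is an assumption made for this example, not something the notebooks prescribe.

```python
from statistics import geometric_mean


def relative_expression(target, references):
    """Return the target gene expression divided by the combined reference expression.

    Assumption: when several reference genes are given, they are combined
    with a geometric mean. Other ways of combining them are possible.
    """
    if not references:
        raise ValueError("at least one reference gene is required")
    return target / geometric_mean(references)


# Hypothetical expression values, for illustration only.
ratio = relative_expression(target=8.0, references=[2.0, 8.0])
print(ratio)  # roughly 2.0, since the geometric mean of 2 and 8 is 4
```

With a single reference gene, the geometric mean is just that gene's value, so the function reduces to a plain ratio.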

This dataset is synthetic and specifically designed to show some of the strengths and limitations of using KNN (k-nearest neighbors) for classification.
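
To make the idea behind KNN concrete, here is a minimal from-scratch sketch using only the standard library. It assumes two numeric features per sample and uses made-up points, not the repository's dataset. The notebook itself may well rely on a library implementation instead.

```python
from collections import Counter
from math import dist


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest neighbours."""
    # Sort the training samples by Euclidean distance to the query point.
    neighbours = sorted(
        zip(train_points, train_labels),
        key=lambda pair: dist(pair[0], query),
    )
    # Count the labels of the k closest samples and return the most common one.
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]


# Hypothetical expression values for two genes (illustration only).
points = [(1.0, 1.2), (1.5, 0.8), (0.9, 1.0), (4.0, 4.2), (4.5, 3.9), (3.8, 4.1)]
labels = [0, 0, 0, 1, 1, 1]

print(knn_predict(points, labels, query=(1.1, 1.0)))  # query close to the first cluster
print(knn_predict(points, labels, query=(4.2, 4.0)))  # query close to the second cluster
```

Because the prediction depends directly on distances, the choice of `k` and the scale of each feature both affect the result, which is one of the limitations worth exploring.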



This code contains data that simulates a medical study in which mice infected with a virus were given various doses of two medicines and then checked two weeks later to see if they were still infected.
Given this data, our goal is to create a classification model that predicts (given two dosage measurements) whether a mouse will still be infected with the virus.
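
The shape of this task (two dosage values in, one infected/not-infected label out) can be sketched as follows. The text does not say which model the notebook uses, so a simple nearest-centroid classifier stands in here purely for illustration. The sample values, and the assumption that higher doses clear the infection, are placeholders rather than facts about the dataset.

```python
from math import dist


def fit_centroids(samples, labels):
    """Compute the mean (dose_a, dose_b) point for each class label."""
    sums = {}
    for (dose_a, dose_b), label in zip(samples, labels):
        total_a, total_b, count = sums.get(label, (0.0, 0.0, 0))
        sums[label] = (total_a + dose_a, total_b + dose_b, count + 1)
    return {label: (a / n, b / n) for label, (a, b, n) in sums.items()}


def predict(centroids, dosage):
    """Return the label whose centroid is closest to the given dosage pair."""
    return min(centroids, key=lambda label: dist(centroids[label], dosage))


# Placeholder data: (dose of medicine A, dose of medicine B) and whether
# the mouse was still infected two weeks later.
samples = [(1.0, 2.0), (2.0, 1.0), (2.0, 2.0), (8.0, 9.0), (9.0, 8.0), (8.0, 8.0)]
still_infected = [True, True, True, False, False, False]

centroids = fit_centroids(samples, still_infected)
print(predict(centroids, (1.5, 1.5)))  # low doses, close to the "still infected" group
print(predict(centroids, (8.5, 9.0)))  # high doses, close to the "not infected" group
```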





This code contains a dataset that represents penguin species.

The data set contains the following features, which help us classify penguins:
* species: penguin species (Chinstrap, Adélie, or Gentoo)
* culmen_length_mm: culmen length (mm)
* culmen_depth_mm: culmen depth (mm)
* flipper_length_mm: flipper length (mm)
* body_mass_g: body mass (g)
* island: island name (Dream, Torgersen, or Biscoe) in the Palmer Archipelago (Antarctica)
* sex: penguin sex
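
As a sketch of how the columns listed above might be prepared for a classifier, the snippet below reads CSV text, converts the four measurements to numbers, and separates the `species` label from the numeric features. The helper name `load_penguins`, the column order, and the rows are placeholders for illustration. The actual file layout and the handling of missing values are assumptions.

```python
import csv
import io

NUMERIC_COLUMNS = ["culmen_length_mm", "culmen_depth_mm", "flipper_length_mm", "body_mass_g"]


def load_penguins(csv_file):
    """Read penguin rows, convert measurements to float, and skip incomplete rows."""
    rows = []
    for row in csv.DictReader(csv_file):
        # Skip any row that has a missing or empty field (an assumed cleaning rule).
        if any(not value or not value.strip() for value in row.values()):
            continue
        for column in NUMERIC_COLUMNS:
            row[column] = float(row[column])
        rows.append(row)
    return rows


# Placeholder rows, not taken from the real data set.
sample_csv = io.StringIO(
    "species,island,culmen_length_mm,culmen_depth_mm,flipper_length_mm,body_mass_g,sex\n"
    "Adelie,Torgersen,39.0,18.5,181,3750,MALE\n"
    "Gentoo,Biscoe,,,,,\n"
    "Chinstrap,Dream,46.5,17.9,192,3500,FEMALE\n"
)

penguins = load_penguins(sample_csv)
features = [[p[column] for column in NUMERIC_COLUMNS] for p in penguins]  # model inputs
targets = [p["species"] for p in penguins]  # label to predict
print(targets)
```

The categorical columns (`island`, `sex`) are left as strings here. They would need to be encoded as numbers before being passed to most classifiers.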





More To Come ...