DNA Barcodes classification with supervised machine learning techniques
DNA Barcode sequence classification can be framed as a supervised machine learning problem as follows:
given a reference library of DNA Barcode specimen sequences from known species and a collection of unknown DNA Barcode sequences
(the query set), assign each query sequence to one of the species present in the library. This problem can be solved with the software
procedure explained in the tutorial and in the paper "Supervised DNA Barcodes Species Classification: Analysis, Comparisons and Results",
BioData Mining (under revision).
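To make the formulation above concrete, here is a minimal Python sketch of how an aligned reference library in FASTA format might be encoded as a WEKA ARFF dataset, with one nominal attribute per sequence position and the species as the class attribute. It is not the Fasta2Weka converter itself: the function names `read_fasta` and `fasta_to_arff`, the attribute names `pos1`, `pos2`, ..., the `gap` symbol, and the header convention `specimen_id|species` are all assumptions made for illustration, and the sequences are assumed to be aligned to equal length.

```python
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def fasta_to_arff(lines: Iterable[str], relation: str = "barcodes") -> str:
    """Encode aligned, labelled barcode sequences as ARFF text."""
    records = list(read_fasta(lines))
    if not records:
        raise ValueError("no sequences found")
    length = len(records[0][1])
    if any(len(seq) != length for _, seq in records):
        raise ValueError("sequences must be aligned to the same length")

    # Assumption: the species name is the last '|'-separated header field.
    # Spaces are replaced so the label is a plain ARFF nominal value.
    labelled = [
        (header.split("|")[-1].strip().replace(" ", "_"), seq)
        for header, seq in records
    ]
    species = sorted({label for label, _ in labelled})

    out = [f"@RELATION {relation}", ""]
    for i in range(1, length + 1):
        out.append(f"@ATTRIBUTE pos{i} {{A,C,G,T,N,gap}}")
    out.append("@ATTRIBUTE species {" + ",".join(species) + "}")
    out += ["", "@DATA"]
    for label, seq in labelled:
        # Alignment gaps become 'gap'; any other non-ACGT symbol becomes 'N'.
        symbols = [
            c if c in "ACGT" else ("gap" if c == "-" else "N") for c in seq
        ]
        out.append(",".join(symbols + [label]))
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    # Made-up example records, not taken from the sample datasets.
    sample = [">s1|species A", "ACGT", ">s2|species B", "AC-T"]
    print(fasta_to_arff(sample))
```

A query set could be encoded in the same way, with `?` (the ARFF missing-value marker) in place of the species label, so that a classifier trained on the reference library can predict it.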
Download the FASTA to WEKA converter below.
FASTA TO WEKA CONVERTER, TUTORIAL, AND PRESENTATION
| Fasta2Weka.zip |
| SupervisedMLBarcodes.ppt |
| Tutorial.pdf |
Sample datasets
| Cypraeidae.zip |
| Drosophila.zip |
| Inga.zip |
| Simulated.zip |
| aQuickTest.zip |
| algae.zip |
| bats.zip |
| birds.zip |
| fishes.zip |
| fungi.zip |
