{"id":81,"date":"2010-03-09T22:13:49","date_gmt":"2010-03-09T22:13:49","guid":{"rendered":"http:\/\/users.aber.ac.uk\/rkj\/test2\/?page_id=81"},"modified":"2012-08-02T10:32:41","modified_gmt":"2012-08-02T10:32:41","slug":"datasets","status":"publish","type":"page","link":"https:\/\/users.aber.ac.uk\/rkj\/site\/research\/datasets\/","title":{"rendered":"Datasets"},"content":{"rendered":"<p>\nThis is a collection of datasets used in some of my feature selection experimentation. Many of these datasets come from the <a href=\"http:\/\/www.ics.uci.edu\/~mlearn\/\">UCI  Machine Learning Repository<\/a>. The decision attribute is always the final column  in the dataset.\n<\/p>\n<p> <br \/>\n<b>Crisp Datasets<\/b><br \/>\nThese datasets contain discrete values only:<\/p>\n<p><a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/breastcancer.dat\">Breast cancer<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/corral.dat\">Corral<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/credit.dat\">Credit<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/derm.data\">Derm<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/derm2.data\">Derm2<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/dna.data\">DNA<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/exactly.dat\">Exactly<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/exactly2.dat\">Exactly2<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/heart.dat\">Heart<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/led.dat\"> LED<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/letters.dat\">Letters<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/lung.dat\">Lung<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/m-of-n.data\">M-OF-N<\/a><br \/>\n<a 
href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/monk3.data\">Monk3<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/mushroom.dat\">Mushroom<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/parity5+2.data\">Parity5+2<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/parity5+5.data\">Parity5+5<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/vote.dat\">Vote<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/webtest.dat\">Website classification<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/crisp\/wq.dat\">Discretized Water Treatment<\/a>\n<\/p>\n<p><b>Real-valued Datasets<\/b><br \/>\nThese datasets contain real-valued attributes:<\/p>\n<p><a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/abalone.dat\">Abalone<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/arrhythmia.dat\">Arrhythmia<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/cacoScaled.dat\">Caco<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/fruit.dat\">Fruit<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/fuzieee.dat\">FUZZIEEE example<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/glass.dat\">Glass<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/ion.dat\">Ionosphere<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/iris.dat\">Iris<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/isolet.dat\">Isolet<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/vehicle.dat\">Vehicle<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/water2.dat\">Water Treatment<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/wav.dat\">Waveform<\/a><br \/>\n<a href=\"real\/web.dat\">Website classification<\/a><br \/>\n<a 
href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/wine.dat\">Wine<\/a>\n<\/p>\n<p> <b>Fuzzification files<\/b> <\/p>\n<p>Note that these have not been optimized. For use in fuzzy-rough<br \/>\nattribute reduction (FRAR):<\/p>\n<p><a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/abalone.dat_f\">Abalone<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/arrhythmia.dat_f\">Arrhythmia<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/cacoScaled.dat_f\">Caco<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/fruit.dat_f\">Fruit<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/fuzieee.dat_f\">FUZZIEEE example<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/glass.dat_f\">Glass<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/ion.dat_f\">Ionosphere<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/iris.dat_f\">Iris<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/isolet.dat_f\">Isolet<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/vehicle.dat_f\">Vehicle<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/water2.dat_f\">Water treatment<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/wav.dat_f\">Waveform<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/web.dat_f\">Website classification<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/wine.dat_f\">Wine<\/a>\n<\/p>\n<p>\nA readme file containing some more description can be found <a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/readme.txt\">here<\/a>.\n<\/p>\n<p>\nAn example dataset and fuzzification for FRAR can be found below. 
The decision attribute is fuzzy.<\/p>\n<p><a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/rasmani.dat\">Example dataset<\/a><br \/>\n<a href=\"http:\/\/users.aber.ac.uk\/rkj\/datasets\/real\/rasmani.dat_f\">Dataset fuzzification<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is a collection of datasets used in some of my feature selection experimentation. Many of these datasets come from the UCI Machine Learning Repository. The decision attribute is always the final column in the dataset. Crisp Datasets These datasets &hellip;<\/p>\n<p class=\"read-more\"> <a class=\"more-link\" href=\"https:\/\/users.aber.ac.uk\/rkj\/site\/research\/datasets\/\"> <span class=\"screen-reader-text\">Datasets<\/span> Read More &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":66,"menu_order":0,"comment_status":"open","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-81","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/pages\/81","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/comments?post=81"}],"version-history":[{"count":1,"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/pages\/81\/revisions"}],"predecessor-version":[{"id":329,"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/pages\/81\/revisions\/329"}],"up":[{"embeddable":true,"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/pages\/66"}],"wp:attachment":[{"href":"https:\/\/users.aber.ac.uk\/rkj\/site\/wp-json\/wp\/v2\/media?parent=81"}],"curies":[{"name":"wp","hre
f":"https:\/\/api.w.org\/{rel}","templated":true}]}}