Click here to Skip to main content
15,885,216 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more: , +
Hello everyone
I'm still working on an idea for graduation project this year
I've decided to implement 3 clustring algorithms in c# to compare their performance in clustring but still don't know what to cluster
I want something like Fisher Iris (the same difficulty level)because the concentration point of the project is in the comparison of the algorithms

Thanks to all
Posted

1 solution

Why not use the Fisher Iris dataset ? The question I would ask would be if that rather simple dataset really would "exercise" the algorithms you compare in ways that brought out their relative performance, and capabilities.

If I were a member of the academic staff evaluating your research (unlucky you !), I would be looking at the qualitative research you did to select the types of dataset(s) you worked with as being extremely important.

I would expect you to "defend" your choices by speaking knowledgeably about the history of the algorithms you test, about their known best- and worst- performance cases, and giving a solid explanation of how the datasets you chose are useful to compare clustering performance.

And, I'd be asking you questions like: "given a very large dataset to analyze with many possible factors: describe your choice of algorithms by cost($, CPU time), and demands for hardware resources (memory, data storage, bandwidth)."

You can find all kinds of datasets of every type on the web with just a little searching.

Here's a bunch: [^].
 
Share this answer
 
Comments
Mujeeba Haj Najeeb 19-Dec-14 14:51pm    
ok I'll see my supervisor and talk with him about what you said
thanks

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900