ProtoBug provides automatic hierarchical classification of protein sequences.
The site allows users to study the clustering as well as its qualities.
The ProtoBug classification hierarchically partitions the protein space into clusters of similar proteins.
The lower a cluster is situated in its tree, the smaller it is and the more similar are its proteins to each other.
Browsing the clustering hierarchy can provide insight as to function and structure of proteins.
In addition, the clustering hierarchy induces a novel distance measure between proteins and clusters -
We will be glad to receive any comments / remarks / suggestions by using the Feedback option, or by mailing directly to firstname.lastname@example.org
The clustering source code is available here:
MC-UPGMA source code