ProtoBug provides automatic hierarchical classification of protein sequences. The site allows users to study the clustering as well as its qualities.

The ProtoBug classification hierarchically partitions the protein space into clusters of similar proteins. The lower a cluster sits in its tree, the smaller it is and the more similar its proteins are to one another. Browsing the clustering hierarchy can provide insight into the function and structure of proteins.
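
To make the idea of such a hierarchy concrete, here is a minimal sketch of plain average-linkage (UPGMA) clustering on a toy distance table. It is an illustration only and not the ProtoBug pipeline: the protein names A to D, the distances, and the function name upgma are all hypothetical, and the memory-constrained aspects of MC-UPGMA are not modelled.

```python
from itertools import combinations


def upgma(names, dist):
    """Plain average-linkage (UPGMA) clustering on a small distance table.

    names: list of leaf labels (hypothetical protein identifiers)
    dist:  dict mapping frozenset({a, b}) to the distance between leaves a and b
    Returns a list of (sorted member list, merge height) in merge order.
    """
    # Every protein starts as its own singleton cluster.
    clusters = [frozenset([n]) for n in names]
    merges = []

    def average_distance(c1, c2):
        # UPGMA linkage: mean of all leaf-to-leaf distances between the clusters.
        total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Pick the closest pair of clusters and merge them.
        c1, c2 = min(combinations(clusters, 2),
                     key=lambda pair: average_distance(*pair))
        height = average_distance(c1, c2)
        merged = c1 | c2
        clusters = [c for c in clusters if c not in (c1, c2)] + [merged]
        merges.append((sorted(merged), height))
    return merges


# Toy data (made up for illustration): two tight pairs that are far apart.
names = ["A", "B", "C", "D"]
dist = {
    frozenset(("A", "B")): 1.0,
    frozenset(("C", "D")): 2.0,
    frozenset(("A", "C")): 6.0,
    frozenset(("A", "D")): 6.0,
    frozenset(("B", "C")): 6.0,
    frozenset(("B", "D")): 6.0,
}

merges = upgma(names, dist)
for members, height in merges:
    print(members, height)
```

In this toy run the two-member clusters form at low heights and the cluster containing all four proteins forms last, which mirrors the statement that clusters lower in the tree are smaller and more homogeneous.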

In addition, the clustering hierarchy induces a novel distance measure between proteins and clusters, called ProtoDistance.
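
As a rough illustration of how a hierarchy can induce a distance, the sketch below uses the generic tree-based (cophenetic) distance: the height of the smallest cluster that contains both proteins. This is an assumption chosen for illustration; the actual definition of ProtoDistance is not given here and may differ. The tree, its heights, and the function names are hypothetical.

```python
def leaves(node):
    """Return the set of leaf labels under a node.

    A leaf is a string; an internal node is a (height, children) tuple.
    """
    if isinstance(node, str):
        return {node}
    _, children = node
    result = set()
    for child in children:
        result |= leaves(child)
    return result


def tree_distance(tree, a, b):
    """Height of the smallest cluster in `tree` that contains both a and b.

    Generic cophenetic distance, used here only as a stand-in example of a
    hierarchy-induced distance (not the ProtoDistance definition).
    """
    members = leaves(tree)
    if a not in members or b not in members:
        raise KeyError("both labels must be leaves of the tree")
    if a == b:
        return 0.0
    node = tree
    while True:
        height, children = node
        # Descend into a child that still contains both leaves, if any.
        for child in children:
            child_leaves = leaves(child)
            if a in child_leaves and b in child_leaves:
                node = child
                break
        else:
            # No single child holds both, so this node is the smallest cluster.
            return height


# Hypothetical toy hierarchy: (merge height, [children]).
tree = (6.0, [(1.0, ["A", "B"]), (2.0, ["C", "D"])])

print(tree_distance(tree, "A", "B"))
print(tree_distance(tree, "A", "C"))
```

Under this stand-in definition, proteins that share a low cluster are close, while proteins that only meet near the root are far apart.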

The clustering source code is available here:
MC-UPGMA source code