Skip navigation links

click here. 

The optimal (1-st) and 2-nd, 5-th, and 14-th suboptimal structures all have the same tree list representation (shown in the bottom red box).  As a result, they are clustered in one taxonomy tree node with a heuristic distance measure of 0.  On the other hand, they do differ from each other, having unique stems, which is clearly visible in the stem trace of the same data (green, vertical, dashed lines intersect all the component stems of individual structures belonging to this taxonomy tree cluster). 
The subtree of the 7-th, 11-th, and 17-th suboptimals shows a distance of 3 among them, which is due to different arrangements of Bulge (B) and Internal (I) loops in a the first sublist of their tree list representations (shown in the top red box).  In addition to these differences, the structures from this subtree also differ in their component stems (yellow, vertical, dashed lines intersect all the component stems of individual structures belonging to this taxonomy tree cluster). 

The relative differences between the two highligthed clusters of structures are clearly visible in the taxonomy tree's distance of 80 between them. They also stand out in the stem trace representation of the solution space. In general, however, the Taxonomy Tree algorithm can be far more sensitive to structureal differences than the Stem Trace depictions.

Stem Histogram

This figure is of a stem histogram that corresponds to the m2 taxonomy trees and the stem trace shown above.  It is displaying the top 50 suboptimal structures shown in the the taxonomy trees.  In this representation long linear sets of diagonals correspond to non-branching segments in the tree list representations. In contrast to a stem trace representation, the clustering of similar structures is not evident.

For more information on stem histograms click here.