Data are sorted by difference (HSG-LSG), such that negative values represent a decrease in percentage under HSG conditions. The random forest ROC area under the curve for phyla = 0.884. Source data is listed in S2 Table.