Unique Microbial Superfamilies Among the 19 Microbial Genomes (Cele was not used in the comparisons and Ecol and Syne are not shown here, Aaeo and Tpal have no unique superfamilies) Aful AF1948 d1jiaa_ 1.103.1.2.4 ENZYME 1 0.0022 Snake phospholipase A2 (Probably False Pos, also in cele) Bbur BBA15 d1ospo_ 2.58.1.1.1 NONENZ 2 5e-88 Outer surface protein A (Real Pos, not in cele) BB0365 e1ice.1b 3.11.1.1.2 ENZYME 1 0.0089 Interleukin-1beta converting enzyme (Cys protease) (Probably False Pos, also in cele) BB0014 d2ech__ 7.19.1.1.1 NONENZ 1 0.0081 Echistatin (Probably False Pos, also in cele) Bsub NprE d1ezm_1 1.57.1.1.1 ENZYME 2 2e-47 Elastase (Real Pos, not in cele) PelB d1idk__ 2.62.1.2.1 ENZYME 2 1e-70 Pectin lyase A (Real Pos, not in cele) MtrB d1wapa_ 2.64.6.1.1 NONENZ 1 1e-32 Trp RNA-binding attenuation protein (TRAP) (Real Pos, not in cele) YuiE d1lcpa1 3.41.1.1.1 ENZYME 1 0.0069 Leucine aminopeptidase, N-terminal domain (Probably False Pos, only recognized by fasta) YrdF d1brsd_ 3.6.1.1.1 NONENZ 1 1.7e-14 Barstar (barnase inhibitor) (Real Pos, not in cele, but a reverse BLAST found matches with aaeo, syne, ecol, hinf...) Cpne AAD19183 d1toh__ 4.112.1.1.1 ENZYME 1 2.2e-12 Tyrosine hydroxylase domains (A good hit but also in cele, highest hit with a HUMAN prot, a HT!) AAD18679 d1kpta_ 4.37.1.1.1 NONENZ 1 0.00096 Virally encoded KP4 toxin (Real Pos, only in Chlamydia's!) AAD18204 e1hle.1b 5.2.1.1.1 NONENZ 1 0.0011 Elastase inhibitor (Probably False Pos, also in cele, reverse BLAST didn't result in any hits!) Ctra 3328868 d1ldl__ 7.11.1.1.1 NONENZ 1 0.0093 Ligand-binding domain of LDL receptor (Probably False Pos, also in cele) 3328728 d1ncfa3 7.24.1.1.1 NONENZ 1 0.0057 Tumor necrosis factor (TNF) receptor* (Probably False Pos, also in cele) Hinf HI1181 d1pgs_2 2.11.1.1.1 ENZYME 1 0.01 N-glycosidase F (PNGase F) (Probably False Pos, only recognized by fasta, reverse BLAST find sim with many Bacteria!) HI0348 d1prtd_ 2.29.2.1.7 NONENZ 1 0.0093 Pertussis toxin S4 subunit (Probably False Pos, reverse BLAST find sim with many Bacteria!) HI1478 d1bco_1 2.36.1.1.1 ENZYME 1 2e-29 Mu transposase, C-terminal domain (Real Pos, not in cele, reverse BLAST finds only Mu) Hpyl HP1397 d1vom_1 2.24.3.1.2 NONENZ 1 0.01 Myosin S1 fragment, N-terminal domain (Not in cele, reverse BLAST finds only in Streptomyces coelicolor, might be considered a Real Pos) HP1186 d1hcb__ 2.56.1.1.1 ENZYME 1 1e-32 Carbonic anhydrase (Not in cele, reverse BLAST finds a few Bacteria, not in our collection, Real Pos) Mgen MG280 e1avo.1a 1.24.8.1.1 NONENZ 1 0.009 Proteasome activator reg(alpha) (Real Pos, not in cele, reverse BLAST finds rat gene as closest homolog) MG371 d1a66a_ 2.2.4.1.2 NONENZ 1 0.002 Transcription factor NFATC, DNA-binding domain (Probably False Pos, also in cele, reverse BLAST shows strong sim with mpne, bsub) Mjan MJ1464 d1hula_ 1.26.1.2.3 NONENZ 1 0.0099 Interleukin-5 (Not in cele, reverse BLAST shows sim with other Arch, cele and plants, False Pos) Mpne 1673974 d1bhp__ 7.12.1.1.2 NONENZ 1 0.0079 beta-Purothionin (Not in cele, reverse BLAST shows sim with Arch and Bac ribosomal proteins, False Pos) Mthe MTH1626 d1cii_1 1.105.9.1.1 NONENZ 1 2e-22 Colicin Ia, N-terminal domain (Not in cele, reverse BLAST shows sim with other Arch, False Pos) MTH100 d1tpg_2 7.27.1.1.2 NONENZ 1 0.0061 Tissue-type plasminogen activator, t-PA (Not in cele, reverse BLAST shows no sim with anything, might be Real Pos) Mtub Rv1353c d2tct_2 1.94.1.1.1 NONENZ 1 8e-31 Tetracyclin repressor, C-terminal domain (Not in cele, reverse BLAST shows weak sim with ecol repressor, Real Pos) Rv1419 d1abrb2 2.31.2.1.2 NONENZ 1 3e-06 Plant cytotoxin B-chain (lectin) (also in cele, reverse BLAST shows closest homolog to a plant, only one other bacteria, Streptomyces, it is HT!) Rv1758 d1cex__ 3.14.7.1.1 ENZYME 7 3e-40 Cutinase (Not in cele, reverse BLAST:only in mtub, closest homolog in db in fungi, like Penicillium purporogenum!!!, Real Pos!) Rv0062 d1tml__ 3.2.1.1.1 ENZYME 1 5e-66 Cellulase E2 (Not in cele, reverse BLAST: with a few obscure organisms, none of our 20, Real Pos) Rv0316 d1mli__ 4.34.4.1.1 ENZYME 1 9e-07 Muconalactone isomerase (Not in cele, reverse BLAST: with a few obscure organisms, none of our 20, Real Pos) Rv1919c d1bv1__ 4.79.3.1.1 NONENZ 1 0.0018 Major birch pollen allergen Bet v 1 (Not in cele, reverse BLAST: weak sims with plant birch pollen allergens, Real Pos) Rv3644c d1agg__ 7.3.5.2.1 NONENZ 1 0.0048 omega-Agatoxin IV, IVa, IVb (Not in cele, reverse BLAST: sim with several DNA pol III in our bacteria, False Pos) Phor d1030939 d1ao6a2 1.97.1.1.1 NONENZ 1 0.01 Serum albumin (Not in cele, reverse BLAST: sim with other, obscure Arch, so this one is Real Pos) d1030451 d1ieab1 4.15.1.1.6 NONENZ 1 0.0062 MHC class II, N-terminal domain (Not in cele, reverse BLAST: nothing significant, False Pos) Rpro RP396 d3pcca_ 2.3.3.1.1 ENZYME 1 1e-06 Protocatechuate-3,4-dioxygenase, alpha chain (Not in cele, reverse BLAST:in Pseudomonas, another proteobacter, Real Pos) RP767 d1ppei_ 7.3.2.1.2 NONENZ 1 0.008 Trypsin inhibitor (Not in cele, reverse BLAST: nothing, False Pos) Scer YDL185W d1vdea3 4.55.2.2.1 NONENZ 4 2.9e-43 PI-SceI (Not in cele, strong hits, reverse BLAST: only with yeast proteins, Real Pos) YLR014C d1pyia2 7.32.1.1.2 NONENZ 47 1.5e-20 PPR1 (Not in cele, 47 copies in scer, reverse BLAST: confirms Real Pos) YHR053C d1aoo__ 7.38.1.1.5 NONENZ 2 2.8e-22 Metallothionein (Not in cele, 2 copies, reverse BLAST: lot of Metallothionein, snakes, etc. Real Pos) *Ctra 7.24.1 ---------FNCSLCLNGTVHLSCQEKQNTV-CTC-HAGFFLRENEC :.: : .:.: .: . .: .-: :-.:.:.. . : MEKKKQALCFSCPYCCDGNVAFSVIDLENCLACDCCEASFMFDAEMCDAI