BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (281 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ...   221   2e-56
UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobac...    89   2e-16
UniRef50_Q5Z1Z3 Putative phage recombinase n=1 Tax=Nocardia farc...    45   0.002
UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcu...    44   0.009
UniRef50_UPI0001AF6E76 phage excisionase n=1 Tax=Mycobacterium k...    40   0.097

>UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ8_BPMT4
          Length = 259

 Score = 221 bits (564), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 157/294 (53%), Positives = 183/294 (62%), Gaps = 54/294 (18%) Query: 1 MSTHKT-----PRESATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPL 55 MS H T P+E+ATRFF WL A T+ASILGNVTHA+L + + + AAA + Sbjct: 1 MSNHTTTSESSPQEAATRFFWAWLIAATAASILGNVTHAVLGAASSPLIAAAAAIVPP-V 59 Query: 56 ALLGATHGVHKLVQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAF 115 LLGATHGVH LV+SRI+G AY A+L I VA+A AF LSF ALRELA+V G+ P+IA+ Sbjct: 60 VLLGATHGVHALVRSRIVGAAYRAALTIVVALAVCAFVLSFEALRELAIVHAGMRPSIAW 119 Query: 116 LVPLVIDLSITGSTIALLALSSAERA----EVQH-DAQPVHVAAQPVHTETHLVAHEADG 170 L PL IDLSITGST+ALLAL+ R EV+H DA P+ A PVH H Sbjct: 120 LWPLAIDLSITGSTVALLALTGQARGAQAYEVEHLDAHPLSPVA-PVHVSVH-------- 170 Query: 171 LVHVTAQPVHEPVRGVTVADLIAREAAQAVAEP---LPLDVETSTDPALFSTHAVAAARL 227 T A +A+ AA VAEP LP++ AA RL Sbjct: 171 ----------------TSAQAVAQAAAVDVAEPATDLPVE---------------AAERL 199 Query: 228 VDEGVTRIDRAKVAQVLAEHAEGTAPSMIARKLQVGYSTVVRILDHHTAQEVNA 281 +D GVTRIDR KVAQVLAEHAEGTAPSMIARKL VGYSTVVRIL+HHTA A Sbjct: 200 LDAGVTRIDRVKVAQVLAEHAEGTAPSMIARKLSVGYSTVVRILEHHTAHAAQA 253 >UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPK2_MYCSK Length = 350 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 69/184 (37%), Positives = 97/184 (52%), Gaps = 7/184 (3%) Query: 11 ATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPLALLGATHGVHKLVQS 70 AT FF GWL S S+ GNV HALL + +AA A V P+ LL ATH LV++ Sbjct: 15 ATAFFLGWLILAASMSLAGNVGHALLIAPVEMGWLAAGAALVPPIVLLAATHSATWLVRA 74 Query: 71 RIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTI 130 R G Y L +T A+A +FALSF ALR AVV G+ +++++ P VID++I +T+ Sbjct: 75 RSAGWVYWTCLALTAALAVGSFALSFDALRSFAVVL-GIRESLSWIWPAVIDVAIAHATL 133 Query: 131 ALLALSSAERAEVQHDAQPVHVAAQPVHTETHLVAHEADGLVH------VTAQPVHEPVR 184 LL+++ R E ++ P + V+ EA G+V +A PV P Sbjct: 134 CLLSMARPARVESFSTSRAAAAVGAPELSVPVAVSAEASGVVRDGVDDAQSANPVSAPGP 193 Query: 185 GVTV 188 G V Sbjct: 194 GPAV 197 Score = 44.7 bits (104), Expect = 0.003, Method: Compositional matrix adjust. Identities = 26/48 (54%), Positives = 33/48 (68%) Query: 224 AARLVDEGVTRIDRAKVAQVLAEHAEGTAPSMIARKLQVGYSTVVRIL 271 A LV EGVT D VA++LA+ A GTAPS I R+ +V ++TV RIL Sbjct: 293 AESLVHEGVTAKDVELVARILADGAAGTAPSTIGRRHEVHHTTVSRIL 340 >UniRef50_Q5Z1Z3 Putative phage recombinase n=1 Tax=Nocardia farcinica RepID=Q5Z1Z3_NOCFA Length = 207 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 29/59 (49%), Positives = 40/59 (67%), Gaps = 3/59 (5%) Query: 13 RFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPLALLGATHGVHKLVQSR 71 R+F G LAAG + SI GNV H+L+ +G +AA LA +AP ALL THG+ L+++R Sbjct: 79 RWFTGVLAAGAAVSIGGNVAHSLV---SGHGYLAALLAVIAPAALLIDTHGLAVLLRTR 134 Score = 42.4 bits (98), Expect = 0.018, Method: Compositional matrix adjust. 
Identities = 30/74 (40%), Positives = 50/74 (67%), Gaps = 4/74 (5%) Query: 69 QSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPA-IAFLVPLVIDLSITG 127 QS+ + A +++ + +A+ +AF LSFAALR+LA+ P +A+L P+++D +I Sbjct: 7 QSKGVRWARWSAVLVVLAIGGAAFVLSFAALRDLAI--KAHTPTHLAWLFPVIVDGTIIQ 64 Query: 128 STIALLALS-SAER 140 +TIA+LAL+ S ER Sbjct: 65 ATIAVLALADSPER 78 >UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcus opacus B4 RepID=C1BE54_RHOOB Length = 390 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Query: 68 VQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITG 127 +Q R + A + ++ I + + + AF LSFA R+LA + G+ + ++ P ++D +I G Sbjct: 11 IQVRALIAALIVAISIAIGITTGAFVLSFAVQRDLA-LQAGIPHYLTWIFPAIVDGAILG 69 Query: 128 STIALLALSS 137 +TIA++ALS Sbjct: 70 ATIAVVALSK 79 >UniRef50_UPI0001AF6E76 phage excisionase n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF6E76 Length = 297 Score = 40.0 bits (92), Expect = 0.097, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 44/68 (64%), Gaps = 5/68 (7%) Query: 71 RIIGGAYVASLWITVAVASSAFALSFAALRELAVV--WGGVAPAIAFLVPLVIDLSITGS 128 R++ + AS+W+TV +A+ +F LSF +LR LA + W G ++L PL++D I + Sbjct: 65 RLVIRSRTASVWLTVFIATISFVLSFNSLRSLAAMTAWPGWP---SWLSPLLVDGVIILA 121 Query: 129 TIALLALS 136 T+ +++L+ Sbjct: 122 TLVIVSLA 129 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ... 207 5e-52 UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobac... 185 2e-45 Sequences not found previously or not previously below threshold: UniRef50_D0YT23 Gp35 n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YT... 49 1e-04 UniRef50_B4F338 Putative bacteriophage excisionase n=2 Tax=Rhodo... 
48 3e-04 UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcu... 48 4e-04 UniRef50_Q0RMB7 Putative uncharacterized protein n=1 Tax=Frankia... 42 0.019 UniRef50_UPI0001BC2C5D hypothetical protein BlinB_03557 n=1 Tax=... 41 0.036 UniRef50_B8G6U9 Putative uncharacterized protein n=3 Tax=Chlorof... 41 0.062 >UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ8_BPMT4 Length = 259 Score = 207 bits (525), Expect = 5e-52, Method: Composition-based stats. Identities = 153/291 (52%), Positives = 177/291 (60%), Gaps = 48/291 (16%) Query: 1 MSTHKT-----PRESATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPL 55 MS H T P+E+ATRFF WL A T+ASILGNVTHA+L + + +AAA A V P+ Sbjct: 1 MSNHTTTSESSPQEAATRFFWAWLIAATAASILGNVTHAVLGAASSPL-IAAAAAIVPPV 59 Query: 56 ALLGATHGVHKLVQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAF 115 LLGATHGVH LV+SRI+G AY A+L I VA+A AF LSF ALRELA+V G+ P+IA+ Sbjct: 60 VLLGATHGVHALVRSRIVGAAYRAALTIVVALAVCAFVLSFEALRELAIVHAGMRPSIAW 119 Query: 116 LVPLVIDLSITGSTIALLALSSAERA----EVQH-DAQPVHVAAQPVHTETHLVAHEADG 170 L PL IDLSITGST+ALLAL+ R EV+H DA P+ A PVH H Sbjct: 120 LWPLAIDLSITGSTVALLALTGQARGAQAYEVEHLDAHPLSPVA-PVHVSVHT------- 171 Query: 171 LVHVTAQPVHEPVRGVTVADLIAREAAQAVAEPLPLDVETSTDPALFSTHAVAAARLVDE 230 A A LP++ AA RL+D Sbjct: 172 --------------SAQAVAQAAAVDVAEPATDLPVE---------------AAERLLDA 202 Query: 231 GVTRIDRAKVAQVLAEHAEGTAPSMIARKLQVGYSTVVRILDHHTAQEVNA 281 GVTRIDR KVAQVLAEHAEGTAPSMIARKL VGYSTVVRIL+HHTA A Sbjct: 203 GVTRIDRVKVAQVLAEHAEGTAPSMIARKLSVGYSTVVRILEHHTAHAAQA 253 >UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPK2_MYCSK Length = 350 Score = 185 bits (468), Expect = 2e-45, Method: Composition-based stats. 
Identities = 69/184 (37%), Positives = 97/184 (52%), Gaps = 7/184 (3%) Query: 11 ATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPLALLGATHGVHKLVQS 70 AT FF GWL S S+ GNV HALL + +AA A V P+ LL ATH LV++ Sbjct: 15 ATAFFLGWLILAASMSLAGNVGHALLIAPVEMGWLAAGAALVPPIVLLAATHSATWLVRA 74 Query: 71 RIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTI 130 R G Y L +T A+A +FALSF ALR AVV G+ +++++ P VID++I +T+ Sbjct: 75 RSAGWVYWTCLALTAALAVGSFALSFDALRSFAVVL-GIRESLSWIWPAVIDVAIAHATL 133 Query: 131 ALLALSSAERAEVQHDAQPVHVAAQPVHTETHLVAHEADGLVH------VTAQPVHEPVR 184 LL+++ R E ++ P + V+ EA G+V +A PV P Sbjct: 134 CLLSMARPARVESFSTSRAAAAVGAPELSVPVAVSAEASGVVRDGVDDAQSANPVSAPGP 193 Query: 185 GVTV 188 G V Sbjct: 194 GPAV 197 >UniRef50_D0YT23 Gp35 n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YT23_9ACTO Length = 299 Score = 49.5 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 1/57 (1%) Query: 81 LWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSS 137 L++T+ +A +F LSF+ L + ++ G P +A++ PL ID I +TIA++A S Sbjct: 2 LFLTIVLAVGSFVLSFSGLYDFSLT-CGYHPWLAWIWPLTIDGVIVAATIAVVAFSR 57 >UniRef50_B4F338 Putative bacteriophage excisionase n=2 Tax=Rhodococcus equi RepID=B4F338_COREQ Length = 382 Score = 48.3 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 13 RFFRGWLAAGTSASILGNVTHALLD-SDAGSPVVAAALAAVAPLALLGATHGVHKLVQS 70 RFF LA SI GN HA + + P++AAA++ + P++LL A+HG+ L ++ Sbjct: 154 RFFWTVLAVAAGVSIAGNALHAWVSHAPDFDPLLAAAISTIPPISLLAASHGLTILART 212 Score = 42.1 bits (97), Expect = 0.022, Method: Composition-based stats. 
Identities = 21/62 (33%), Positives = 39/62 (62%), Gaps = 1/62 (1%) Query: 76 AYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLAL 135 A+ A+L +T + ++F LSF AL +L + G +++L P++ID +I +TI++L L Sbjct: 85 AFWAALTLTSMICLASFILSFVALADLLHMTGQPRE-LSYLFPVIIDGTILQATISILWL 143 Query: 136 SS 137 + Sbjct: 144 AG 145 >UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcus opacus B4 RepID=C1BE54_RHOOB Length = 390 Score = 47.9 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Query: 68 VQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITG 127 +Q R + A + ++ I + + + AF LSFA R+LA+ G+ + ++ P ++D +I G Sbjct: 11 IQVRALIAALIVAISIAIGITTGAFVLSFAVQRDLAL-QAGIPHYLTWIFPAIVDGAILG 69 Query: 128 STIALLALSS 137 +TIA++ALS Sbjct: 70 ATIAVVALSK 79 >UniRef50_Q0RMB7 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RMB7_FRAAA Length = 327 Score = 42.1 bits (97), Expect = 0.019, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 2/124 (1%) Query: 8 RESATRFFRGWLAAGTSASILGNVTHALLDSDAG-SPVVAAALAAVAPLALLGATHGVHK 66 R R L S+ N HAL+ A S + L+A P+A+LG+ H Sbjct: 7 RSRPLRLPLTMLIFSLGMSVFANGEHALIYHRAQLSAWASVLLSATPPVAVLGSLHMQLS 66 Query: 67 LVQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSIT 126 G +T AV + +FALSF L ++ ++ G+ A++ P +DL Sbjct: 67 PFGQPGSSGQRYGDWALTGAVFALSFALSFETLWQIGLII-GLDHHFAWMFPAGLDLVAV 125 Query: 127 GSTI 130 + + Sbjct: 126 RAAV 129 >UniRef50_UPI0001BC2C5D hypothetical protein BlinB_03557 n=1 Tax=Brevibacterium linens BL2 RepID=UPI0001BC2C5D Length = 283 Score = 41.4 bits (95), Expect = 0.036, Method: Composition-based stats. 
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%) Query: 74 GGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALL 133 G + + + T+ +A AF LSF AL +LA G+A + A++ PL++D I +TIA++ Sbjct: 8 GWSVITAACGTIGIALGAFWLSFTALADLA-RRSGIASSQAWVWPLLVDGLIVVATIAVV 66 Query: 134 ALSSAE 139 AL Sbjct: 67 ALDGRA 72 >UniRef50_B8G6U9 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=B8G6U9_CHLAD Length = 650 Score = 40.6 bits (93), Expect = 0.062, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 65/149 (43%), Gaps = 14/149 (9%) Query: 101 ELAVVWGGVAPAIAFLVPLVI-DLSITGSTIALLALSSAERAEVQHDAQPVHVAAQPVHT 159 +L+VV PA+ + P+ I D I + + A ++ + A V+ P PV Sbjct: 110 DLSVVVSTSRPALTTVEPVYISDYWIEQARLVAAAKAAEQVAPVEA---PAPAEGAPVVV 166 Query: 160 ETHLVAHEADGLVHVTAQPVHEPVRGVTVADLIAREAAQAVAEPLPLDVETSTDPALFST 219 E + + V A P PV T A+++ + AV EP+ +V ++PA ST Sbjct: 167 EQPVAEVKPSPAVAEVATP---PVEVETTAEVVG-SSEPAVTEPVATEVALPSEPAEPST 222 Query: 220 HAVAAARLVDEGVTRIDRAKVAQVLAEHA 248 V EGV R A+++AEH Sbjct: 223 VIVV------EGVAIDLRLSPAEIVAEHG 245 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ... 205 2e-51 UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobac... 176 6e-43 UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcu... 80 8e-14 UniRef50_B4F338 Putative bacteriophage excisionase n=2 Tax=Rhodo... 77 7e-13 UniRef50_D0YT23 Gp35 n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YT... 76 1e-12 Sequences not found previously or not previously below threshold: UniRef50_Q0RUX8 Possible mycobacteriophage excisionase n=3 Tax=R... 61 3e-08 UniRef50_Q3L913 Hypothetical membrane protein n=2 Tax=Rhodococcu... 56 1e-06 UniRef50_UPI0001AF6E76 phage excisionase n=1 Tax=Mycobacterium k... 51 4e-05 UniRef50_UPI0001BC2C5D hypothetical protein BlinB_03557 n=1 Tax=... 
49 1e-04 UniRef50_B2HFL8 Phage excisionase n=1 Tax=Mycobacterium marinum ... 47 6e-04 UniRef50_Q06GE1 Putative transmembrane protein n=1 Tax=Rhodococc... 46 0.002 UniRef50_Q5Z1Z3 Putative phage recombinase n=1 Tax=Nocardia farc... 46 0.002 UniRef50_Q0RMB7 Putative uncharacterized protein n=1 Tax=Frankia... 45 0.003 UniRef50_B8HHK6 Putative uncharacterized protein n=1 Tax=Arthrob... 44 0.005 UniRef50_A4TG29 Putative uncharacterized protein n=1 Tax=Mycobac... 44 0.008 UniRef50_B7GRA6 Putative uncharacterized protein n=1 Tax=Bifidob... 42 0.022 UniRef50_C5EA24 ABC-type branched-chain amino acid transport sys... 41 0.032 UniRef50_D2Q9S7 Putative uncharacterized protein n=2 Tax=Bifidob... 41 0.038 UniRef50_B8HIX1 Putative Trp operon repressor n=1 Tax=Arthrobact... 40 0.075 UniRef50_Q70K80 Putative mycobacteriophage excisionase n=1 Tax=G... 40 0.077 UniRef50_A0Q9G2 Putative uncharacterized protein n=1 Tax=Mycobac... 40 0.098 >UniRef50_Q9ZWZ8 Gp82 n=1 Tax=Mycobacterium phage TM4 RepID=Q9ZWZ8_BPMT4 Length = 259 Score = 205 bits (520), Expect = 2e-51, Method: Composition-based stats. 
Identities = 150/291 (51%), Positives = 174/291 (59%), Gaps = 48/291 (16%) Query: 1 MSTHKT-----PRESATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPL 55 MS H T P+E+ATRFF WL A T+ASILGNVTHA+L + + + AAA + Sbjct: 1 MSNHTTTSESSPQEAATRFFWAWLIAATAASILGNVTHAVLGAASSPLIAAAAAIVPP-V 59 Query: 56 ALLGATHGVHKLVQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAF 115 LLGATHGVH LV+SRI+G AY A+L I VA+A AF LSF ALRELA+V G+ P+IA+ Sbjct: 60 VLLGATHGVHALVRSRIVGAAYRAALTIVVALAVCAFVLSFEALRELAIVHAGMRPSIAW 119 Query: 116 LVPLVIDLSITGSTIALLALSSAERA----EVQH-DAQPVHVAAQPVHTETHLVAHEADG 170 L PL IDLSITGST+ALLAL+ R EV+H DA P+ A PVH H Sbjct: 120 LWPLAIDLSITGSTVALLALTGQARGAQAYEVEHLDAHPLSPVA-PVHVSVHT------- 171 Query: 171 LVHVTAQPVHEPVRGVTVADLIAREAAQAVAEPLPLDVETSTDPALFSTHAVAAARLVDE 230 A A LP++ AA RL+D Sbjct: 172 --------------SAQAVAQAAAVDVAEPATDLPVE---------------AAERLLDA 202 Query: 231 GVTRIDRAKVAQVLAEHAEGTAPSMIARKLQVGYSTVVRILDHHTAQEVNA 281 GVTRIDR KVAQVLAEHAEGTAPSMIARKL VGYSTVVRIL+HHTA A Sbjct: 203 GVTRIDRVKVAQVLAEHAEGTAPSMIARKLSVGYSTVVRILEHHTAHAAQA 253 >UniRef50_A1UPK2 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPK2_MYCSK Length = 350 Score = 176 bits (446), Expect = 6e-43, Method: Composition-based stats. 
Identities = 69/184 (37%), Positives = 97/184 (52%), Gaps = 7/184 (3%) Query: 11 ATRFFRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPLALLGATHGVHKLVQS 70 AT FF GWL S S+ GNV HALL + +AA A V P+ LL ATH LV++ Sbjct: 15 ATAFFLGWLILAASMSLAGNVGHALLIAPVEMGWLAAGAALVPPIVLLAATHSATWLVRA 74 Query: 71 RIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTI 130 R G Y L +T A+A +FALSF ALR AVV G+ +++++ P VID++I +T+ Sbjct: 75 RSAGWVYWTCLALTAALAVGSFALSFDALRSFAVVL-GIRESLSWIWPAVIDVAIAHATL 133 Query: 131 ALLALSSAERAEVQHDAQPVHVAAQPVHTETHLVAHEADGLVH------VTAQPVHEPVR 184 LL+++ R E ++ P + V+ EA G+V +A PV P Sbjct: 134 CLLSMARPARVESFSTSRAAAAVGAPELSVPVAVSAEASGVVRDGVDDAQSANPVSAPGP 193 Query: 185 GVTV 188 G V Sbjct: 194 GPAV 197 >UniRef50_C1BE54 Hypothetical membrane protein n=1 Tax=Rhodococcus opacus B4 RepID=C1BE54_RHOOB Length = 390 Score = 79.9 bits (195), Expect = 8e-14, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Query: 68 VQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITG 127 +Q R + A + ++ I + + + AF LSFA R+LA+ G+ + ++ P ++D +I G Sbjct: 11 IQVRALIAALIVAISIAIGITTGAFVLSFAVQRDLAL-QAGIPHYLTWIFPAIVDGAILG 69 Query: 128 STIALLALSS 137 +TIA++ALS Sbjct: 70 ATIAVVALSK 79 >UniRef50_B4F338 Putative bacteriophage excisionase n=2 Tax=Rhodococcus equi RepID=B4F338_COREQ Length = 382 Score = 76.8 bits (187), Expect = 7e-13, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 13 RFFRGWLAAGTSASILGNVTHALLD-SDAGSPVVAAALAAVAPLALLGATHGVHKLVQS 70 RFF LA SI GN HA + + P++AAA++ + P++LL A+HG+ L ++ Sbjct: 154 RFFWTVLAVAAGVSIAGNALHAWVSHAPDFDPLLAAAISTIPPISLLAASHGLTILART 212 Score = 48.7 bits (114), Expect = 2e-04, Method: Composition-based stats. 
Identities = 21/62 (33%), Positives = 39/62 (62%), Gaps = 1/62 (1%) Query: 76 AYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLAL 135 A+ A+L +T + ++F LSF AL +L + G +++L P++ID +I +TI++L L Sbjct: 85 AFWAALTLTSMICLASFILSFVALADL-LHMTGQPRELSYLFPVIIDGTILQATISILWL 143 Query: 136 SS 137 + Sbjct: 144 AG 145 >UniRef50_D0YT23 Gp35 n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YT23_9ACTO Length = 299 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 1/57 (1%) Query: 81 LWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSS 137 L++T+ +A +F LSF+ L + ++ G P +A++ PL ID I +TIA++A S Sbjct: 2 LFLTIVLAVGSFVLSFSGLYDFSLT-CGYHPWLAWIWPLTIDGVIVAATIAVVAFSR 57 >UniRef50_Q0RUX8 Possible mycobacteriophage excisionase n=3 Tax=Rhodococcus RepID=Q0RUX8_RHOSR Length = 448 Score = 61.4 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Query: 68 VQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITG 127 +Q R + A + ++ I V + + AF LSFA R+LA+ G+ + ++ P ++D +I G Sbjct: 11 IQLRALIAALIVAIAIAVGITTGAFVLSFAVQRDLAL-QAGIPHYLTWIFPAIVDGAILG 69 Query: 128 STIALLALSS 137 +TIA++ALS Sbjct: 70 ATIAIVALSK 79 >UniRef50_Q3L913 Hypothetical membrane protein n=2 Tax=Rhodococcus erythropolis RepID=Q3L913_RHOE4 Length = 455 Score = 56.4 bits (134), Expect = 1e-06, Method: Composition-based stats. 
Identities = 41/204 (20%), Positives = 71/204 (34%), Gaps = 12/204 (5%) Query: 86 AVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSS---AERAE 142 + + AF LSFA R+LA + + ++ P ++D +I G+TIA++ +S ++R Sbjct: 47 GITTGAFVLSFAVQRDLA-RQALIPEHLTWIFPAIVDSAILGATIAIVIISKLNMSKRDR 105 Query: 143 VQHDAQPVHVAAQPVHTETHLVAHEADGLVHVTAQPVHEPVRGVTVADLIAREAAQAVAE 202 + A V V + + H A + + A A A Sbjct: 106 GFYIALAVSVVVISILGNAYHAYHAAI-AAREFISGGGDLGFSPLAPAIAAAIAIIPPAL 164 Query: 203 PLPLDVETSTDPALFSTHAVAAARLVDEGVTRIDRAKVAQVLAEHAEGTAPSM------- 255 L + A +V+ G D A LA + P Sbjct: 165 VLAFTHGITVLVKAVGMAYAAYREIVETGTATDDAMDTASDLAADRDPMQPDFRDSVTRK 224 Query: 256 IARKLQVGYSTVVRILDHHTAQEV 279 IA +L+ ++T + HT Sbjct: 225 IAEQLETEHTTDATVDASHTNHPA 248 >UniRef50_UPI0001AF6E76 phage excisionase n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF6E76 Length = 297 Score = 51.4 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 7/190 (3%) Query: 71 RIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTI 130 R++ + AS+W+TV +A+ +F LSF +LR LA + ++L PL++D I +T+ Sbjct: 65 RLVIRSRTASVWLTVFIATISFVLSFNSLRSLAAMTA-WPGWPSWLSPLLVDGVIILATL 123 Query: 131 ALLALSSAERAEVQHDAQPVHVAAQPVHTETHLVAHEADGLVHVTAQPVHEPVRGVTVAD 190 +++L+ RA+ + V V T V G Sbjct: 124 VIVSLA-PYRAQFWNRVFLWTVLGVGALVS---VGGNGLHAWLSTGHLVSWMRWGSAGLA 179 Query: 191 LIAREAAQAVAEPLPLDVETSTDPALFSTHAVAAARLVDEGVTRIDRAKVAQVLAEHAEG 250 + A A L + P + V R ++ R+DR + A H EG Sbjct: 180 CVPPVALLATTHILGILWRFDPVPPPDARSQV-RDRALELAAQRMDRWEAAAA-KLHEEG 237 Query: 251 TAPSMIARKL 260 PS+ K+ Sbjct: 238 YCPSVPTDKM 247 Score = 46.0 bits (107), Expect = 0.001, Method: Composition-based stats. 
Identities = 20/59 (33%), Positives = 27/59 (45%), Gaps = 3/59 (5%) Query: 14 FFRGWLAAGTSASILGNVTHALLDSD---AGSPVVAAALAAVAPLALLGATHGVHKLVQ 69 F L G S+ GN HA L + + +A LA V P+ALL TH + L + Sbjct: 140 FLWTVLGVGALVSVGGNGLHAWLSTGHLVSWMRWGSAGLACVPPVALLATTHILGILWR 198 >UniRef50_UPI0001BC2C5D hypothetical protein BlinB_03557 n=1 Tax=Brevibacterium linens BL2 RepID=UPI0001BC2C5D Length = 283 Score = 49.5 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%) Query: 74 GGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALL 133 G + + + T+ +A AF LSF AL +LA G+A + A++ PL++D I +TIA++ Sbjct: 8 GWSVITAACGTIGIALGAFWLSFTALADLA-RRSGIASSQAWVWPLLVDGLIVVATIAVV 66 Query: 134 ALSSAE 139 AL Sbjct: 67 ALDGRA 72 >UniRef50_B2HFL8 Phage excisionase n=1 Tax=Mycobacterium marinum M RepID=B2HFL8_MYCMM Length = 201 Score = 47.2 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 31/65 (47%), Gaps = 1/65 (1%) Query: 8 RESATRFFRGWLAAGTSASILGNVTHALLDSDAG-SPVVAAALAAVAPLALLGATHGVHK 66 + + FF LA AS+ N HA++ +P + AA+ V P++LL HG Sbjct: 43 QRANRSFFWWVLALAAMASVGSNALHAIVAPGNELAPWLKAAIGVVPPVSLLATAHGAAI 102 Query: 67 LVQSR 71 L + R Sbjct: 103 LSRIR 107 >UniRef50_Q06GE1 Putative transmembrane protein n=1 Tax=Rhodococcus sp. NS1 RepID=Q06GE1_9NOCA Length = 379 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 20/52 (38%), Positives = 33/52 (63%), Gaps = 1/52 (1%) Query: 86 AVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSS 137 + + AF LSF+ L++LA V G + A++ P ++D I G+TIA++ LS Sbjct: 29 GITTGAFILSFSVLKDLA-VQGMLPAEHAWIFPAIVDGGILGATIAVIVLSK 79 >UniRef50_Q5Z1Z3 Putative phage recombinase n=1 Tax=Nocardia farcinica RepID=Q5Z1Z3_NOCFA Length = 207 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 26/68 (38%), Positives = 46/68 (67%), Gaps = 1/68 (1%) Query: 69 QSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGS 128 QS+ + A +++ + +A+ +AF LSFAALR+LA+ +A+L P+++D +I + Sbjct: 7 QSKGVRWARWSAVLVVLAIGGAAFVLSFAALRDLAI-KAHTPTHLAWLFPVIVDGTIIQA 65 Query: 129 TIALLALS 136 TIA+LAL+ Sbjct: 66 TIAVLALA 73 >UniRef50_Q0RMB7 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RMB7_FRAAA Length = 327 Score = 44.8 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 2/124 (1%) Query: 8 RESATRFFRGWLAAGTSASILGNVTHALLDSDAG-SPVVAAALAAVAPLALLGATHGVHK 66 R R L S+ N HAL+ A S + L+A P+A+LG+ H Sbjct: 7 RSRPLRLPLTMLIFSLGMSVFANGEHALIYHRAQLSAWASVLLSATPPVAVLGSLHMQLS 66 Query: 67 LVQSRIIGGAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSIT 126 G +T AV + +FALSF L ++ ++ G+ A++ P +DL Sbjct: 67 PFGQPGSSGQRYGDWALTGAVFALSFALSFETLWQIGLII-GLDHHFAWMFPAGLDLVAV 125 Query: 127 GSTI 130 + + Sbjct: 126 RAAV 129 >UniRef50_B8HHK6 Putative uncharacterized protein n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HHK6_ARTCA Length = 206 Score = 44.1 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 21/57 (36%), Positives = 38/57 (66%), Gaps = 1/57 (1%) Query: 84 TVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSSAER 140 TV +A AF LSFA+L +LA G+ ++++ P+++D I +T+A++AL+ +R Sbjct: 23 TVLIAVGAFVLSFASLTDLA-ARSGIDRNLSWIWPIIVDGLIVAATVAIVALAGHDR 78 >UniRef50_A4TG29 Putative uncharacterized protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4TG29_MYCGI Length = 349 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 1/61 (1%) Query: 13 RFFRGWLAAGTSASILGNVTHALLDSDAGS-PVVAAALAAVAPLALLGATHGVHKLVQSR 71 R+F LA SI GN H L D P +AA +AAVAPL+LL HG +++ Sbjct: 119 RYFWAVLAISAIISIGGNALHGYLPPDTALWPWLAACIAAVAPLSLLATVHGAATMLRIS 178 Query: 72 I 72 Sbjct: 179 A 179 Score = 41.8 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 42/70 (60%), Gaps = 1/70 (1%) Query: 76 AYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLAL 135 A + ++ IT VA+++F LSFA+L +LA G +A L P+++DL+I +T+A++ L Sbjct: 49 ACIVAVLITAVVAAASFVLSFASLADLA-QRSGYPAELAKLWPVIVDLTIVLATVAVIVL 107 Query: 136 SSAERAEVQH 145 A H Sbjct: 108 GPAGVGSRAH 117 >UniRef50_B7GRA6 Putative uncharacterized protein n=1 Tax=Bifidobacterium longum subsp. infantis ATCC 15697 RepID=B7GRA6_BIFLI Length = 220 Score = 42.1 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 1/59 (1%) Query: 77 YVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLAL 135 + + V +A AF +SF ALR + + G+ A+A++ P++ID S T A A Sbjct: 1 MWPGVTVAVVLALLAFIISFDALRAVGLA-CGINAALAWMFPIIIDGSTLAFTWAAWAF 58 >UniRef50_C5EA24 ABC-type branched-chain amino acid transport system protein n=2 Tax=Bifidobacterium longum subsp. infantis RepID=C5EA24_BIFLO Length = 373 Score = 41.4 bits (95), Expect = 0.032, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 1/57 (1%) Query: 16 RGWLAAGTSASILGNVTHALLDSDAGSP-VVAAALAAVAPLALLGATHGVHKLVQSR 71 L + S+ GN HA L++ P A A+ ++ P+ALL +TH + + R Sbjct: 66 WAGLVLFSLFSVTGNALHAWLNAGGMLPTWGAPAIMSIPPIALLYSTHLIVIIAGDR 122 Score = 41.0 bits (94), Expect = 0.052, Method: Composition-based stats. 
Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 1/57 (1%) Query: 81 LWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSS 137 + + V +A AF LSF ALR L V G+ P +++ PL +D +I T A Sbjct: 1 MTVGVMIAVIAFVLSFDALR-LVFVSSGINPLLSWGGPLCVDGTILLCTWATWGFRK 56 >UniRef50_D2Q9S7 Putative uncharacterized protein n=2 Tax=Bifidobacterium dentium RepID=D2Q9S7_9BIFI Length = 411 Score = 41.4 bits (95), Expect = 0.038, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 1/61 (1%) Query: 77 YVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALS 136 + + + AF LSF ALR L V G+ P +++ P+ +D +I T A Sbjct: 19 MWPGITAGLVIGLIAFILSFDALR-LVFVSCGINPYLSWGGPVCVDGTILLCTWATWGFK 77 Query: 137 S 137 Sbjct: 78 K 78 Score = 40.6 bits (93), Expect = 0.064, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 16 RGWLAAGTSASILGNVTHALLDSDAGSP-VVAAALAAVAPLALLGATHGVHKLVQSRI 72 L + SI GN HAL+++ P A + ++ P+A+L ATH + + R+ Sbjct: 88 WAGLVLFSGCSIGGNALHALINNGLELPAWAPATIMSIPPVAMLYATHLIVIIAGDRL 145 >UniRef50_B8HIX1 Putative Trp operon repressor n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HIX1_ARTCA Length = 250 Score = 40.2 bits (92), Expect = 0.075, Method: Composition-based stats. Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 3/64 (4%) Query: 80 SLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLALSSAE 139 SL + +A ++F LSF L + A W G+ + +LVP+V+D +I A+ A Sbjct: 16 SLALVGVLALASFTLSFLGLIQ-AAAWAGIPEYLRWLVPIVVDSTIL--VYAVAASVQRA 72 Query: 140 RAEV 143 R E Sbjct: 73 RGES 76 >UniRef50_Q70K80 Putative mycobacteriophage excisionase n=1 Tax=Gordonia westfalica RepID=Q70K80_9ACTO Length = 292 Score = 40.2 bits (92), Expect = 0.077, Method: Composition-based stats. 
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 3/59 (5%) Query: 9 ESATRFFRGWLAAGTSASILGNVTHALLDSDAG---SPVVAAALAAVAPLALLGATHGV 64 ++ R+ LA ++ S+ GN HA L + ++A A+A VAPLALL + HG+ Sbjct: 85 DAGKRYHFSMLAIFSTVSVAGNAGHAYLAAGDQGVAHGLMAVAIAMVAPLALLASIHGL 143 >UniRef50_A0Q9G2 Putative uncharacterized protein n=1 Tax=Mycobacterium avium 104 RepID=A0Q9G2_MYCA1 Length = 394 Score = 39.8 bits (91), Expect = 0.098, Method: Composition-based stats. Identities = 65/241 (26%), Positives = 89/241 (36%), Gaps = 17/241 (7%) Query: 15 FRGWLAAGTSASILGNVTHALLDSDAGSPVVAAALAAVAPLALLGATHGVHKLVQSRIIG 74 F L T+ S+ GNV +A+ V+ ALAAVAP+ L H + K + R Sbjct: 88 FTRLLLGCTTVSLAGNVAYAVERGAVTP--VSIALAAVAPILLPVGVHFIPKAARVRR-- 143 Query: 75 GAYVASLWITVAVASSAFALSFAALRELAVVWGGVAPAIAFLVPLVIDLSITGSTIALLA 134 GA + A +AF LSF AL L + G A+L+P+ ID+ + AL+ Sbjct: 144 GARFVVTVAVIVAAVAAFVLSFEALSGL-MRMRGHDGWTAYLLPVAIDVLAAAAAYALVV 202 Query: 135 LSSAERAEVQHDAQPVHVA---AQPVHTETHLVAHEADGLVHVTAQPVHEPVRGVTVADL 191 A R E + A A+P T T + L T EP Sbjct: 203 EPDAVREEAPVTQRVEAPAMRDAEPAPTATQTPSVRDAELATHTDATPEEPATQPDALR- 261 Query: 192 IAREAAQAVAEPLPLDVETSTDPALFSTHAVAAARLVDEGVTRIDRAKVAQVLAEHAEGT 251 A E +P+ S D A + R + T R Q A A T Sbjct: 262 ------DADTEEIPVTCSDSRDAAAKQGTESSTTRRLAAVPT--PRPAATQTAASPATQT 313 Query: 252 A 252 A Sbjct: 314 A 314 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.119 0.308 Lambda K H 0.267 0.0354 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,266,091,641 Number of Sequences: 3077464 Number of extensions: 41120734 Number of successful extensions: 283929 Number of sequences better than 1.0e-01: 127 Number of HSP's better than 0.1 without gapping: 19 Number of HSP's successfully gapped in prelim test: 242 Number of HSP's that attempted gapping in prelim test: 283190 Number of HSP's gapped (non-prelim): 847 length of query: 281 length of 
database: 1,040,396,356
effective HSP length: 127
effective length of query: 154
effective length of database: 649,558,428
effective search space: 100031997912
effective search space used: 100031997912
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 91 (39.9 bits)
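
The footer statistics can be cross-checked by hand. The sketch below is not part of the BLAST output. It recomputes the effective lengths, the effective search space, and the bit equivalents of the raw thresholds from the numbers printed in the report. The function names are hypothetical. The assignment of the first Lambda/K pair (0.310, 0.119) to ungapped values (X1, S1) and the second pair (0.267, 0.0354) to gapped values (X2, X3, S2) is an assumption, adopted because it reproduces the printed figures.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 281
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
EFF_HSP_LEN = 127
UNGAPPED = (0.310, 0.119)   # (Lambda, K), first pair printed; assumed ungapped
GAPPED = (0.267, 0.0354)    # (Lambda, K), second pair printed; assumed gapped


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Subtract the effective HSP length from the query and from every
    database sequence, then multiply the two effective lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X-dropoff to bits: Lambda * X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN))
    print("S1", round(bit_score(41, *UNGAPPED), 1))
    print("S2", round(bit_score(91, *GAPPED), 1))
    print("X1", round(dropoff_bits(16, UNGAPPED[0]), 1))
    print("X2", round(dropoff_bits(38, GAPPED[0]), 1))
    print("X3", round(dropoff_bits(64, GAPPED[0]), 1))
```

Under these assumptions the effective query length is 281 - 127 = 154, the effective database length is 1,040,396,356 - 3,077,464 x 127 = 649,558,428, and their product is 100,031,997,912, which agrees with the "effective search space" line. The per-alignment bit scores and E-values are not recomputed here, because compositional adjustment changes the scoring parameters for each subject sequence.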