Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2699 |
Symbol | |
ID | 7873441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2922473 |
End bp | 2928151 |
Gene Length | 5679 bp |
Protein Length | 1892 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699622 |
Product | YD repeat protein |
Protein accession | YP_002889678 |
Protein GI | 237653364 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.499088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCGG GTGCCGCACT GCTGGCGGCG CTCTGTATCA CGGCGCTCCC CGGGGTAGCG GCCGAGACCT CGGCGGCGCA ATCCGGCGCC GCGGATCACG GCATCCAGCA GAGCGGGTGC GGAATCACGA TGAGCTGTGC ACCGGACTCT GCGCAGGACG GCCCCTCGGC GCCAGAGCCC CCCGCAGCGG GGCAATGCGT CTCCAACAAC GCGGGCAACC CCTGCGGCAG CGCCAGCGGC CCGGCCAGCC AGGGTTCCAC CACCGGCCAG GACGTCGGCG CGGGAAACCC GATCGACGTC CTCTCGGGCA ACAAGTACCA GCAGGAGGTC GATCTTCCGG CGCTGCCAGG CATCCTCGGC CTCGAGATCG TCCGCCACTA CAACAGCCGA CGTGCGCGCC CGGGCGACAT CGGGGCGCTC GGCAGCGGCT GGAGACTGTC CTACGAAACG CGCCTGAACA TCCGCGACGA TCGCCTCGAA ATCCTCCAGG CCGACGGCGC GCGGGTGATC TTTGCGCGCC GGCCCGGTGA GCCGCAGCGC TGCGCCAGCC GCGACCCCAG CCGCGGCGAC ATCCTGCTCC GGCACCGCCC CGGCGGCGGC GACGAATATC TCTGGCGCTG GCCCGACGGC CGCAGCCTGC TGTTCGACAG CGCCGGCCAC CTGGTGCAGA TCCTCGTTGC GAGCGGCGAG TTCGTCTCAC TGCGCCACGA CGCCGCCGGC CGGCTGCTGC AGGTCACCGA CCCGCAGGGC CGCAGCCTGC AAGTTCACTA CCACGCTCCC GGCAGCGGCA TCGCCCACAT CGACAGTCCG CTCGGACGCT TCTCGTATGC GCAGGAACGC GTCGCGCTTC CCGCGCGGAC GGACGCAGGC ACGCAGCCTG GGCGCTCCGT CCACGCTCCG CAACGCCCGG GCTCCGCCGG CCCGAGCCGC CCCCTGCAGC GCGGCGCTCC CGCCGGAAAG GCCGCTGCAC CCGCGGTCAC CGCGGCGCGC CTGCTCCGTG TCGACTACCC CGGCGCACCG GCTGCGCAGA CCGGCGGAGC CGCCGCGCGC GAATATCACT ACGAGGACCC GCGCCATCCG GTCGCGCTCA CCGGCATCAG CGTGCTCGCG CGTCAGCCCG CAGGCCCGCC CGTGCCCGAA CGCATCGCCA GCTACGCCTA CGACCACGCC GGGCGCGGCA TCCGCTCGGT GCGTGGCGCG CTGCCGGCAG GCGGTGAAAG CGGCCCCGAG GACGTGCGCC TCGACTATCC CGAACGCGGC CGCAGCGTGC TCACCAACAG CCTCGGCCAG CGCACCACCT ACCTTGGCGC GGTGATCGCC GGCCAGCGCC GCCTGCTCGA AGCGCGCGGC CCGGGCTGCG CGCGCTGCGG CCCCACCGAC GTCCGCTACC GCTACACGCC CGACGGCCGA TTGAGCGCCA TCACGACCCT GGACGCCGAG GGCAGGCCGC GCCTTTCCAC CCAGCACCGT TTCGACACCC GCGGGCGCCT CGCCGAGACC CTCGGTGCCG ATCTCTCCGG CGCCCGGCCG CGCCTGCTCG ACCGCACGCG CTACGAATAT CCCGACACGC CGGCAACCGC GCTCGCCGAC ACGGACCGCG CAGCGCAACG CACCGAGCCC GCCGCGACGA CGCCTCCGCG CGCGATCATC CGCCCGAGCG TCGTCCCCGG CCGCGAGCAC CGCATCGAGC TGCGCTACAA CACGACCGGC CAGCCCATCG AACTGCGCGA GTCCGGTTTC AGCCCGATCG ACGCGGAAGG CCGGCCGGAT CCCGTGCCGA TCGAACGCCG CGTCACCTGG CGCTACGCCA CGATCAACGG TCGCAGCGTG CTTGCCGAGC TCGACGGCCC GCTCCCCGAC GGAGCGGACG CCAGCCCGCT CGACAGCGAC CTCGTCCGCC TGCACTGGGA CGAACGCGGC AGCTTCATTC GCGCGCTGGA AGGTCCCGGC GGCCAGCGCA GCGAGATCGA ATCCGATCCG GCCACCGGCC TGCCGCGGCG GGTGCGCGAT GCCGAAGGTC ACGAGACCCA CTTGCGCCAC GACTCCGCCG GCCAGCTGAT CCGGTGGCGC AGCCGCAGCC CGGGTGAGGC GGCAGATCGC ATCCACGCTG CCGAATACGA TGCCCTCGGC CAACTCGTCG AGCTGCGCTC CGGAGAGGAT GAAGAGCGCC TGCATCCGCG CTGGCGGCGC GCCTTCGATG CCGCCGGCCG CCTGAAGTGG CACGCCGACG CGCTCGGCAT CCTGCGGACC TGGGCCTACG ACCACGAGAG CCGCGTCGTC GAGACCGGGC TGCGCAGCGC CAGCCGGCTG CTGCGGCGCC GCTGGCGTTA CGACGAGCAC GGACGCCTCG CCGCGATCGA CGACGACACC GGCTTCACCC GCAGCCTGCG CCACGACGTC GCAGGCCGGC TCGTCGGCCT CCTGGACAGT CACGGTCGCG AGCTGCTGCC GCCCGCGCGC CACCCCGACG GCGGATCCGC GCCGCCAGCA CCCCCTGCCC ACCCACGCAT CCTGAAGGAC GACTTCGGAC GCCCCGTCCT CGAACGCAGT CCCGACGCCG GAGCGCGCTG GCGCAGTTTC GATGCCGCGG GGCGCCTTGT CGCCATGGGC GACGCCCTCG GCCACCGCGC GCGCTACACC TGGGACCCGC GCGGCCGCAT CCTCGCCCAG GAGGTCACCG ACGGGCGCAA CGGCAGCACC GAGACCACCC GCTGGCGCTA CGACGGCTCC CGCCTGCTCG AAGTCGATCA CCCCAAGGGG CGCGAACGCT ACGAGTACGA CCAGCGCGGC TTGCGCAGCG CGCGCATCGT GAGCCTGAAG CGCGACGGCG GCGAGCTCGT CGTGGTCACG CGCTACGAGT ACGACGCCGA GGGGCATCTT GTCGCCACCA CCCTGCCCGA TGGCAGTCGC CTGCGCTACG AGCGCAACGG CCAGGGCCAG GTCGTCGCGC TGAAGCGCCA GCCGGTGGCC ACGCCCTGGC TGCGCGCGCT GGCGCGCGAA CAGGTCATCG CCAGCGCGTT CCAGCGCGAT CTCTTCGGCC TGCGTGGCTT CCGGGCCGGC AACGGCATGC GTACGTTGCA CGAGCGCACG CGCACCGGCA CCTTGGTCCG CGTCGTTCAT CTGCGCGCCG ACGAGCGCGG CGCACCGGTC CGTAATGCAC GTGCGCTGCT GCCGATCGCA CCTGGCCAGC CCCTGGCCAC GCGCATCGAA CGCTTGTTCG GCATTTCCGC GGCCCACGCC GCGCAAGCAA ACACCGATGC GGACGTCACC CAGCCGGTGG AGGCAGCAGA TGGCCAAGCC CTCGAAGGTG CACTACCACG CCTGCAGGAG ACTGTCGCGT CCGCCGACAC CATCCTGCTC GACTACCAGT ACGTCTGGGA CCCGGAAGGC AACCTCCTGC ACGCGCGCCG GCGCGAGCCC GGAGGCCCGC TGCCCTCGAC GCGCCGCAGC CAGGCCTACG ACCTTCGCAA CCAGCTCGTG GCCAGCGTCG AATGGCGGGA CGACGGCGAC GTGCTGGCCG AAACCGCCGT GTGGCGTTAC GCCTACGATC GCCACCAGCG CCGCGTCCTT GCCCAGGAAG GCGTCGTGTC GCAGCAGGAG CTCGCCGGCC ACACCCGCCC GACCCACTTC GCACCGGGCA GCCACCGTGC CTTGTCCTCG AGCCCACGCG CAGCGAACGT GCCGGCGCGG GCCGCCGCGA GCCGACCGCA CGAGGCCGCC ACGCCCGGTC ACCACGACGC CAGCGGCCAG CCCGAACGCT TCGGCACGCG CAGCCTGCGA TGGGACGCAC TCGGCCGCCT GATCGAGGTG CACGCAGGCG ATCGCAGCAT CGCCCGCTAC GCCTACGACC ACCGCGGCCT GCGCATCGAG CGCACGCGCT TCGATCCCGC AATGGTCGCG CCCACAACGA CCCACACCGT GTACGACGAC GCCCGCCAAC CCCTCGCAGA GCTCGACGCC GACGGCAGAT TGATCCGCCA GTATCTGTGG CTAGCCGACC TGCCCCTCGC CGTGCTCGAC ACCCCGGCGC GCCCCGACAG CGCAACCGAC TCCGCGCGAC GCATCCTCGC AGACCTCGGC CGCATCGCGC AAAGCTGGCT CGCCCCGCAG GCCGGCCTCG CCTGGCTGCA CACCAACCAC CTCGGCGCCC CCGAACTGGC CACCGACGCC GACGGCGAGC CGCTCTGGCG CGCGCGCCAC GCCCCCTTCG GTGCCGCGAC GGTCACCACC TCGCCCCGGC ATCCCGACTT CACCCTCGAC CTGCGCCTGC CCGGCCAGGT CTTCGACGCC GAAACCGGCC TGCACTACAA CCGCCGCCGT TACTACGCGC CGACCCTGGG CGAATACCTC ACCCCCGACC CGCTCGGTAC GCCGGACGGG CCGAACCCGT ATGCGTATGC GGCGTTCAAT CCGCTGCGGA ACGTGGATCC GGATGGGCTG GTGCTGTTCG CGTTCGATGG GACGGGGAAC AGCGACGACC TCAACGACCC GGCGATGGCG GGCAGCGGAT TCAGCAATGT GGTGTATTTC TTTGACGCTT ACACTGCCAC CAAGCGCTAT GTAAGTGGCG TCGGCACCGT GCATCACGAC ATCGACTACG GCGACATCCG CCCGGAAGAC CACGCGACCG GCCATCTGCT GTGGTGGCTG ACACCGGGCG ATCCGGTCCA TGTGAACGAC ATGGGCGGCA ACTACTCGGG GCCGGCACGG ATCGGGCGGA TGAGCCAATA CCTGGACGAC GAGGCCGAGC TCTTCAGTGA CGACCGGGTA ATGGACATCG ACATCGTCGG CTTCAGCCGC GGCGCGGCCC AGGCGCGCGA GTTCGCCAAC CGGATCGTCG CGAAGACGGT CCGCCATGAG GGCCAGGACT ACTACCGCTA CACGAACCGC CGCGGAGACT CTGCCTGTCA GGCGGTCGAT TTCCGCTTCA TGGGGCTCTT CGACACCGTG CTATCGACCA ATTTCAGCGG CGAAGCGTAT CGCCTCGGGA TCCCGGAGGT CTTCGCCCAT GTTGCGCAGG CGGTCGCGCT CAACGAGCAC CGCTCGGACT CGATCACGGA GTTCGCCTAT CGCAACCCGA AGCCGCATCG CATGCACTGG GGCGGCTTTC CGCTCGAGTC GATCGGCGCC AGCAGCGATG CGCCCGGTCG CATCCGCATC GAGAAGGGCT TCGTCGGCGC CCATGCGGAC ATCGGCGGGG GCTATCCCGA CGCCGAGCAA GGCCTCTCGC TGGTTGCACT CAACTGGATG GTGGCGCAAG CACGCGACGC CGGGGTAGAA ATGGAGTCGG TGGCCCCCGT TCCCATCCTT AATGTGGTGA TCCACGACCA GAGCAATGTG ATCGAGATCG GCAACCCGCA ACACAGCGTC GTTCCGCGCC CGACCGGGAA CGGCGATCTG CCCCGGATCG ATTACGTCTT CCCCGAGGAC CGCGCGGTCA ACGGCGCGGT GGCCGGAGAT CGCCAGCGCC GCATGGCGTT CGACAACGGC AGCCTCACCC ACGCCGACAC CCTGCGCTAC ATCACCTGGC TCCCGCGCGA CGCCACAAGG CGCGGCGACG GCTCCACGCT CGACCCGCGC CGTCTCGGCG ACGTCACCGG CACGGTGGAT ATGGCGAGCT ACCTCGCCTG GCTCGGGCAG CCCGAGAACG GCTACACGAC GACAAATGGA GGATTCTGA
|
Protein sequence | MRAGAALLAA LCITALPGVA AETSAAQSGA ADHGIQQSGC GITMSCAPDS AQDGPSAPEP PAAGQCVSNN AGNPCGSASG PASQGSTTGQ DVGAGNPIDV LSGNKYQQEV DLPALPGILG LEIVRHYNSR RARPGDIGAL GSGWRLSYET RLNIRDDRLE ILQADGARVI FARRPGEPQR CASRDPSRGD ILLRHRPGGG DEYLWRWPDG RSLLFDSAGH LVQILVASGE FVSLRHDAAG RLLQVTDPQG RSLQVHYHAP GSGIAHIDSP LGRFSYAQER VALPARTDAG TQPGRSVHAP QRPGSAGPSR PLQRGAPAGK AAAPAVTAAR LLRVDYPGAP AAQTGGAAAR EYHYEDPRHP VALTGISVLA RQPAGPPVPE RIASYAYDHA GRGIRSVRGA LPAGGESGPE DVRLDYPERG RSVLTNSLGQ RTTYLGAVIA GQRRLLEARG PGCARCGPTD VRYRYTPDGR LSAITTLDAE GRPRLSTQHR FDTRGRLAET LGADLSGARP RLLDRTRYEY PDTPATALAD TDRAAQRTEP AATTPPRAII RPSVVPGREH RIELRYNTTG QPIELRESGF SPIDAEGRPD PVPIERRVTW RYATINGRSV LAELDGPLPD GADASPLDSD LVRLHWDERG SFIRALEGPG GQRSEIESDP ATGLPRRVRD AEGHETHLRH DSAGQLIRWR SRSPGEAADR IHAAEYDALG QLVELRSGED EERLHPRWRR AFDAAGRLKW HADALGILRT WAYDHESRVV ETGLRSASRL LRRRWRYDEH GRLAAIDDDT GFTRSLRHDV AGRLVGLLDS HGRELLPPAR HPDGGSAPPA PPAHPRILKD DFGRPVLERS PDAGARWRSF DAAGRLVAMG DALGHRARYT WDPRGRILAQ EVTDGRNGST ETTRWRYDGS RLLEVDHPKG RERYEYDQRG LRSARIVSLK RDGGELVVVT RYEYDAEGHL VATTLPDGSR LRYERNGQGQ VVALKRQPVA TPWLRALARE QVIASAFQRD LFGLRGFRAG NGMRTLHERT RTGTLVRVVH LRADERGAPV RNARALLPIA PGQPLATRIE RLFGISAAHA AQANTDADVT QPVEAADGQA LEGALPRLQE TVASADTILL DYQYVWDPEG NLLHARRREP GGPLPSTRRS QAYDLRNQLV ASVEWRDDGD VLAETAVWRY AYDRHQRRVL AQEGVVSQQE LAGHTRPTHF APGSHRALSS SPRAANVPAR AAASRPHEAA TPGHHDASGQ PERFGTRSLR WDALGRLIEV HAGDRSIARY AYDHRGLRIE RTRFDPAMVA PTTTHTVYDD ARQPLAELDA DGRLIRQYLW LADLPLAVLD TPARPDSATD SARRILADLG RIAQSWLAPQ AGLAWLHTNH LGAPELATDA DGEPLWRARH APFGAATVTT SPRHPDFTLD LRLPGQVFDA ETGLHYNRRR YYAPTLGEYL TPDPLGTPDG PNPYAYAAFN PLRNVDPDGL VLFAFDGTGN SDDLNDPAMA GSGFSNVVYF FDAYTATKRY VSGVGTVHHD IDYGDIRPED HATGHLLWWL TPGDPVHVND MGGNYSGPAR IGRMSQYLDD EAELFSDDRV MDIDIVGFSR GAAQAREFAN RIVAKTVRHE GQDYYRYTNR RGDSACQAVD FRFMGLFDTV LSTNFSGEAY RLGIPEVFAH VAQAVALNEH RSDSITEFAY RNPKPHRMHW GGFPLESIGA SSDAPGRIRI EKGFVGAHAD IGGGYPDAEQ GLSLVALNWM VAQARDAGVE MESVAPVPIL NVVIHDQSNV IEIGNPQHSV VPRPTGNGDL PRIDYVFPED RAVNGAVAGD RQRRMAFDNG SLTHADTLRY ITWLPRDATR RGDGSTLDPR RLGDVTGTVD MASYLAWLGQ PENGYTTTNG GF
|
| |