Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1115 |
Symbol | |
ID | 7084644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1218096 |
End bp | 1220288 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698130 |
Product | capsular exopolysaccharide family |
Protein accession | YP_002354770 |
Protein GI | 217969536 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.335686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGACG ACCGGGACGA CTTCATCAAC CTGGGCGAGA TCATCGCCGT CCTGCTCGAA TACAAGTGGC TGATCCTCGC CGTCACCTTC TTCGCCGTCT TCATCGGCGC GGTGGTGGCC TTCGTGTCCA CGCCCATCTA CCGCGCCGAC GGGCTGGTGC AGGTGCAGGA CAGCAAGGGG CCCAAGGGCG GGCTGGCCGC GCTGCGCGAC GTCGAGGCCG TGCTCGGCGA GAACAGCTCG GTCACCGCCG AGCTCGAGAT CCTGCGCTCG CGCATGATCC TCGGCCGCGT GGTCGAGCGC CTGCGCCTCG ACATCCGCGC CACCCCCGAC TACTTCCCCA TCTTCGGCCG CGCCCTCGCG CGCCGCTATA ACGGCGACCA GCCCGCCGCG CCGCTGCTCG GCCTGGGCAG CTACGCCTGG GGCGGCGAGC GCATCACCGT GCAGACCCTC GAGGTGCCGT CCTACCTGGT CGGCCTGCCG CTCACCCTCA TCGCCGGCGA CAACGGCGGC TTCACCCTCT ACGACGACCA GGACCAGCCC CTGCTCGACG GCAGCGTCGG CCAGCCCGCC AGCAGCGCCG ACGGCAAGAC CACGCTCTTC GTCGCCGAGC TCGTCGCCCG CCCCGGCACC CATTTCGATC TCGCCCGCAT CAGCCAGATC CAGGCCATCG CCGCCCTGCG CGAAGACCTC GAAGTGCGCG AGCGCGCCCG CCAGTCCAAC GTGATCGAAG CCGCCTACAG CGACGCCGAC CGCGCCGAAG CCGAGCGCCT GCTCAACGAA GTCCTCAACG CCTACGTGCG CCAGAACGTC GAATACCGCT CCGCCGAGGC CGACGCCACC CTGGCCTTCC TCGAAAAGCA GCTCCCCGAG CTCAAGGCCC AGCTCGACAC CGCCGAAGCC GCCTACAACG ACTACCGCCA GACCCGCGGC TCGGTTGACC TCACCCTCGA GACCCAGTCG GTGCTCAGCT CCATCGTCAA GGTGGACGCC GACGTCGTCG AGCTCCAGCA AAAGCGCGAC GAGCTGCGCC AGCGCTTCAC CCCCGAGCAC CCCCAGGTCA AGGCCATCGA CTCGCAGCTC GGCCGCCTGC GCGCCGTGCG CGGCACCCTC GACAAGGACG TCAATCGCCT CCCCGACACC CAGCAGACCG CGCTGCGCCT GCGCCGCGAC GTCGAGGTCG CCACCGCGCT CTACACCAAC CTGCTCAACA GCGCCCAGCA GCTGCGCGTG GCGCGCGCCG GCACCGTCGG CGACGTGCGC GTGATCGACC CCGCCGCCAC CGCGCCGCTG CCGGTCGCGC CGCGCAAGGC GCTCATCCTG CTGCTCTCGG GCGTGCTCGG CGTGCTCGGC TCGCTCGGCC TGGTGTGGGC CATCCGCAGC CTGCGCGTGG TCGTCGAAGA CCCGCAGACC ATCGAGCGCG AGCTCTCGCT GCCGGTGTAC GCCACCGTGC CCGACAGCAA GGACGAAGCC GTGCTGTCGC GCGCCATCGC CCGCGGCAAG ACCGACAAGG GCCAGCTCCT CGCCACCGCC CACCCCGACG ACGACGCCAT GGAGAGCCTG CGCAGCCTGC GCACCACGCT GCACTTCGCG CTGCTCGGCG CCGAGAAGGG CTCGGTGCTC ATCACCGGCC CGGCGCCCGG CGTGGGCAAG AGCTTCATCA GCAAGAACCT CGGCGCCGTG CTCGCCCAGG CCGGCAAGCG CGTCATGCTG GTCGACGGCG ACCTGCGCAA GGGCCACATC AACAAGGCCT TCGGCATCGG CCGCGGTGTG GGCGTGTCCG ACTACATCAT GGGCGCCGCC AGCATCGAGC AGATCGTCAA GCCCACCGGC ATCGACAACT TCTCCCTCGT CACCACCGGC CAGATCCCGC CCAACCCGTC CGAGCTCCTC ATGCACCCGC GCTTCGCCGC GCTGCTCGCC GAGCTCGAAA AGCAGTGCGA CGTGCTCATC ATCGACGCGC CCCCGGTGCT CGCCGTGTCC GACGCCGCCA TCATCGGCCG CCAGGTCGGC GCCACCCTCC TGGTCGCCCG CGCCGGCCGC CACCCGGTGC GCGAGCTCGA GCAGGCCATC AAGCGCTTCG ACCAGGCCGG CGTGGAGGTC AAGGGATTCG TGTTCAACGG CTTCGACCTC ACCCGACAAC GGCATCGCTT CGGGTACGAG GGGTATCACT ACCAGTACAA GTACAAGGCG TGA
|
Protein sequence | MRDDRDDFIN LGEIIAVLLE YKWLILAVTF FAVFIGAVVA FVSTPIYRAD GLVQVQDSKG PKGGLAALRD VEAVLGENSS VTAELEILRS RMILGRVVER LRLDIRATPD YFPIFGRALA RRYNGDQPAA PLLGLGSYAW GGERITVQTL EVPSYLVGLP LTLIAGDNGG FTLYDDQDQP LLDGSVGQPA SSADGKTTLF VAELVARPGT HFDLARISQI QAIAALREDL EVRERARQSN VIEAAYSDAD RAEAERLLNE VLNAYVRQNV EYRSAEADAT LAFLEKQLPE LKAQLDTAEA AYNDYRQTRG SVDLTLETQS VLSSIVKVDA DVVELQQKRD ELRQRFTPEH PQVKAIDSQL GRLRAVRGTL DKDVNRLPDT QQTALRLRRD VEVATALYTN LLNSAQQLRV ARAGTVGDVR VIDPAATAPL PVAPRKALIL LLSGVLGVLG SLGLVWAIRS LRVVVEDPQT IERELSLPVY ATVPDSKDEA VLSRAIARGK TDKGQLLATA HPDDDAMESL RSLRTTLHFA LLGAEKGSVL ITGPAPGVGK SFISKNLGAV LAQAGKRVML VDGDLRKGHI NKAFGIGRGV GVSDYIMGAA SIEQIVKPTG IDNFSLVTTG QIPPNPSELL MHPRFAALLA ELEKQCDVLI IDAPPVLAVS DAAIIGRQVG ATLLVARAGR HPVRELEQAI KRFDQAGVEV KGFVFNGFDL TRQRHRFGYE GYHYQYKYKA
|
| |