Gene Tmz1t_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1115 
Symbol 
ID7084644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1218096 
End bp1220288 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content70% 
IMG OID643698130 
Productcapsular exopolysaccharide family 
Protein accessionYP_002354770 
Protein GI217969536 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.335686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGACG ACCGGGACGA CTTCATCAAC CTGGGCGAGA TCATCGCCGT CCTGCTCGAA 
TACAAGTGGC TGATCCTCGC CGTCACCTTC TTCGCCGTCT TCATCGGCGC GGTGGTGGCC
TTCGTGTCCA CGCCCATCTA CCGCGCCGAC GGGCTGGTGC AGGTGCAGGA CAGCAAGGGG
CCCAAGGGCG GGCTGGCCGC GCTGCGCGAC GTCGAGGCCG TGCTCGGCGA GAACAGCTCG
GTCACCGCCG AGCTCGAGAT CCTGCGCTCG CGCATGATCC TCGGCCGCGT GGTCGAGCGC
CTGCGCCTCG ACATCCGCGC CACCCCCGAC TACTTCCCCA TCTTCGGCCG CGCCCTCGCG
CGCCGCTATA ACGGCGACCA GCCCGCCGCG CCGCTGCTCG GCCTGGGCAG CTACGCCTGG
GGCGGCGAGC GCATCACCGT GCAGACCCTC GAGGTGCCGT CCTACCTGGT CGGCCTGCCG
CTCACCCTCA TCGCCGGCGA CAACGGCGGC TTCACCCTCT ACGACGACCA GGACCAGCCC
CTGCTCGACG GCAGCGTCGG CCAGCCCGCC AGCAGCGCCG ACGGCAAGAC CACGCTCTTC
GTCGCCGAGC TCGTCGCCCG CCCCGGCACC CATTTCGATC TCGCCCGCAT CAGCCAGATC
CAGGCCATCG CCGCCCTGCG CGAAGACCTC GAAGTGCGCG AGCGCGCCCG CCAGTCCAAC
GTGATCGAAG CCGCCTACAG CGACGCCGAC CGCGCCGAAG CCGAGCGCCT GCTCAACGAA
GTCCTCAACG CCTACGTGCG CCAGAACGTC GAATACCGCT CCGCCGAGGC CGACGCCACC
CTGGCCTTCC TCGAAAAGCA GCTCCCCGAG CTCAAGGCCC AGCTCGACAC CGCCGAAGCC
GCCTACAACG ACTACCGCCA GACCCGCGGC TCGGTTGACC TCACCCTCGA GACCCAGTCG
GTGCTCAGCT CCATCGTCAA GGTGGACGCC GACGTCGTCG AGCTCCAGCA AAAGCGCGAC
GAGCTGCGCC AGCGCTTCAC CCCCGAGCAC CCCCAGGTCA AGGCCATCGA CTCGCAGCTC
GGCCGCCTGC GCGCCGTGCG CGGCACCCTC GACAAGGACG TCAATCGCCT CCCCGACACC
CAGCAGACCG CGCTGCGCCT GCGCCGCGAC GTCGAGGTCG CCACCGCGCT CTACACCAAC
CTGCTCAACA GCGCCCAGCA GCTGCGCGTG GCGCGCGCCG GCACCGTCGG CGACGTGCGC
GTGATCGACC CCGCCGCCAC CGCGCCGCTG CCGGTCGCGC CGCGCAAGGC GCTCATCCTG
CTGCTCTCGG GCGTGCTCGG CGTGCTCGGC TCGCTCGGCC TGGTGTGGGC CATCCGCAGC
CTGCGCGTGG TCGTCGAAGA CCCGCAGACC ATCGAGCGCG AGCTCTCGCT GCCGGTGTAC
GCCACCGTGC CCGACAGCAA GGACGAAGCC GTGCTGTCGC GCGCCATCGC CCGCGGCAAG
ACCGACAAGG GCCAGCTCCT CGCCACCGCC CACCCCGACG ACGACGCCAT GGAGAGCCTG
CGCAGCCTGC GCACCACGCT GCACTTCGCG CTGCTCGGCG CCGAGAAGGG CTCGGTGCTC
ATCACCGGCC CGGCGCCCGG CGTGGGCAAG AGCTTCATCA GCAAGAACCT CGGCGCCGTG
CTCGCCCAGG CCGGCAAGCG CGTCATGCTG GTCGACGGCG ACCTGCGCAA GGGCCACATC
AACAAGGCCT TCGGCATCGG CCGCGGTGTG GGCGTGTCCG ACTACATCAT GGGCGCCGCC
AGCATCGAGC AGATCGTCAA GCCCACCGGC ATCGACAACT TCTCCCTCGT CACCACCGGC
CAGATCCCGC CCAACCCGTC CGAGCTCCTC ATGCACCCGC GCTTCGCCGC GCTGCTCGCC
GAGCTCGAAA AGCAGTGCGA CGTGCTCATC ATCGACGCGC CCCCGGTGCT CGCCGTGTCC
GACGCCGCCA TCATCGGCCG CCAGGTCGGC GCCACCCTCC TGGTCGCCCG CGCCGGCCGC
CACCCGGTGC GCGAGCTCGA GCAGGCCATC AAGCGCTTCG ACCAGGCCGG CGTGGAGGTC
AAGGGATTCG TGTTCAACGG CTTCGACCTC ACCCGACAAC GGCATCGCTT CGGGTACGAG
GGGTATCACT ACCAGTACAA GTACAAGGCG TGA
 
Protein sequence
MRDDRDDFIN LGEIIAVLLE YKWLILAVTF FAVFIGAVVA FVSTPIYRAD GLVQVQDSKG 
PKGGLAALRD VEAVLGENSS VTAELEILRS RMILGRVVER LRLDIRATPD YFPIFGRALA
RRYNGDQPAA PLLGLGSYAW GGERITVQTL EVPSYLVGLP LTLIAGDNGG FTLYDDQDQP
LLDGSVGQPA SSADGKTTLF VAELVARPGT HFDLARISQI QAIAALREDL EVRERARQSN
VIEAAYSDAD RAEAERLLNE VLNAYVRQNV EYRSAEADAT LAFLEKQLPE LKAQLDTAEA
AYNDYRQTRG SVDLTLETQS VLSSIVKVDA DVVELQQKRD ELRQRFTPEH PQVKAIDSQL
GRLRAVRGTL DKDVNRLPDT QQTALRLRRD VEVATALYTN LLNSAQQLRV ARAGTVGDVR
VIDPAATAPL PVAPRKALIL LLSGVLGVLG SLGLVWAIRS LRVVVEDPQT IERELSLPVY
ATVPDSKDEA VLSRAIARGK TDKGQLLATA HPDDDAMESL RSLRTTLHFA LLGAEKGSVL
ITGPAPGVGK SFISKNLGAV LAQAGKRVML VDGDLRKGHI NKAFGIGRGV GVSDYIMGAA
SIEQIVKPTG IDNFSLVTTG QIPPNPSELL MHPRFAALLA ELEKQCDVLI IDAPPVLAVS
DAAIIGRQVG ATLLVARAGR HPVRELEQAI KRFDQAGVEV KGFVFNGFDL TRQRHRFGYE
GYHYQYKYKA