Gene Tmz1t_3281 details


Gene Information

Locus tag: Tmz1t_3281
Symbol:
ID: 7874179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3596072
End bp: 3597616
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 67%
IMG OID: 643700215
Product: polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
Protein accession: YP_002890253
Protein GI: 237653939
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
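
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3596072
END_BP = 3597616
GENE_LENGTH_BP = 1545
PROTEIN_LENGTH_AA = 514


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 3596072..3597616 covers 1545 bp, which is 515 codons, or 514 amino acids plus one stop codon.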


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.308326
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAAC TCGTAACACA GATCATCGGC TACCTGCGCG GCATGTGGCG GTTCCGCTGG 
TGGGGGCTTG CGCTCGCGTG GGTCGCAGGT ATCGCCGGTA GCCTGGCGAT CTACTTCATG
CCGGACCACT ACGAGTCGTC CGCACGCATC CACGTGGATA CCCAGTCGGT GCTGCGCCCG
CTGATGTCCG GGCTCGCCGT CCAGCCCAAC ATCAGCCAGC AGATCGACTT GCTCAGCCGC
ACGCTGATCA GCCGGCCCAA CGTCGAGAAG CTGATCACCA TGGCCGACCT CGACCTCACG
GTGCAGAACG CACAGCAGCG CGAGGCCCTG ATCACCTCGG TGACCAGCCG GCTGCGCATC
CAGTCCGGCC GCGGCGACAA CCTGTTCACG CTCTCGTACG AAGACGTCGA GCCGCATCGT
GCGCAGCGCG TCGTGCAGTC GCTGCTGTCC CTGTTCGTCG AATCCGGTCT CGGCGGCAAG
CGTCAGGATA CCGACGCGGC ACGGCGCTTC ATCGAGGACC AGATCCGCAG CTACGAGCAG
AAGCTGACCG ACGCCGAGAA CCGGCTCAAG GACTTCCGCC TGCGCAACAT GGCGCTGCTG
GGTACCGGCG CGCGCGATTA CGTCACCCAG ATCGCCGAGA CCAACGAGCA ATTGCGCGAA
GCACGCCTGG AACTGCGCGA GGCCGAGAAC TCGCGCGACG CGCTCCGCCA GCAACTCTCC
GCAGCGCCGT CCGATGCGGG CATCGCGCCG CCGCCGATGG TGGCGACGCC CGAGATCGAT
GCCCGCATCG ACCTGCTCAA GAGAAACCTC GACGAGATGT TGCAGCGCTA CACCGAGATG
CACCCGGACG TGGTCGGTGC ACGCCGCGTC ATAGAGGATC TCGAGAAGCA GAAGAAGGCG
CAGCTGGCCG AGCTGAGCGC GAGCATGGCG GGCAGTACCT TCGGTGCTCC CGGCGCCATG
AGTGCGGGGG TCGAGCAGAC CCGGCTCGCG GTGGCGCAGG CCGAGGCCCG GGTCGCGTCG
CTGCGTGCGC GGGTGGCCGA GTTCGAGAGC CGCCTGGCGA GCCTCTCGGA GGACGCCAAG
CGGATCCCCG AGCTCGAGAC CGAGATGGCG CAGCTCAACC GCGATTACAG CGTCCACAAG
AGCAACTACG ACCAGCTGGT ATCGCGGCGG GAGTCGGCGA ACATCGCCGC CGAGATGAGC
ACCCAGTCGG GCATCGCCGA CTTCCGCATC ATCGATCCGC CCACCCTGCC CACCAAACCC
TCGGCTCCCA ACCGGCTGCT GCTGCTGCCG CTCGCCGCGC TGGTGGGTCT GGGGGCGGGG
TTCGCGCTCA CCTTCCTGAT CAGCCAGCTG CGACCGGCGT TCAGCGATGC GCGCCAGCTG
CGCGAGATCA CCGGCCTGCC GGTGCTCGGC ACCGTCTCCA TGCTGACCAC TTCGGAGCGT
CGTCGTCGTC GCCGCAACGG GCTGTTCGCC TTCGGCAGCG GTCTGGTGGC GTACGCCGGG
GTGTTCGCCG CGGTTACGGT CGCGATCGTG TTCCTGCAGA GTTGA
 
Protein sequence
MEELVTQIIG YLRGMWRFRW WGLALAWVAG IAGSLAIYFM PDHYESSARI HVDTQSVLRP 
LMSGLAVQPN ISQQIDLLSR TLISRPNVEK LITMADLDLT VQNAQQREAL ITSVTSRLRI
QSGRGDNLFT LSYEDVEPHR AQRVVQSLLS LFVESGLGGK RQDTDAARRF IEDQIRSYEQ
KLTDAENRLK DFRLRNMALL GTGARDYVTQ IAETNEQLRE ARLELREAEN SRDALRQQLS
AAPSDAGIAP PPMVATPEID ARIDLLKRNL DEMLQRYTEM HPDVVGARRV IEDLEKQKKA
QLAELSASMA GSTFGAPGAM SAGVEQTRLA VAQAEARVAS LRARVAEFES RLASLSEDAK
RIPELETEMA QLNRDYSVHK SNYDQLVSRR ESANIAAEMS TQSGIADFRI IDPPTLPTKP
SAPNRLLLLP LAALVGLGAG FALTFLISQL RPAFSDARQL REITGLPVLG TVSMLTTSER
RRRRRNGLFA FGSGLVAYAG VFAAVTVAIV FLQS
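
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is illustrative and not part of the original record. It builds the codon table by hand rather than relying on a bioinformatics library, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, that difference is assumed not to matter here.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by translation tables 1 and 11,
# listed in the conventional T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence above.
first_blocks = "ATGGAAGAAC TCGTAACACA GATCATCGGC"
print(translate(first_blocks))  # MEELVTQIIG
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in this record.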