Gene Tmz1t_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3271 
Symbol 
ID7874492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3582717 
End bp3585023 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content71% 
IMG OID643700205 
Productglycosyl transferase group 1 
Protein accessionYP_002890243 
Protein GI237653929 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCC TGCACATCCT CGATCACTCG ATCCCGCTGC ACAGCGGCTA CACCTTCCGT 
ACCGCGGCGA TCCTGCGCGA GCAGCGCGCG CTCGGCTGGG AGACCTTCCA CCTGACCTCG
CCCAAGCAGG GCGAGACGAA GGCCATGGTG GAGGAGATCG AGGGCTTGCG TTTCCACCGT
ACCGGCGTGC CGACGCCGGC GAGCAGCGGC ATCGCCGAGC TGCGCCAGAT CCGTGCGGTG
CAGGCGCGCA TCGAGCAGCT CGCGGCCGAA CTGCGCCCCG ACATCCTCCA CGCCCACTCG
CCGGTGCTCA ACGCCATTCC GGCCATCCGC GCCGGGCGCA AGCTCGGCAT CCCGGTCGTC
TACGAGATCC GCGCCTTCTG GGAAGACGCC GCGGTCGACC ACGGCACCAC CACCGAGGGC
AGCCTGCGCT ACCGCGCCAC CAAGGCGCTC GAGACCTGGG CGATCAAGCG CGCCGACCAC
GTCTTCACCA TCTGCGAAGG CCTGCGCGCC GACATCGTCG GCCGCGGCAT CCCGGCGGCC
AAGGTCACGG TCATCCCCAA CGCGGTCGAC ATCGAGTCCT TCCAGCTCTC GGGCGACGCC
GACCCTGCCT TGCGGGAGCA GCTCGGCCTC GCCGGCACCA CCGTGGTGGG CTTCGTCGGC
TCCTTCTACG CCTACGAGGG CCTCGATCTG CTGCTCGAGG CCTTCCCCGC GCTGCTGCAG
AAGCGCCCCG AGCTGCGCCT GCTGCTCGTC GGCGGCGGGC CGCAGGACGA AAACCTCAAG
GCTCAGGCCC TGCGCCTCGG CGTGGCCGAC AAGGTCGTCT TCACCGGGCC GCGTGCCGCA
CAAGGACGTC AGCCGCTACT ACGACCAGAT CGACCTGCTC GCCTATCCGC GCCACTCCAT
GCGACTCACC GAGCTCGTCA CCCCGCTCAA GCCGCTCGAG GCCATGGCGC AGGGTCGGCT
CTTCGTCGCC TCCGACGTCG GCGGCCACAA GGAGCTGATC CGCGACGGCG AGACCGGCAG
GCTGTTCAAG GCCGGCAGCG CCGAGGCGCT CGCCGCCGCC ATCGACGACC TGCTCGCGCA
CCGCGAGCGC TGGCCGGCGA TGCGCGCGGC GGGGCGGCAG TTCGTCGAGG ACGTGCGCAA
CTGGACGAAC AGCGTGGCGA ATTACACGCC GGTGTATCGC AGCCTGGTCG CGAAGCCTCG
CGCTGCGGCA TGAATTTCTC CGCGCTGCGC GTCCTGCTCG TCGGGCCGCT GCCGCCGCCT
GCCGGCGGCA TGGCCAACCA GACCCGCCAG CTCGCCGAGC TGCTGCGCGG CGAGGGCGCG
TCGGTCGAGG TCGTGCAGGT CAATGCGCCG TATCGTCCGG CTTGGGTCGA AAGGCTGAAG
GGCGTGCGCG CGCTGTTCCG GCTCCTGCCC TACCTGCTGC GGCTGTGGCA GGCCGCAGGG
CGGGCGAACC TCATGCATGT GATGGCCAAC TCCGGCTGGT CCTGGCATCT CTTCGCCGCG
CCGGCGGTGT GGATCGGCTG GCTGCGTGGC GTGCCGGTGG TGGTCAATTA CCGCGGAGGC
GAGGCGGCCG GCTTCCTTGC CCGCTCGGCG GCGGTCGTGC GTGCCACCCT GCGCCGCGCG
TCCAGCCTGG TGCTGCCGTC CGGCTTCCTG CTCGAAGTCT TCGCGCGCCA TGGCATGGCC
GGGCGCATCG TCTCCAACAT CGTCGACCTC GAGCGTTTCC ATCCCGCGAC GGCGCCGCGC
GCGGCCGGCG AGGGCCCGCA TCTGGTCGTC GCCCGCAACC TCGAAGCCCT CTACGGCAAC
GACACCGCGC TGCGCGCCTT CGCGCGGCTG TGCGCGGCCT GGCCGGCGGC GCGCCTCTCG
ATCGCCGGCA GCGGGCCGGA GGCCGCGGCG CTGGCCGCGC TCGCGCAGGA ACTCGGCATC
GCCGAGCGGG TCCGCTTCAC CGGTCGCCTG GACGGCGAGC AGATGGCTGC GCTCTACCGT
GACGCAGATC TCATGCTCAA CCCGAGTCGG GTGGATAACA TGCCCAATGC CATCCTCGAG
GCGCTGGCGA GCGGCCTGCC GGTGGTCACC ACCGACGTCG GCGGCATTCC CTTCATCGTC
ACGCAAGGCC ACACGGCCAT GCTGGTTCCA CCCGACGATC CGCAGGCGAT GGCCGACGCC
GCGCACAAGG TGCTTGCCGA CACCGGCCTG CGCGCCGCGC TGGTCGCCGC CGGGTGCGCC
GAGGTGCAGC GCTACCGCTG GAGCTCGGTG CGCGCACAAC TGCTGTCCGC CTACGCGGAT
GCGATTGCCG CGAAGAGGGC CACATGA
 
Protein sequence
MRILHILDHS IPLHSGYTFR TAAILREQRA LGWETFHLTS PKQGETKAMV EEIEGLRFHR 
TGVPTPASSG IAELRQIRAV QARIEQLAAE LRPDILHAHS PVLNAIPAIR AGRKLGIPVV
YEIRAFWEDA AVDHGTTTEG SLRYRATKAL ETWAIKRADH VFTICEGLRA DIVGRGIPAA
KVTVIPNAVD IESFQLSGDA DPALREQLGL AGTTVVGFVG SFYAYEGLDL LLEAFPALLQ
KRPELRLLLV GGGPQDENLK AQALRLGVAD KVVFTGPRAA QGRQPLLRPD RPARLSAPLH
ATHRARHPAQ AARGHGAGSA LRRLRRRRPQ GADPRRRDRQ AVQGRQRRGA RRRHRRPARA
PRALAGDARG GAAVRRGRAQ LDEQRGELHA GVSQPGREAS RCGMNFSALR VLLVGPLPPP
AGGMANQTRQ LAELLRGEGA SVEVVQVNAP YRPAWVERLK GVRALFRLLP YLLRLWQAAG
RANLMHVMAN SGWSWHLFAA PAVWIGWLRG VPVVVNYRGG EAAGFLARSA AVVRATLRRA
SSLVLPSGFL LEVFARHGMA GRIVSNIVDL ERFHPATAPR AAGEGPHLVV ARNLEALYGN
DTALRAFARL CAAWPAARLS IAGSGPEAAA LAALAQELGI AERVRFTGRL DGEQMAALYR
DADLMLNPSR VDNMPNAILE ALASGLPVVT TDVGGIPFIV TQGHTAMLVP PDDPQAMADA
AHKVLADTGL RAALVAAGCA EVQRYRWSSV RAQLLSAYAD AIAAKRAT