Gene Tmz1t_3769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3769 
Symbol 
ID7873766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4153555 
End bp4154679 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content70% 
IMG OID643700713 
Productglycosyl transferase group 1 
Protein accessionYP_002890737 
Protein GI237654423 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCC TCAAGGTCCT GCAACTTCTC CCCGCGCTGG ACAGCGGCGG CGTCGAGCGT 
GGCACGCTCG AGATCGCGCG CGCGCTGGTC GCGGCCGGGC ACGAATCGGT GGTGCTGTCC
AGGGGCGGGC GCCTGGTCGG GCAGTTGCAG GACGAAGGCT CGCGCCACCT CGCGCGCGAC
CTCGGGCGCA AGTCGCCGAC CACCTTCCTG CACTACCGCG CGCTGCGCAG GCTCTTCGAG
GCCGAGCGCT TCGACATCGT GCACGCGCGC TCGCGCCTGC CGGCCTGGGT CGCCTGGCTC
GCCTGGCGCG GCATGCCGGC CGACGCCCGC CCACGCTTCG TCACCACGGT GCACGGCATG
CACTCGGTCA GCCGCTACAG CGCCATCATG TGCGCGGGCG AGCGCGTGAT CGCGGTCAGC
GACACGGTGC GCGACTACAT TCGTACCCAT TACCCGCCGT CGCGCTGGCC GCACCTGGCC
GATGAGCACA TCACGGTGAT CCCGCGCGGT ATCGACCCGG CGGAGTTTCC GCGCGACTAC
CAGCCTTCAG ACGAATGGCT GGCGCGCTTC CATGCCGAGT TTCCGCAGCT TGGCCAGCGC
AAGGTGCTGA CGCTGCCGGG GCGGCTCACG CGGCTGAAGG GACATCACGA CTTCATCACC
CTCATCGGCA AGCTGGTCGC GGACGGACTG GACGTGGTCG GGCTGATCGT CGGCGGCGAG
GACCCGAAGC GGCCCGGCTA CGCGAAGGAG ATCCGCGAAC GGGTGCAGGC GGAGGGGCTA
GGGGAACGCA TCCTCTTCAC CGGTCACCGC AGCGACGTGC GCGAGATCTA CGCGATCTCG
GACTGCGTGC TGAGCCTGTC CTCCACGCCC GAATCCTTCG GGCGCACCGT GCTGGAGCCG
CTGGCGATGG GGCGGCCGGT GGTGGGGTAT GCGCATGGGG GGGTGGCGGA GATCCTGGGC
GAGGTGTTCC CGCATGGGGC GGTGGCGAAG GGGGACGTGG CGGCCGCGAC AAAGCGGGCC
GGGGACGTGG TCGCCGGACG GACGCCGGTG GTGGAGTTCA ACACGCGCTT CCTGCTCGAG
CGCATGCAGG CGCAGACGCT GGCGGTGTAT GGAGCGCTCG CATGA
 
Protein sequence
MKALKVLQLL PALDSGGVER GTLEIARALV AAGHESVVLS RGGRLVGQLQ DEGSRHLARD 
LGRKSPTTFL HYRALRRLFE AERFDIVHAR SRLPAWVAWL AWRGMPADAR PRFVTTVHGM
HSVSRYSAIM CAGERVIAVS DTVRDYIRTH YPPSRWPHLA DEHITVIPRG IDPAEFPRDY
QPSDEWLARF HAEFPQLGQR KVLTLPGRLT RLKGHHDFIT LIGKLVADGL DVVGLIVGGE
DPKRPGYAKE IRERVQAEGL GERILFTGHR SDVREIYAIS DCVLSLSSTP ESFGRTVLEP
LAMGRPVVGY AHGGVAEILG EVFPHGAVAK GDVAAATKRA GDVVAGRTPV VEFNTRFLLE
RMQAQTLAVY GALA