Gene Tmz1t_3904 details


Gene Information

Locus tag: Tmz1t_3904
Symbol:
ID: 7873552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4302505
End bp: 4303629
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 66%
IMG OID: 643700843
Product: glycosyl transferase group 1
Protein accession: YP_002890866
Protein GI: 237654552
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
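
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical and not part of the database):

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(4302505, 4303629)
print(length)                     # 1125
print(protein_length_aa(length))  # 374
```

Both results agree with the Gene Length and Protein Length fields of this record.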


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCATCG CCTTCCTCTG CAAACGCCGC TACATGGGCA AGGACGTGAT CCTCGACCGT 
TACGCCCGGC TCTATGAGAT TCCTCGCCAG CTTGCCCATC TGGACAACGA AGTGGGCGCT
TTCTGCTTGG ACTACCACGC AGCGGACACC GACGGTTGCT GGGAACATGA GGCAGCGCCG
GGCAGGCTGA GATGGCATTC GCTTTCGGTC GGAAGAACCC GTCTGCCCAG GCTGGCGGCC
TACCCTTGGC ATTTGCTGCG GCAACTGCGC GCCTTCAAGC CCGACATCCT GGTGGGTGCC
TCCGATATTC CCCACGTGGT GCTGGCACGG TGGCTGGCCA GGCGCTTGCA AGTTCCCTAC
GCAATAGACC TCTATGACAA TTTCGAAGGC TTCGGCCAGG CCCGTATTCC CGGCTTCGTG
CCGGCACTGC GCCGCGCCGT GCGCGACGCA ACTGTCGTAA CCACGACCAG CGAACCGCTT
CGCCAGAAGG TGCTGGCCGA CGGCGCCCGG GGCACCGTCA TCGCCATGCC CAGCAGCGTG
GACCTTGCGG TCTTTCACCC CGGCGACAAG GCGCAGGCCC GCCAGGCCCT GAGCCTGCCG
CAGGACGGCA AACTGGTCGG CACGGCCGGT GGCCTGTACC GGGAAAAAGG CATCGAGCCA
CTGTACGCCG CCTGGCCAGC GCTCGCAGCC AGCCGCCCCG ACGTGCATCT GGTGCTGGCC
GGCCCACTGG AAAACGGCTT CGCGCCTCCA CAGGGCGAGC GCGTGCACTA CCTCGGTCAC
CTCGCACACG GCCAGATCGC CAACCTGTTT CGTGCGCTGG ATGTGGGCAT CATCTCCATC
CTCGACACCC CCTTCGGCCG CTACTGCTTC CCGCAGAAGG CGTACGAAAT GCTTGCCTGC
AAGCTACCGG TCGTGGCCAC CGCCATCGGG CAGATGCGCG AAGTGTGCGC CAGCACGCCG
CAGGCGCTTT TTGCCCCAGG CGATTCGACG GCACTCACCC GCGCCGTGCT GTGGCAATTG
CAGTCGGGTT CCACGCCAGC CGTGCCCATC GCCGACTGGA AGACGCTGAT TGGCAGCATC
GAACCCGTGC TGAAGAGGCA GTCAGGAGCG GCAGCAGGGG GATAA
 
Protein sequence
MRIAFLCKRR YMGKDVILDR YARLYEIPRQ LAHLDNEVGA FCLDYHAADT DGCWEHEAAP 
GRLRWHSLSV GRTRLPRLAA YPWHLLRQLR AFKPDILVGA SDIPHVVLAR WLARRLQVPY
AIDLYDNFEG FGQARIPGFV PALRRAVRDA TVVTTTSEPL RQKVLADGAR GTVIAMPSSV
DLAVFHPGDK AQARQALSLP QDGKLVGTAG GLYREKGIEP LYAAWPALAA SRPDVHLVLA
GPLENGFAPP QGERVHYLGH LAHGQIANLF RALDVGIISI LDTPFGRYCF PQKAYEMLAC
KLPVVATAIG QMREVCASTP QALFAPGDST ALTRAVLWQL QSGSTPAVPI ADWKTLIGSI
EPVLKRQSGA AAGG
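
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, assuming the blocks are pasted into a Python string, they can be joined into one continuous sequence and checked for length and GC fraction (the helper names are hypothetical):

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used only as a short demonstration.
first_line = "ATGCGCATCG CCTTCCTCTG CAAACGCCGC TACATGGGCA AGGACGTGAT CCTCGACCGT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))
```

Running the same two functions on the full gene sequence would give values to compare against the Gene Length and GC content fields.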