Gene Tmz1t_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3803 
Symbol 
ID7874045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4196194 
End bp4197330 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content66% 
IMG OID643700745 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_002890769 
Protein GI237654455 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TCAAAGTCAT GTCCGTGGTC GGCACCCGGC CCGAAATCAT CCGCCTGTCG 
CGCGTGCTGG CGGCGCTCGA CGAGCACTGC GAGCACGTAC TGGTGCACAC CGGTCAGAAC
TACGACTACG AGCTCAACCA GGTCTTCTTC GACGACCTGG GGGTGCGCAA GCCGGATCAC
TTCCTCAACA GCGCCGAGGG CAGCACCGGC GCGGCGCACA CCATCGGCAA CCTGATCATC
GCGGTCGACC GCGTGCTGGG CGAGGTGCAG CCCGAGGCCA TGCTGGTGCT GGGCGACACC
AATAGCTGCC TGTCGGTGAT CCCGGCCAAG CGGCGCAAGA TCCCGATCTT CCACATGGAG
GCGGGCAATC GCTGCTTCGA CCAGCGCGTG CCGGAAGAGA CCAATCGCCG CATCGTCGAC
CACACCGCCG ACATCAACCT CACCTACAGC ACCATCGCGC GCGACTACCT GCTGCGCGAG
GGCCTGCCGC CCGACCAGGT GATCAAGACC GGCAGCCCGA TGTTCGAGGT GCTGACGCAC
TATCGCCCGC GCATCGAGGC GTCGGACGTG CTGCAGCGCC TGGCGCTGGA GGCGGGGCGC
TACTTCGTGG TGAGCGCGCA CCGGGAAGAG AACATCGAAT CCGAGAAGTC CTTCACCAAG
CTGGTGGCGG TGCTCAACGC AGTGGCGGAA GACCACGGCC TGCCGGTGAT CGTGTCGACC
CACCCGCGCA CGCAGAAGCG CGTGGATGCC ACCGGCGCGA AGTTCCACCC GATGGTGCGG
CTGCTCAAGC CGCTGGGCTT TCACGACTAC GTGAAGCTGC AGCTTTCGGC CAAGGCGGTG
CTGTCGGACA GCGGCACGAT CAACGAGGAG TCGTCGATCC TCAACTTCCC GGCGCTGAAC
CTGCGCGAGG CGCACGAGCG GCCGGAGGGC ATGGAAGAGG CGGCGGTGAT GATGGTGGGG
CTGGAGGTCG ACCGGGTGCG CCAGGGGCTG GCGGTGCTCG CGTCGCAGTC GCGCGGTGAG
GAACGCAGCC TGCGCCAGGT GGCCGACTAC AGCATGCCGA ACGTGTCGGA CAAGGTGGTG
CGCATCATCC ACAGCTACAC GGATTACGTG AAGCGGGTGG TGTGGAGGCA GTACTAA
 
Protein sequence
MKKLKVMSVV GTRPEIIRLS RVLAALDEHC EHVLVHTGQN YDYELNQVFF DDLGVRKPDH 
FLNSAEGSTG AAHTIGNLII AVDRVLGEVQ PEAMLVLGDT NSCLSVIPAK RRKIPIFHME
AGNRCFDQRV PEETNRRIVD HTADINLTYS TIARDYLLRE GLPPDQVIKT GSPMFEVLTH
YRPRIEASDV LQRLALEAGR YFVVSAHREE NIESEKSFTK LVAVLNAVAE DHGLPVIVST
HPRTQKRVDA TGAKFHPMVR LLKPLGFHDY VKLQLSAKAV LSDSGTINEE SSILNFPALN
LREAHERPEG MEEAAVMMVG LEVDRVRQGL AVLASQSRGE ERSLRQVADY SMPNVSDKVV
RIIHSYTDYV KRVVWRQY