Gene Tmz1t_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1123 
Symbol 
ID7084652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1231052 
End bp1232158 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content62% 
IMG OID643698138 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_002354778 
Protein GI217969544 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.362937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCTG AGTATCTGGT CGACATTATT GCAGGCGCGC GCCCCAATTT TATGAAGATC 
GCGCCGATTA TTCGTGCGTT CGAGGCCCGC AAGGCAGCGG GTGGTGCGCT GCGGTTTCGC
CTGATCCACA CCGGCCAGCA TTACGATCCG CGCATGTCGG GCGAGTTTTT CCGTCAACTC
GGCATTCCCG AGCCCGATGT GAACCTGGAG GTGGGTTCGG GCACTCAAGC GGAGCAGACC
GGCGCGATCA TGTCGCGTTA CGAGGCGCTG CTGCTGGAGA AGCCCAGCAA TCTTTGCCTG
GTGGTGGGCG ACGTCACGTC GACCATGGCG TGCGCGATCG CCGCGCAGAA GCTGCGCATT
CCGGTGGCGC ACGTGGAGGC GGGTATCCGT TCGGGCGACT GGACGATGCC GGAAGAGATC
AACCGCATGG TGACGGATTC GATCACCAAC TGGTTTTTCA CGACCAGCGA GGTGGCGAAT
GAAAACCTGC GCCGCACCGG GGTGAGCGAC GATCGGATCT TCTTCGTCGG CAACACCATG
ATCGACACGC TGCTGGTAAA TCTGCCGCGC CTGCAGAAGC CGGAGTTCTG GGACGAGCTT
GGTCTGAAGG CGGGCGAGTA TTTCGTTGTG ACGCTGCATC GGCCGGCCAA CGTGGATAAG
GGCCATGGTT TCGCCCGCCT GCTGGCGGCG ATCGGCGAAG GTACGCGCGG CTTGCCGGTG
GTGTTCCCGG TTCATCCGCG TACGGCAAAG ACGCTGCGGG ATTTGAATGA GGTTCCGGCC
AATTTCCGCC TGGTCGATCC GCAGCCTTAT CTTGAATTCA ACTATCTGGT GAAGAACGCC
AAGGCGGTGA TCACCGATTC GGGCGGCATC ACCGAAGAGA CGACGGTGAT GGGCGTGCCC
TGCATGACCT TGCGCGACAA CACCGAGCGC CCGGAAACGG TGACGACCGG CACCAACGAG
CTGATCGGCA CCAACCCCGA TGCGCTGGCG CCGGCATTGG AGAAACTGTT TGCCGGGCAG
TGGAAGAAGG GCGGCATTCC GCCGCTGTGG GATGGCAAGA CGGGCGAGCG CATCGTCGCC
GAGCTTGAAA GGCTGCTTGT CGCATGA
 
Protein sequence
MAAEYLVDII AGARPNFMKI APIIRAFEAR KAAGGALRFR LIHTGQHYDP RMSGEFFRQL 
GIPEPDVNLE VGSGTQAEQT GAIMSRYEAL LLEKPSNLCL VVGDVTSTMA CAIAAQKLRI
PVAHVEAGIR SGDWTMPEEI NRMVTDSITN WFFTTSEVAN ENLRRTGVSD DRIFFVGNTM
IDTLLVNLPR LQKPEFWDEL GLKAGEYFVV TLHRPANVDK GHGFARLLAA IGEGTRGLPV
VFPVHPRTAK TLRDLNEVPA NFRLVDPQPY LEFNYLVKNA KAVITDSGGI TEETTVMGVP
CMTLRDNTER PETVTTGTNE LIGTNPDALA PALEKLFAGQ WKKGGIPPLW DGKTGERIVA
ELERLLVA