Gene GM21_1841 details


Gene Information

Locus tag: GM21_1841
Symbol:
ID: 8137172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2142939
End bp: 2144030
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 60%
IMG OID: 644869452
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_003021652
Protein GI: 253700463
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
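
The start and end coordinates, gene length, and protein length listed above are consistent with one another. A minimal sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. Neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2142939
end_bp = 2144030

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1092 363
```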


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.000516167
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGTCC TTCTTATCGC CGGTGCCAGG CCCAATTTCA TGAAGATCGC TCCTATTTAC 
CGCGCTTCCC TTGGCTATCC CTCGGTCGCA TGCAGCATCG TCCACACAGG ACAGCATTAC
GACAAGGAGA TGTCCGGCAC CTTCTTCGAT GAACTGGAAA TCCCCGAACC GCGCTATTCG
CTGAACGTCG GCTCGGGAAG CCATGCGGAG CAAACCGCAG CCATCATGGT CGCCTTCGAG
GAGGTCTGCC GGCAGGAGTC GCCCGACCTC GTGCTGGTGG TGGGGGACGT GAATTCCACA
CTTGCCTGCA GCATCGTGGC GAAAAAATGC GGGGTTTCAG TGGCGCACGT CGAGGCCGGA
TTACGGAGCT TCGACCTGTC CATGCCGGAG GAGATCAACC GCATGGTGAC CGACGCCATA
TCCGACAGCT TCTTCGTTAC CGAGGAAAGC GGCGTAGAGA ACCTGCTGAG GGAAGGAAAG
AAACCGGAAC GGATTCATGA GGTGGGGCAT GTCATGATCG ACAACCTGTT GCGCCAGGTG
AAGCTTCTGG AGGGGATCGA CCCCACGAGC TTCGATAGCC ACCGTCTCAG GAAGGGGGCG
GGAAGGTACC TCTTTCTCAC CCTGCACCGC CCCTCCAATG TGGACAGCAG GGAGGCGTTC
GCGGGGATCG CCGAGGCCGT CAACGAGTTG GCCCGTCAAA GGACCATCTT CTTCCCGGTC
CATCCTCGCA CCAGAAATAT GATGAGCGCG CACGGCATCG AGTTGAGCGA CAAGGTGGTC
CTACTGCCGC CGCTTGGTTA TCGGGAGGCG CTTTTTCTCT GGAAGGACGC CGAAGCTGTT
CTTACCGACA GCGGAGGCCT CCAGGAGGAA ACCACCGCGC TGGGGGTCCC GTGCGTGACC
ATACGGGAGA ACACCGAGCG TCCCGTCACT GTAGAGATCG GGACCAATGT CCTCGCCGGC
ACAGCACCTG AAAAAATCCT CGCGGGGTAT CGCCTAAGCC TGGAGAAGCG GGGCCGGGCC
AGGGTGCCGC AGTTGTGGGA CGGCAGGGCC GCCGAGCGCA TCTGGAAGGT ATTGGCTGGA
GAAAGTCGAT GA
 
Protein sequence
MNVLLIAGAR PNFMKIAPIY RASLGYPSVA CSIVHTGQHY DKEMSGTFFD ELEIPEPRYS 
LNVGSGSHAE QTAAIMVAFE EVCRQESPDL VLVVGDVNST LACSIVAKKC GVSVAHVEAG
LRSFDLSMPE EINRMVTDAI SDSFFVTEES GVENLLREGK KPERIHEVGH VMIDNLLRQV
KLLEGIDPTS FDSHRLRKGA GRYLFLTLHR PSNVDSREAF AGIAEAVNEL ARQRTIFFPV
HPRTRNMMSA HGIELSDKVV LLPPLGYREA LFLWKDAEAV LTDSGGLQEE TTALGVPCVT
IRENTERPVT VEIGTNVLAG TAPEKILAGY RLSLEKRGRA RVPQLWDGRA AERIWKVLAG
ESR
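
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it, such as the GC content reported in the Gene Information table. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed on the page.
first_line = "ATGAACGTCC TTCTTATCGC CGGTGCCAGG CCCAATTTCA TGAAGATCGC TCCTATTTAC"
seq = clean_sequence(first_line)

# Report the cleaned length, the start codon, and the GC fraction of this line.
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare against the listed GC content, which the page appears to round to a whole percent.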