Gene Elen_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1601 
Symbol 
ID8415900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1900325 
End bp1901446 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID645024570 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_003181958 
Protein GI257791352 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.116766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000192117 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGCA GGAAGAGCGT GCTCATCGCA TTCGGCACGC GCCCGGAAGC GGTTAAGATG 
GCGCCGGTTT TGAACGCGCT CGAAGCCGAC AAGGCGTTCG AGGTCGAGGC GCTCTCGACG
GCTCAGCACC GCGAGATGCT CGACCAGACG GCGGCTGCGT TCGACTTGCG CATCGATCAT
GACCTGAACC TCATGCGCGA CCGTCAGACG CTCGACGGGC TCACGGCGCG CATTCTGACC
TCCGCGACCC CCGTGCTGGA GCTCGCGGCG CCCGACGCAG TGCTCGTGCA CGGCGACACC
ACCACTGCTC TCGCGCTGGC GCTCGCGTCG TTCTACCGAC AGATCCCCGT GGGGCACGTG
GAGGCGGGCC TGCGATCGGG AAACCGGTAC TCCCCCTTTC CCGAGGAGAT GAACCGTCGT
CTTATCTCAA GCCTCTCCAC CTGGCACTAC GCCCCCACCG AAGGCAACCG AGCAAACCTC
CTCGCGCAAG GCGTCGATCC GGATAGCATC CGGGTAACGG GCAACACCGT CATCGACGCG
CTCCTTGAAA CGGTAAGCTC TTCCTACCGC TTCGAGGAGC CGGCTCTCGA AGCGGTCGAT
TTCGACGGAT TGCGCGTCTT AGGGTTGACG TGCCATCGGC GCGAGAACCT GGGCGAACCG
ATGGAGCGCA TGTTCGCTGC AGTACGCGAC ACGGCAGACG CCCACGCTGA CGTGCGCGTG
GTGTACCCCG TGCACAAGAA CCCCGCCGTG CGCGCGATAG CCGAGAGGGC GCTGGGCGGG
CATCCGCGCA TCACCCTCGT CGAACCGCTC GGATACGCCG ATTTCTCCAA CTTCATGGCA
CGATGCGCCT TCATGCTGAC GGATTCGGGC GGCGTGCAGG AGGAAGCACC GGCCCTCGAC
GTGCCCGTGC TCGTGCTGCG CGACGATACC GAGCGTCCCG AAGCGCTTGA AGCGGGCTGC
GCCGCCCTGG TCGGCACGAC GTACGAAGGC GTGCGCGCAT CGTGCGAGGC GCTGTTGGGC
GATGCGGCGC TGTACGCGCG CATGGCGGCC GCTCCCGACC CCTACGGAGA CGGGCATGCG
GCCGAGCGCA TATGCAGCGC ACTCCATGAC GCCTTCGACT AG
 
Protein sequence
MKRRKSVLIA FGTRPEAVKM APVLNALEAD KAFEVEALST AQHREMLDQT AAAFDLRIDH 
DLNLMRDRQT LDGLTARILT SATPVLELAA PDAVLVHGDT TTALALALAS FYRQIPVGHV
EAGLRSGNRY SPFPEEMNRR LISSLSTWHY APTEGNRANL LAQGVDPDSI RVTGNTVIDA
LLETVSSSYR FEEPALEAVD FDGLRVLGLT CHRRENLGEP MERMFAAVRD TADAHADVRV
VYPVHKNPAV RAIAERALGG HPRITLVEPL GYADFSNFMA RCAFMLTDSG GVQEEAPALD
VPVLVLRDDT ERPEALEAGC AALVGTTYEG VRASCEALLG DAALYARMAA APDPYGDGHA
AERICSALHD AFD