Gene Elen_2772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2772 
Symbol 
ID8417098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3215127 
End bp3216497 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID645025747 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_003183108 
Protein GI257792502 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.32493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCTG CCGCTATCGT GTTGGCCGCG GGTGCGGGCA CCCGGATGAA GTCCAAGAAG 
CCGAAGGTGG CTCACGAAGT GCTGGGCAAG CCGCTCGTTC GCTGGGTGGT GGACGCCGCG
CGCGACGCGG GCGTCGAGCG GGTGGTGTCC GTGGTCGGGC ACGCGCGCGA GCAGGTGGAG
CCGCTCGTCG CCGACACGCA GACGGTGGTG CAGGCGGAGC AGAACGGCAC GGCGGGCGCC
GTGGCCGTGT GCAAGGACGC GCTGGCCGAC TTCGACGGCT CGCTCGTGGT GCTGTCGGGC
GACTGCCCCC TTATCACGTC CGAGACTATC GCGCGCCTCG TCGCCGTGCG CGAGGAGGCG
GACGCTGCTG TGGTGGTGCT CACGATGGAG CTGGACGACC CGTTCGGCTA CGGGCGCATC
GTGCGCGACG AGCAGGGCGC GGTGGCGCGC ATCGTGGAGC AGAAGGACGC CGCGCCCGAG
GAAGCCGCCA TCTGCGAGTG CAACTCCGGG TTCTACTGTT TCGACGCCCG CGCGCTGTTC
GCGGCGCTCG AGCAGGTGAG CAACGACAAC GCCCAGGGCG AGTTCTACCT GACCGACGTG
CTTGAGATCT GCCGTAACGC CGGCCGTCCA GTGCTGGCGC TCGTCTGCGA GGACCCTGCC
GAGTGCCTCG GGGTGAACTC GCGCATCCAG TTGGCCGAGG CCACGAAGTT CGCGCAGCGA
CGCATCAACC GCGCGCATAT GGCCGCCGGC GTGACCATGG TGGACCCCGA GCTCGTGTGG
ATCGGGCCCG ACGTGACCAT CGCGCAGGAC GTGGAGCTGC TGCCGAACGT CATGCTCATG
GGCGAAACGA GCATCGGCGA GGACAGCGTC ATCGGCCCCG ATTCGCGCCT GACCGACACC
GCGGTGGGAC GAGGCTGCGT CGTGGACGAG ACGGTGGCCG TGGAGGCGCA GGTGGACGAC
GGCGCCACGT GCGGCCCGCG CGCGTACCTG CGTCCGGCGG CGCACCTGTG CGAGGGCGCG
AAGGCCGGCA CCCACGTGGA GATCAAGAAG TCCACGGTGG GGAAGGGCTC GAAGGTGCCG
CATCTGTCCT ACATCGGCGA CACCACCATC GGCGAGGACG TGAACATCGG CGCCGGCTCC
ATCACCTGCA ACTACGATGG CAAGAAGAAG CATGCCACCA CTATCGGCGA CGGCGCGTTC
GTCGGCAGCG ACACCATGAT GGTGGCCCCG GTGAGCATCG GCGCGGGTGC CATCATCGGC
GCGGGCTCGT GCATTACGAA AGACGTCGCG CCCGACGCCC TGGCCCTCAC GCGCCCCGAG
CAGCGCGAGA TCCCCGGCTG GGCCGCCAAG AAGCGCAGTC AACAAGAATA G
 
Protein sequence
MEAAAIVLAA GAGTRMKSKK PKVAHEVLGK PLVRWVVDAA RDAGVERVVS VVGHAREQVE 
PLVADTQTVV QAEQNGTAGA VAVCKDALAD FDGSLVVLSG DCPLITSETI ARLVAVREEA
DAAVVVLTME LDDPFGYGRI VRDEQGAVAR IVEQKDAAPE EAAICECNSG FYCFDARALF
AALEQVSNDN AQGEFYLTDV LEICRNAGRP VLALVCEDPA ECLGVNSRIQ LAEATKFAQR
RINRAHMAAG VTMVDPELVW IGPDVTIAQD VELLPNVMLM GETSIGEDSV IGPDSRLTDT
AVGRGCVVDE TVAVEAQVDD GATCGPRAYL RPAAHLCEGA KAGTHVEIKK STVGKGSKVP
HLSYIGDTTI GEDVNIGAGS ITCNYDGKKK HATTIGDGAF VGSDTMMVAP VSIGAGAIIG
AGSCITKDVA PDALALTRPE QREIPGWAAK KRSQQE