Gene Elen_1105 details


Gene Information

Locus tag: Elen_1105
Symbol:
ID: 8415395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1336616
End bp: 1337905
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 67%
IMG OID: 645024068
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_003181465
Protein GI: 257790859
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
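
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1336616
end_bp = 1337905
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3

print(span == gene_length_bp)            # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)           # length is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop equals protein length
```

All three checks hold for this record: 1337905 - 1336616 + 1 = 1290, and 1290 / 3 = 430 codons, one of which is the stop codon.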


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00159762
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000000149564
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGCTGAAG AGATCATCAT CGTACAGGGC AACAACACGC TGGCCGGCGA CGTCGCCGTG 
TCGGGCGCGA AGAACTCGGC GCTCAAGCTG ATCGCCGCCG CGTTGCTGGG CCAGGGGGAA
ACGACCATCC ATAACGTCCC GCTCATCTCG GATATCAGCA TCATGTCCGA CGTGCTGCGC
TGTTTGGGCG CGCGGGTGGA GCGCGACGGC CATACGCTGA CGGTGGACAC CTCCGCCGTC
GACAAGCACG AGACCCCCTA CGAGCTGGTG TCGAAGATGC GTGCGAGCAT CTCGGTGCTC
GGCCCGCTCG TTGGCCGCTT CGGATGCGCC CGCGTGGCCA TGCCGGGCGG CTGTCAGATC
GGCGCTCGCA AGATCGACAT GCATCTTGTC GGCCTCGAGG CTATCGGCGT GGAGTTCTCC
ATCGACCACG GCTTCCTTGA GGCCAGCACG CCCAACGGGC TGCACGGCGC GCACGTCGTA
CTCGACTTCC CCAGTGTGGG AGCAACCGAG AACCTGCTCA TGGCCGCCGT GGCGGCCGAG
GGCTCCACGG TCATAGAGAA CGCCGCGCGC GAACCCGAGA TCGTGGATCT TGCGAACATG
CTCGTGTCCA TGGGGACGCG CGTCACAGGC GCGGGCTCGG ACATCATCGA GGTCGAGGGC
GTGCCGCTTT CTTCGCTGCA TCCTTGCGAG CACACGACGG TGGGCGACCG CATCGAGGCC
GGCACCTTCC TGGCCGGAGG CGCGCTGACG GGCGGTCCCG TCACCGTGCA CGGCATCGAT
CCGTCGTATC TGCGCGTTGC GCTCATGAAG CTCCGCGCCA TGGGCTGCGA CGTGGAGACG
GGCGACGACT GGATCACCGT GGGCCGCACG CGCCCGCTGG CTCCCATCGA CTTGCAGACG
CTGCCGCATC CCGGGTTTCC CACCGACCTT CAGGCGCAGT TCATGCTGCT GGCCGCGTTC
GCCGACGGCA TGTCCGTCAT CACCGAGAAC GTGTTCGAGA ACCGTTTCAT GTTCGCCAGC
GAGCTCATGC GCATGGGCGC GGACATCGCC ATCGAGGACC ATCACGCCCT CGTGCGCGGC
GTGGACGTGC TGCAGGGCGC CGACGTGTCG TCCACCGATC TGCGCGCCGG GGCGGCTCTC
GTGCTGGCCG GCATTGCGGG GGAGGGCGAG ACTCGCGTGC ACAACATCGG CCATATCGAT
CGCGGTTACG AGGACTACGT GGGCAAGCTG CGCGCTCTCG GCGCCGATGT CGTGCGCGCT
GAAACCGAGC AAGCCCAGAC GTCCCGGTAG
 
Protein sequence
MAEEIIIVQG NNTLAGDVAV SGAKNSALKL IAAALLGQGE TTIHNVPLIS DISIMSDVLR 
CLGARVERDG HTLTVDTSAV DKHETPYELV SKMRASISVL GPLVGRFGCA RVAMPGGCQI
GARKIDMHLV GLEAIGVEFS IDHGFLEAST PNGLHGAHVV LDFPSVGATE NLLMAAVAAE
GSTVIENAAR EPEIVDLANM LVSMGTRVTG AGSDIIEVEG VPLSSLHPCE HTTVGDRIEA
GTFLAGGALT GGPVTVHGID PSYLRVALMK LRAMGCDVET GDDWITVGRT RPLAPIDLQT
LPHPGFPTDL QAQFMLLAAF ADGMSVITEN VFENRFMFAS ELMRMGADIA IEDHHALVRG
VDVLQGADVS STDLRAGAAL VLAGIAGEGE TRVHNIGHID RGYEDYVGKL RALGADVVRA
ETEQAQTSR
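
The sequences above are laid out in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence so it can be compared with the listed protein. It is an illustration, not the tool used to produce this record: the function names are hypothetical, and it assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code, differing only in which start codons are permitted.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGGCTGAAG AGATCATCAT CGTACAGGGC AACAACACGC TGGCCGGCGA CGTCGCCGTG"
print(translate(clean_sequence(first_row)))  # → MAEEIIIVQGNNTLAGDVAV
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value that can be compared with the GC content field.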