Gene Elen_2010 details

Gene Information

Locus tag: Elen_2010
Symbol:
ID: 8416321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2355956
End bp: 2357104
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 69%
IMG OID: 645024987
Product: glutamate 5-kinase
Protein accession: YP_003182363
Protein GI: 257791757
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0263] Glutamate 5-kinase
TIGRFAM ID: [TIGR01027] glutamate 5-kinase
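
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the gene length includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2355956
end_bp = 2357104
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which codes for no residue.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # coordinates agree with the stated gene length
print(gene_length_bp % 3 == 0)          # the CDS is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon = residues
```

Under these assumptions all three checks hold: the span is 1149 bp, which is 383 codons, giving 382 residues plus one stop codon.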


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00026672
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000558815
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCCGG CTGGATGCGC GCATGCCGCC GATCATGGCA AGCGCCTCGT GATAAAAATC 
GGGTCGTCCA CGCTCACCAC GTCGGAGAGC AAGATCGACT ACGCCTACCT CGCGGAGGTG
ACCGACCAGG TCGCGCGCGT GCGCGCCGCC GGTTGGCGCC CCATCGTCGT CACCTCGGCC
GCTATCGCCT GCGGCCTCGA GCGCTTAAGC ATCGAGAAGC GCCCGCACGA CATGCCCAGC
CTGCAGGCGG CCGCCTCGGT GGGGCAGAGC GCGCTTTCCA CGGCGTACGC CGAGGCGTTC
GCGCGCCACG GCATCGTGAC GTCCACGGTG CTGCTGACGC GCCGCGACAC GGCCGACCGC
CGGGCGTACC TGCACGCGCG CGACACGTTC GACCGCCTGC TGGAGCTGGG GGTGGTGCCC
ATCGTGAACG AGAACGACAC CATCTCGGTC GAGCAGATCC GCTTCGGCGA CAACGATACG
CTGGCAGCGC TCGTGGCATG CCTCGTGGAA GCCGACCTCA TGGTCATCCT CTCGGACATC
GAGGGGCTCT ACGATGCCAA CCCGCATCAC CATCCCGACG CGAACCTCAT CGGCCGCGTT
GAGGCCATTG GCCCCGAGAT CATGGCCGTG GCGGGCGAAG CCGGCACCAC GGTGGGCTCG
GGCGGCATGA TCACGAAGAT CAAGGCCGCG CGCGTGCTCA TGGTGGCCGG CATCCCGCTC
GTGGTGTGCG ACGGTCATCG TGCGGAGGCC ATCGTGGACG CGGCGGCGGG CGAGGACGTG
GGCACACTGT TCGTGGCTGC GAAGAAGCCG CACGAGATCA CGCCCAAGAA GCTGTGGATC
GCGCTCGGCG ATGCCGCGCG CGGCGCGCTC GCTGTGGACG ACGGCGCGAA GGCGGCGCTC
ATCGAGCGCG GCAGCTCGCT TCTGTCGGTG GGCGTGCGCT CGGTGGAAGG GCGCTTCGAG
GCGAACGACA TCGTCGACAT CAAGGATGCG ACGGGGCATC TGTTCGCGCG CGGCAAGGTG
GCGTTCGCTA GCGACGAGGC GGCGTTGGCC ATCGGGCGCA CCCGCGCGGA GCTGCAGGCG
AACCGCCTGC TGGCAAGCTT GGCCGACAAG CCGCTCGTCC ATCGCGACGA GTTGGTCGTC
TTCGAATAG
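
The GC content reported in the Gene Information fields is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_fraction` is hypothetical. Only the first line of the gene sequence is used as sample input, so the printed value will not reproduce the 69% figure, which applies to the full 1149 bp sequence.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (60 bases), used as sample input.
first_line = "ATGAAGCCGG CTGGATGCGC GCATGCCGCC GATCATGGCA AGCGCCTCGT GATAAAAATC"
print(f"{gc_fraction(first_line):.1%}")
```

To check the reported figure, pass the full gene sequence (all lines joined) to the same function.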
 
Protein sequence
MKPAGCAHAA DHGKRLVIKI GSSTLTTSES KIDYAYLAEV TDQVARVRAA GWRPIVVTSA 
AIACGLERLS IEKRPHDMPS LQAAASVGQS ALSTAYAEAF ARHGIVTSTV LLTRRDTADR
RAYLHARDTF DRLLELGVVP IVNENDTISV EQIRFGDNDT LAALVACLVE ADLMVILSDI
EGLYDANPHH HPDANLIGRV EAIGPEIMAV AGEAGTTVGS GGMITKIKAA RVLMVAGIPL
VVCDGHRAEA IVDAAAGEDV GTLFVAAKKP HEITPKKLWI ALGDAARGAL AVDDGAKAAL
IERGSSLLSV GVRSVEGRFE ANDIVDIKDA TGHLFARGKV AFASDEAALA IGRTRAELQA
NRLLASLADK PLVHRDELVV FE
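
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard TCAG-ordered amino acid string, which table 11 shares with the standard code for elongation codons. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGAAGCCGG CTGGATGCGC GCATGCCGCC GATCATGGCA AGCGCCTCGT GATAAAAATC"
last_line = "TTCGAATAG"

print(translate(first_line))  # MKPAGCAHAADHGKRLVIKI
print(translate(last_line))   # FE
```

The two outputs match the first 20 and the last 2 residues of the protein sequence above. The last line can be translated on its own only because the preceding 1140 bases are a multiple of three, so it starts in frame.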