Gene Elen_2702 details

Gene Information

Locus tag: Elen_2702
Symbol:
ID: 8417028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3129524
End bp: 3130804
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID: 645025679
Product: aspartate kinase
Protein accession: YP_003183040
Protein GI: 257792434
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class
            [TIGR00657] aspartate kinase
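
The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `span_length` and `coding_residues` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3129524
END_BP = 3130804
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```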


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTAA TCGTAGCAAA ATTCGGGGGC ACGTCCGTCG CTTCGCCTGA GCGCATCCAG 
ATGGTCGCGA AGAAGCTCAT TGCGAAGAAG CAGGCAGGGC ACCAGGTGGT TGCCGTCGTG
TCCGCCATGG GCAAGACCAC CGACGAGCTC GTAGGCCTCG CCGCCTCGCT CAACGACAAC
CCGCCGGCGC GCGAGATGGA CCGTCTGCTG TCCACGGGCG AGCAGGTGTC CATGACGCTG
CTGGCGATGG CCATCGAGGC GCGCGGCTAC AAGGCCATGA GCTTCACGGG CCGTCAGGCC
GGCATCGAGA CGGACGGCAT GCACGCCAAG GCCAAAATCG TGAAGGTGCA CAACGAGCGC
ATCATGGAAG CCCTGAACAA GGGCGTGATC GCGGTGGTGG CCGGGTTCCA GGGCATCGAC
GCCAACGGCG ACATCACCAC GCTCGGCCGC GGCGGCTCCG ACACCACGGC GGTGGCGGTG
GCGCACGGTC TGGGCGCCGA CGTGTGCGAG ATCTACTCCG ATGTGGACGG CGTGTACACG
GCCGACCCGC GCGTGTGCCC GCGCGCCAAG AAGCTCGACG TCATCTCGTA CGACGACATG
CTGGAGCTGT CCAGCTCGGG CGCCGGCGTG CTGCAGATGC GCGCCGTGGA GTTCGCGCGC
AAGTACCAGG TCGTCATTCA TTCCCGCTCG GCGTTCTCCG ACGCCGAAGG CACCTATATC
AAGGAGGAGA CCGACATGAT GGAGGAAGCC GTCATCACCG GCATCGCCCA CGACACGTCC
GAGGTGAAGT TCACCATCCG CGGCGTGCCC GACATGACCG GCGTGGCCGC GAAGGTGTTC
TCGGCGCTGG CGGGCAACAC GGTGAGCGTG GACATGATCA TCCAGAACAT CTCGGAGGAC
GGCATCACCG ACATCAGCTT CACGTGCCCC GGTGCCGACC TGCCCCGCGC GAAGGAGACG
GTGGAGCGCA TCTTACCCGA CATCAACGCC CGCGACTACG ACGTGGACGA GGACATCGCG
AAGGTGAGCC TCGTGGGCAC GGGCATGAAG TCGTCGCCCG GCGTGGCGGC GCGCGCGTTC
TCGACGCTCG GCGAGAACCA GATCAACATC CTGGCCATCT CGACGTCGCC CATCCGGCTG
TCCGTCGTGG TGGACGGCGC GCAGGCCGCG GCGGCCGTGC GCTGCCTGCA CAAGGCGTTC
GACCTCGATT CCGACAGCGT GTTCGAGGAG ACGCAGCTGA GCGCCGAGGA AATCGCCGCG
AAGATGAACA AGGGTAGATA G
 
Protein sequence
MSLIVAKFGG TSVASPERIQ MVAKKLIAKK QAGHQVVAVV SAMGKTTDEL VGLAASLNDN 
PPAREMDRLL STGEQVSMTL LAMAIEARGY KAMSFTGRQA GIETDGMHAK AKIVKVHNER
IMEALNKGVI AVVAGFQGID ANGDITTLGR GGSDTTAVAV AHGLGADVCE IYSDVDGVYT
ADPRVCPRAK KLDVISYDDM LELSSSGAGV LQMRAVEFAR KYQVVIHSRS AFSDAEGTYI
KEETDMMEEA VITGIAHDTS EVKFTIRGVP DMTGVAAKVF SALAGNTVSV DMIIQNISED
GITDISFTCP GADLPRAKET VERILPDINA RDYDVDEDIA KVSLVGTGMK SSPGVAARAF
STLGENQINI LAISTSPIRL SVVVDGAQAA AAVRCLHKAF DLDSDSVFEE TQLSAEEIAA
KMNKGR
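
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one possible way to do this, not the database's own method. It assumes the space-separated 10-base blocks are concatenated as-is and that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last lines of the gene sequence are used as examples.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence listed in this record.
FIRST_LINE = "ATGTCGCTAA TCGTAGCAAA ATTCGGGGGC ACGTCCGTCG CTTCGCCTGA GCGCATCCAG"
LAST_LINE = "AAGATGAACA AGGGTAGATA G"

print(translate(FIRST_LINE))  # MSLIVAKFGGTSVASPERIQ
print(translate(LAST_LINE))   # KMNKGR
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own. Passing the complete gene sequence to `gc_content` gives a value to compare against the GC content field in the Gene Information table.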