Gene Elen_0085 details


Gene Information

Locus tag: Elen_0085
Symbol:
ID: 8414366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 112482
End bp: 113828
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 62%
IMG OID: 645023062
Product: citrate synthase
Protein accession: YP_003180468
Protein GI: 257789862
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.691353
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACTG AAAGCCAGAT CGCTCTGTAC GAGAACTTCA AGGCGATCAA CACCATCGAA 
ACCGCGAGCT ACGACCAGTT CGACGTGAAG CGCGGCCTGC GCAACGCCGA CGGCACGGGC
GTCATCGCAG GTCTTACGAA CATCGCCAAC GTTCACGGCT ACGTGGTCTC AGACGGCGAG
AAGGTAGCCG ACGAGGGTAT GCTGCGCTAT CGCGGCTACG ACGTGTACGA CCTGCTGGAC
ACCAGCGTGG CCGATCGCCG CTTCAACTTC GAGGAGGTGG CGTACCTGCT GCTCATGGGC
GAGCTGCCCA CACAGGAGCA GCTCGACCGC TTCATCGCCG CGCTCGATGC CGAACGCGAA
CTGCCCGACG GCTTCACTGC CTCCATGATC ATGCGCGACA CCCCGCCCGA CATCATGAAC
ATGCTGCAGC GCACCATCCT GCTGCTGTAC GCCTACGACG CGGACGCCGA GGATCGTTCG
GCTCATCACG AGATCCACAC CGCCATCTCG CTGATCTCGC GTCTGCCGCG CATCATGGTG
CTGACCTACT ACGCGAAGCA GGCTCGCTAC AACAACGGCT CCATGATCAT GCATCGCTTC
ATTCCCGGTC AGTCCACGGC CGAGACCATC CTGTCCATGC TGCGTCCCGA TCGCCAGTTC
ACGGCTGAGG AAGCGCGCAT GCTGGACATC ATGCTGTGCC TGCATGCCGA GCATGGCGGC
GGCAACAACT CCACGTTTGC CACGCGCGTG CTGACCTCGT CCGACACCGA TCCGTACTCC
ACGTACGCCG GCGCTATCGG TTCGCTCAAG GGGTCGAAGC ATGGTGGCGC GAACCATCAG
GTGCTAGCTA TGCAGCAGGA GATCAAGCAG AACGTAGCCG ACTGGTCCGA CGAGGGCCAG
GTGGCCGATT ACCTGGCGAA GATTGTCAAC AAGGAGGCTT TCGACAAGAC GGGTCTCGTG
TACGGCATGG GGCATGCGGT GTACACGAAG TCCGACCCGC GCGCCATCAT CTGCAAGCAG
TTCGCCGAGA AGCTGGCCGT GGGCACGGAG TTCGAGGCCG AGTATCGTCT GCTGGAAAGC
ATCGAGCGCC TGGCGCCCGA GGTGATTCTG CGTGAAAAGG GCACCAGCAA GGACATGTGT
GCGAACATCG ACATGTATTC GGGCTTCGTG TACTCGATGA TGGGCATTCC CGAGGATCTG
TTCACGCCGC TGTTCGCGTG CGCGCGCATG TCCGGCTGGG CTGCGCACCG CTTCGAGGAG
ATCGTCTCCG GCAAGCGCAT CATCCGTCCT GCGTACAAGT CCATTCGCAG CGGCAAGCGC
GATTACGTTC CCATGAGCGA ACGCTAG
 
Protein sequence
MATESQIALY ENFKAINTIE TASYDQFDVK RGLRNADGTG VIAGLTNIAN VHGYVVSDGE 
KVADEGMLRY RGYDVYDLLD TSVADRRFNF EEVAYLLLMG ELPTQEQLDR FIAALDAERE
LPDGFTASMI MRDTPPDIMN MLQRTILLLY AYDADAEDRS AHHEIHTAIS LISRLPRIMV
LTYYAKQARY NNGSMIMHRF IPGQSTAETI LSMLRPDRQF TAEEARMLDI MLCLHAEHGG
GNNSTFATRV LTSSDTDPYS TYAGAIGSLK GSKHGGANHQ VLAMQQEIKQ NVADWSDEGQ
VADYLAKIVN KEAFDKTGLV YGMGHAVYTK SDPRAIICKQ FAEKLAVGTE FEAEYRLLES
IERLAPEVIL REKGTSKDMC ANIDMYSGFV YSMMGIPEDL FTPLFACARM SGWAAHRFEE
IVSGKRIIRP AYKSIRSGKR DYVPMSER
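
Consistency check (sketch)

The record's coordinates, lengths, translation table, and sequences can be cross-checked against each other. The following is a minimal Python sketch of such a check, not a tool provided by the source database. The helper names (clean, gc_percent, translate) are hypothetical, and the codon table is built from the NCBI amino-acid string for translation table 11. Only the first two 10-base blocks of the gene sequence are used in the usage lines; the full sequence can be pasted in the same way, spaces and line breaks included.

```python
from itertools import product

# NCBI translation table 11 amino-acid string; codons are ordered with
# T, C, A, G at each position and the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Assumes the sequence starts with ATG, as this gene does, so the
    alternative start codons of table 11 need no special handling.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the Gene Information fields above.
start, end = 112482, 113828
gene_length = end - start + 1          # inclusive coordinates
protein_length = gene_length // 3 - 1  # the stop codon is not translated

print(gene_length)     # 1347
print(protein_length)  # 448
print(translate("ATGGCAACTG AAAGCCAGAT"))  # MATESQ
```

The printed values agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields and with the first six residues of the protein sequence. Calling gc_percent on the full gene sequence gives a value to compare with the GC content field.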