Gene Elen_2119 details


Gene Information

Locus tag: Elen_2119
Symbol:
ID: 8416437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2493696
End bp: 2495546
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 67%
IMG OID: 645025102
Product: fumarate reductase/succinate dehydrogenase flavoprotein domain protein
Protein accession: YP_003182471
Protein GI: 257791865
COG category: [C] Energy production and conversion
COG ID: [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
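
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2493696
end_bp = 2495546
protein_length_aa = 616

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which adds no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1851
print(remainder)                # 0
print(expected_protein_length)  # 616
```

Under these assumptions the computed values agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields.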


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0348358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.647949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATACGT CTGCAACAAC CAATCATGGG GAATCGAACA ACGCGAACGG CGGCTTCTCG 
CGCAGGACGT TCGTCAAGGG CGGCTTGGCC GCCGGCGTCG CTGCGCTGGG CGGAATGGCG
CTGACGGGAT GCGCCGCAGC ACCGGCTTCA TCGAAGCAGG CGGCTGCCGC CGACGCCGCC
GCTCCGACCG ACGAGATCAC CGCGCGCCTG GTGGAGCGCG TGCACGACGC CAACCTGCCC
GACGCGGCGC CCATCCTGCC GGTGGAGCCG CCGGCGTCGT GGGACGACGA GGCCGACGTG
GTCATCGTCG GTGTGGGCGG CGGCGGTATC GTGGCCACGG CGTTTCTCGC CCAGCAGGGG
CTCAAGGTCA TCGGCATCGA GAAAGAGGGC CAGGTGGGCG GTGCGAGCCG CCATGCCTGC
ACGTTCGCCA ACGTGTTCGG CGGCTCGAAG GACCAGAACG CGCTGGAGTT CAGCGTGCCC
ACGTTCCCGC CCGACGTGAA GGCGTTCACC CGCATGTACG AGGAGCAGAA CGCCTACTCC
ATCGACGAGA AGTTCCTCAT GAACCAGCTG CTCATGTCCG GTCCCGCGTG CGACTGGATC
ATGGAGCAAG ACGGCATGAA CATGGAATGC TTCGGGCCCA TCTGGCACGA CGCCGACGTC
CACGCCGGCA AGCAGAGCGT GGTGCTGGGC ATGAACAACC CCACGAACGC CATGGAAGCC
GTTGCGCTGG CAGCGGGCGC CGACATCCGC CTGTCCACGA AGTGCGAGAA GCTCGTGGCT
GACGGCGGTC GCGTGGTGGG CGTCGTGGCC AAGGGGCCGG ATGGCAAGGA GCGCTACGTC
AAGGCCGAGA AGGGCGTCAT CCTGTGCGCG GGCGGCTTCG GCATGAACCG CGACCTGATC
CGCGCCTACC TGCCGAGCGC CTACGAGGGC ACCGTGCAAG GCGGTCCCAT GCCGTCGCAT
ACGGGCGAAG CCTTCCGCAT GGGTCTGGGC ATGGGCGCCG ACTTCTCCGG CTTCGATTCG
TGGAGCTGTT GGGAAGGCGC CATCGACGAG GAGACGGCCG GCGGCGACGG CCAGTTCTGG
CACTATTTCT GGCACGGCGA GCGCCAGCTG TTCCACAACC CGTGGCTCAT CATCGACAAG
CGGGGCAACC GCCAGCCGTA CTTCGCAGCC ACGCAGGAGC TGTTCGCGAA CCCGGGCGGG
CAGATGGGCG ACCTGAGCAA CTGCGCGGCC TGGATGTCGG CGGTGGGACA TCATGTGTAC
TCCATCTGCG ACTCCGACTT CCCGACCACC GTGTTCGAGA AGAACGTGCT CACCGACGAG
GGCACCGACC GCAACCGCAT TCCCATCACC GACCCGAGCA CGCTGATCGA CACGAAGGGC
CTCGTGTCGG CAGACTGGCT GGCCGAGGTC GACGAGGCGG TGGAGCGCGG CGCCGTGAAG
AAGGCCGACA CCATCGAGGA GCTGGCCGAT ATGCTGCTGC TCGACCGCGA CGTGCTGGTG
CGCGCGGTGA AAGAGTACAA CGAGCTGTGC GAGAAGGGCG TGGATGACGA GATGTCCACG
CCGTACGACC CCTCGTGGCT GCATCCCGTG GTGAAGCCGC CGTTCTACGG GGCCATCATC
GGCAGCCAGA TGGCGAAGAC GATGTGCGGC CTGCGCACCG ACGAGCATCT GCAGGTCATG
CGCGAGGACG GCTCGCTCAT CGAGGGTTTG TACGCCAACG CCACCACGGC GGGCGGCCTG
TCGGGCGAGG CGAACTACGG CTGCTTCTGG AACTCGACGG TGTTCGGCGG GGTGGGCACC
AGTTGGATCA CCGGGTGGAT CGCGGCGAAG TCGCTGTTGG ACGCCCAGTA G
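
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in blocks of ten bases, as it is here. The following is a minimal sketch; `gc_percent` is a hypothetical helper name, and how the source database rounds its reported value is not stated in the record. The example call uses only the first two blocks of the gene sequence, so its result is not expected to match the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
prefix = "GTGGATACGT CTGCAACAAC"
print(gc_percent(prefix))  # 50.0
```

Passing the full gene sequence as one string (line breaks included) should give a value comparable to the reported GC content.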
 
Protein sequence
MDTSATTNHG ESNNANGGFS RRTFVKGGLA AGVAALGGMA LTGCAAAPAS SKQAAAADAA 
APTDEITARL VERVHDANLP DAAPILPVEP PASWDDEADV VIVGVGGGGI VATAFLAQQG
LKVIGIEKEG QVGGASRHAC TFANVFGGSK DQNALEFSVP TFPPDVKAFT RMYEEQNAYS
IDEKFLMNQL LMSGPACDWI MEQDGMNMEC FGPIWHDADV HAGKQSVVLG MNNPTNAMEA
VALAAGADIR LSTKCEKLVA DGGRVVGVVA KGPDGKERYV KAEKGVILCA GGFGMNRDLI
RAYLPSAYEG TVQGGPMPSH TGEAFRMGLG MGADFSGFDS WSCWEGAIDE ETAGGDGQFW
HYFWHGERQL FHNPWLIIDK RGNRQPYFAA TQELFANPGG QMGDLSNCAA WMSAVGHHVY
SICDSDFPTT VFEKNVLTDE GTDRNRIPIT DPSTLIDTKG LVSADWLAEV DEAVERGAVK
KADTIEELAD MLLLDRDVLV RAVKEYNELC EKGVDDEMST PYDPSWLHPV VKPPFYGAII
GSQMAKTMCG LRTDEHLQVM REDGSLIEGL YANATTAGGL SGEANYGCFW NSTVFGGVGT
SWITGWIAAK SLLDAQ
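
The protein sequence follows from the gene sequence by translation with table 11, which uses the same codon-to-amino-acid assignments as the standard code but allows alternative start codons such as GTG. The sketch below uses only the Python standard library; `translate_cds` is a hypothetical helper, and it assumes the input begins at a valid start codon, so the first codon is always written as M.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts at a valid initiator codon, which is
    translated as methionine (M) even when it is GTG or TTG.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGATACGT CTGCAACAAC CAATCATGGG"))  # MDTSATTNHG
```

The output matches the first block of the protein sequence, including the GTG start codon read as M.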