Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0629 |
Symbol | |
ID | 8414919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 802813 |
End bp | 804612 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023606 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003181003 |
Protein GI | 257790397 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.56075 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACGA ACGTAACGAA CGAAGGCGGC ATCTCCCGCC GCAGCTTCCT GGGCGGCGTC GCAGGCGTCG GCGCGCTTGC CGCCATGGGC CTGGCCGGCT GCTCGCCCAA GGCCGCGGGT ACAACCGGAG CCGAGAGCGC TTCCAGCTCT GCCAGCGCAG CCACGGGCGT GGCCGACAAC AACGCGGTGG CCGTGGACGA CGGCAGCGCC GCCATCACGG TCGACTGGCT GGGCGCCCCG CCGGAGATCG GCGACATCAC CGAGACGAAG GACACCGACC TGCTCATCGT GGGCGCAGGC AACGGCGGCA TGATCGCCGG CGCGTACGCA TCCGACCAGA AGATGGACTT CATCCTATGC GAGAAGGGCA CCGAGGTGGG CGCCACGCGC CACTGGTTCA ACGCCGTCGA CACCAAGCCC TTCACCGACC AGGGCTACCA CACCGACCGC GCCCGCCTGC ACGGCGAGTG GGCTCGCTAT TCCAGCGGCA CGTGCGACCA CAACCTCATC AACATGTGGA TGAACGAGAG CAACGACATG TTCGAGTACG TCGACAAGTA CATGAGCGAA GCCGGCGCCG TGGTCATCGC CGACGAGTTC GAGATGCCGG GCGGCATGGG CGCCACCCCC TTCTACACCC CCTGCGGCGA GCACCACTAC GGCAACGCCG AGGGCGGCCG CGACGGCGTC CCCGTGCGCA ACGAGCTGTT CGAGAAGGTC ATGAACGACA ACGGCTACGA GATCTCCTAC AAGCACGAGC TGGTGAAGCT GGTCACCGAG GATTCCGGCA AGGTTACCGG CGCCATCTTC AAGACTGACA ACGGCTACAC GCAGATCAAC GCCAAGAAGG GCGTGCTGCT GACCACGGGC GGCTACTCCG CGAACCCCGC CATGCTCTCG TCGCTGTCCC CCATCACCAC CGCGTCGGTG ACCGCGCTGG GCTACAACCA GAACAACACG GGCGACGGCA TCAAGGCGGC GCTGTGGGCA GGCGCGGTGA AAGACATCAC CAGCGCCACG ATGATCTTCG ACCGCGGCCT CGTCGCGCCC GGCACCACCG CGGGCTACAC CGAGGAGTCC GTCAAGGCCG GCAATCCGCA GTGGCCGGGC AACGGCCAGT TCAACCCCGG CACGCAGCCC TTCCTCAAGG TGAACCTGCG CGGCGAGCGC TTCGCCCTCG AGTCCGCCGA CTACGACTAC TTGCCCCATG CGGCCGCGCA GCAGCCCGGC GGCGTGTACA TCAGCGTCTG GGACGGCAAC TTCGGCGACG ACGTGCAGCG CTTCCACACC CTCGGCTGCT CGGCCGGCAC CCGCACGGGC GTGCTGGGCG TGAAGAAGGA AGACGGCACG TACGACCTGG ACAAGTACTT CGAGAAGGAG CTGGCCGACG GTCGCCTGCA GAAGGCCGAC ACGCTCGACG AGCTTGCCGA CAAGCTTGGC TTCGACGCCG ACGCCAAGAA GACGTTCCTC GCCACCTGCG AGCGCTACAA CGAGCTGTAC GACAACCAGG AAGACGTCGA CTTCGGCAAG GAGCCCTACC GCCTCTCCGA GCTGCGCACC CCGCCGTTCT TCGGCGCCAC GCTGGGCGGC ACGCTGCTCA CCACCATCGA CGGCGTCCGC ATCAACGCCG ACTGCCAGGC GCTCAACACC GACTTCGAGC CCATCGAGGG CCTGTACTGC GCCGGCGACT GCTCGGGGTC GCTCTTCAGC GGCAACTACC CCGACCAGAT GCACGGCGTC GCCTGCGGCC GCACGATGAC CGAGGCCCTG CACGTGGTCA AGCTCGTGGC CGCCAAGTAA
|
Protein sequence | MGTNVTNEGG ISRRSFLGGV AGVGALAAMG LAGCSPKAAG TTGAESASSS ASAATGVADN NAVAVDDGSA AITVDWLGAP PEIGDITETK DTDLLIVGAG NGGMIAGAYA SDQKMDFILC EKGTEVGATR HWFNAVDTKP FTDQGYHTDR ARLHGEWARY SSGTCDHNLI NMWMNESNDM FEYVDKYMSE AGAVVIADEF EMPGGMGATP FYTPCGEHHY GNAEGGRDGV PVRNELFEKV MNDNGYEISY KHELVKLVTE DSGKVTGAIF KTDNGYTQIN AKKGVLLTTG GYSANPAMLS SLSPITTASV TALGYNQNNT GDGIKAALWA GAVKDITSAT MIFDRGLVAP GTTAGYTEES VKAGNPQWPG NGQFNPGTQP FLKVNLRGER FALESADYDY LPHAAAQQPG GVYISVWDGN FGDDVQRFHT LGCSAGTRTG VLGVKKEDGT YDLDKYFEKE LADGRLQKAD TLDELADKLG FDADAKKTFL ATCERYNELY DNQEDVDFGK EPYRLSELRT PPFFGATLGG TLLTTIDGVR INADCQALNT DFEPIEGLYC AGDCSGSLFS GNYPDQMHGV ACGRTMTEAL HVVKLVAAK
|
| |