Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0519 |
Symbol | |
ID | 8414803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 670332 |
End bp | 672155 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645023490 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003180893 |
Protein GI | 257790287 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.220754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGAAG AGAAGAAGCA GGGCCTTTCG CGTCGAGCGT TCTTGGGTCT GGGCGGCACC GCGCTCGCGG GTGCGGCCGT CGCCGGGCTG GCGGGCTGCG CGCCGCAGGG TTCCGCCGAC GCGGGCAAGG CCTCGACGGC GGGCGCGGCG GATGGCGCGA CGGGCGGCAT GCCCGTGGCC GACAGCGGCG CTTCGGCCGG TCCCGACGTG GCCGGCATGC ACAGCTGGGA GATCGCGCCC GAGGACATCC CCGCGGACAA GATCACGAAC ACCGAGGACT GCGACGTGCT CGTCGTGGGC GCGGGCCTCG GCGGCTGCTG CGCCACCATC GCCGCGCTCG AGGAGGGCGC GAAGAAGGTC ATCACCATCG ACAAGAACCC CGAGACGGTG GTCGCGCGCG GCGTGCACAT CGCCGGCTTC CACACGAAGG TGCAGCAGGG TCTCGTGGAC CAGGGCCTCG TCGAGGAGCC CGACTACAAC AACGTCGTGC GCCGCTGGAT CAACTGGGCG CAGGGCCGCG TGAAGGAGCC GCTGCTGTGG GAGTTCGCGC ACAAGAGCGG CGCATGCTTC GACTGGCTGT ACGACCTGGC CACCAAGAAG GGACTCGAGG CGCTGCTGTG GGACGGCTAC TACAAGGGTC CCGACTACAC CGAGTACCCG GTCACGCACA TCTTCTACCA GGCCGACAAG TACGAGGAAA CCATCAACTT CACCTTTTAC CAGGGCTCGG GCGTGGGCGA CGTGTACGGC AACGCGGTGC TCGTGCCGGC GCTGTACGAC ACCATCGAGG AGCTGGGCGG CGAGATCCGC TGGGAGACGA AGAGCGAGCG CCTCGTCCGC GACGGCGACG GCCCGGTGAC CGGCGCCATC GTGGCCACCG GCAAGAACGA GTACACGCAG ATCAACGCGA AGTCGGTCAT CATCGCCTCG GGCGACTACG CCGCCGACGA CGAGATGTTC CAGTACTACT CGCCCATGAC CGCCTACGCT ATGGACGGCC GCTTCTACAA TCCGCCCGAC GTCGACACCG GCGACATGCA CAAGCAGGCC CTGTGGGCCG GCGCGGCCAT GCAGAAGTCC GAGCCGCATT CGGCCGTCAT GCACCTCGAC TTCGGCGCGG CAAGCTACGG CTTCCTGCAC GTGAACTGGG ACGGCAAGCG CTTCAAGAAC GAGGACGTGA ACACCCAGTC GAAGAGCGTG ACGAAGGCGC TGCAGACGCA GAAGGACGCC TGGACCATCT ACGACTCCCA TGGCCTCGAG CAGGTGAAGG CCCAGATGGA CGCCGGCCTC GGCGGCGGTT TGCAGTGGGG TCAGCTCACG CAGCCCGTGG GCGGCGAGTA CAACCTGGAG GCGCAGCAGA TCGTCCTCGA GGGCGAGGTC GAGAGCGGCC AGACGCTGAA GGCCGATTCG CTGAAGCAGC TCGCCGAGAA GATGGGCGTG CCGCCCGAGA ACCTCGAGGC CACGGTCGCC CGCTACAACG AGCTGTGCGA TCTGGGCAAG GACCTCGACT ACGGCAAGCG CCCCGAGGTC ATGGGCAAGG TGCAGGATCC GCCGTTCTAC GCCGGCAAGC TGGTGGCCAG CCTGCTCACC ATGTGCGGCG GCCTGCGCAC GAACACCGAC TGCCAGGTGC TCGACGCCGA GGATCAGCCG ATCGAGGGCC TGTACGTGTG CGGCTCGGCC GCCGGCGAGT TCTTCGGCGC GGGCGACTAT CCCACCTACG TGCCGGGTAT CGGCCACGGC CGCTGCGTGA CCTTCGGCCG CATCGCCGGC ATCAACGCCG CCGGCGGCGA CGCCGAGTCG AAGATCCCCA GCCTCGACAT CTAA
|
Protein sequence | MEEEKKQGLS RRAFLGLGGT ALAGAAVAGL AGCAPQGSAD AGKASTAGAA DGATGGMPVA DSGASAGPDV AGMHSWEIAP EDIPADKITN TEDCDVLVVG AGLGGCCATI AALEEGAKKV ITIDKNPETV VARGVHIAGF HTKVQQGLVD QGLVEEPDYN NVVRRWINWA QGRVKEPLLW EFAHKSGACF DWLYDLATKK GLEALLWDGY YKGPDYTEYP VTHIFYQADK YEETINFTFY QGSGVGDVYG NAVLVPALYD TIEELGGEIR WETKSERLVR DGDGPVTGAI VATGKNEYTQ INAKSVIIAS GDYAADDEMF QYYSPMTAYA MDGRFYNPPD VDTGDMHKQA LWAGAAMQKS EPHSAVMHLD FGAASYGFLH VNWDGKRFKN EDVNTQSKSV TKALQTQKDA WTIYDSHGLE QVKAQMDAGL GGGLQWGQLT QPVGGEYNLE AQQIVLEGEV ESGQTLKADS LKQLAEKMGV PPENLEATVA RYNELCDLGK DLDYGKRPEV MGKVQDPPFY AGKLVASLLT MCGGLRTNTD CQVLDAEDQP IEGLYVCGSA AGEFFGAGDY PTYVPGIGHG RCVTFGRIAG INAAGGDAES KIPSLDI
|
| |