Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0626 |
Symbol | |
ID | 8414916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 799857 |
End bp | 801545 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023603 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003181000 |
Protein GI | 257790394 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.770057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGAGT ACGACGTATC GCGTCGCTCG TTCCTGAAGG GGATCGGCTT GGCGGGCCTT TCTGCTGGTA CGCTGGGGAC GCTGGGCGCG ACGGGCATCG CCTTCGCCGA CGATGCGCCC GTCGCCGAAT CGGTGGACGC GTATCGGCAG CTTGCGTGCG AGACGGCGCT TGTGCCGGTG AAGAAGGCGG TATGCCCGGG GCCGCGCGGG CCGGTCGGCT TCGAAGACCG CGACATCGCC GCGAGCGACA TCGCTTCGAC CGAGGATTGC GACATCGTGG TGGTGGGCGC GGGTATCGCC GGGCTCATGG CCAGCCTCAA GGCAGCCGAA GAGGGCGCGA AAGTCATCTG CATCGAGAAG ATGACGAAGG GCCGCGGCTG CTTCGAGTGC TTCGGCGCCG TGGGGGCGAA GTGCCAGGAG GGCACCGAGA TCGACAAGGT GGCGCTGTTG GACGAGATGT ACCGCAGCGC GTACTGGCGC GTGCGCCCCG AACCCATACG CACCTACGTC GACCGTTCGG GCGAGGCTAC CGATTTCTGG CAGGCCGAAC TCGACAAGGG ACAGAACGGC TTCGTCATCT CGAAGGTGGA GCAGGCTCCC TCCACCTGCG GCATGCCGGC GCTGACCCCG CTCATCGACA CGGAGCTGGG CTTCTACGAC AGCCCCTCGC TGCCGCCCGA CGCCGGAGTG CGCTCCGGCT ATTCGGGCAT CTACGTGTGC CTCGAGATGC AAGAGGTGGC GAAGGCCTAC GACAACCTCG ACCTGCGCTT CAGCACGCCG GGCGTGCAGC TGGTGCGCGA CGGTTCGGGA CGCGTGGGCG GCGTCATCGC CAAGAGCGGC GACGAGTACG TGCAGATCAA CGCCGGAAAA GGCGTGATCC TGGCCACGGG CGGCTACGAT GCGAACCCTG AGCTCATGGA AGCGTGGACG CGTCCGGAGG ATTATGCCTC GTCGAGTTGG TGGAACCCCG GTTGGGGCAC CACGGGCGAC GGGCATCTCA TGGGCATCAA AGCAGGCGCG CAGATGGATC CGTGCCCTCA GCCGGTGATG AACTTCCGCT GGGGCAATCC CGATTCGTTC TACGACGCGC GCACGTGGAA CGCCGTGTAC TTCGCCCTCA TGGTGAACGG CGACGGCAAG CGCTTCGTGC GCGAGGATCT GCCCTTCCAA AGCGTGTCGA ACGCGCAGAA CGCCCAGCCC GACTTCGGGA AGAGCTGCTG GCAGGTGTTC GACGATACCA TGTTCGAGGG CATGGAGGAC GCGGTTGAGG AGTTCAAGCA GAAGGGCTGG CTGTTCGAGG GATCAACGCC CGAAGAGCTG GCCGCCGCCT GCGGCGTCGA CCCGCAGGGC CTCGCCGACA CCATCGCGCG GTACAACGGG TTTTTCGAGG CCGGCGTGGA CGAGGACTTC GGACGCGACC TGGCGGGAAC CATCCCGTTC ACGGGCACGC GCTTCTTCGC GCTGACCACG AACAGCTGCG TGCTCGCCAC GGTGGGCGGG CTCACCATCG ACGGTTCGTG CCGCGTGCTC GATACCGACG ACCGGGTGAT CGAGGGGCTG TACGCCGTGG GCAACGCGTC GGGCAACTTC TTCGCCGGCA ACTACCCCCG TCATATCCCT GGAACGTCCA TCGGACGCGC CGTCACGTTC GGCTACGTTG CCGCCGAGCA TGCGGTGAAA GGAGCGTAA
|
Protein sequence | MGEYDVSRRS FLKGIGLAGL SAGTLGTLGA TGIAFADDAP VAESVDAYRQ LACETALVPV KKAVCPGPRG PVGFEDRDIA ASDIASTEDC DIVVVGAGIA GLMASLKAAE EGAKVICIEK MTKGRGCFEC FGAVGAKCQE GTEIDKVALL DEMYRSAYWR VRPEPIRTYV DRSGEATDFW QAELDKGQNG FVISKVEQAP STCGMPALTP LIDTELGFYD SPSLPPDAGV RSGYSGIYVC LEMQEVAKAY DNLDLRFSTP GVQLVRDGSG RVGGVIAKSG DEYVQINAGK GVILATGGYD ANPELMEAWT RPEDYASSSW WNPGWGTTGD GHLMGIKAGA QMDPCPQPVM NFRWGNPDSF YDARTWNAVY FALMVNGDGK RFVREDLPFQ SVSNAQNAQP DFGKSCWQVF DDTMFEGMED AVEEFKQKGW LFEGSTPEEL AAACGVDPQG LADTIARYNG FFEAGVDEDF GRDLAGTIPF TGTRFFALTT NSCVLATVGG LTIDGSCRVL DTDDRVIEGL YAVGNASGNF FAGNYPRHIP GTSIGRAVTF GYVAAEHAVK GA
|
| |