Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2107 |
Symbol | |
ID | 8416425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2479069 |
End bp | 2480823 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645025090 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003182459 |
Protein GI | 257791853 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.107675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0249042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACG AAAAGCAAAA GAAAGAGCTG GCCGACGGCC TCACGCGTCG CGGGTTCCTC ACGCTAGGAG GCGTGGCGGC GCTGGGAGCG GGAGCAGCGT TGGCGGGATG TTCGCCGCAG GCGTCTTCGG CAAGCTCGGC GGGCGATGCT GCGACGGCTG CGAGCGCGGC CGACGCCGCC GAAGGACTTG CCCCGTCGCC GTGGGGAGAC TATTATCCTT GGCCCGCCAA TCCTCCTGAG ATCACCGACG ATATGGTGGA GGAAGAGCTT GACTGCGACG TGGCGGTCGT GGGTCTGGGC GTCTCGGGCG TCGCGGCGTT CCGTGCGGCG TCCGAGGGCG GCGCCAAGGT CGTCGGCATC GAGAAGGGTT CCATGCCGCA GCAGCGCTCC AGCCAGTACT GCTACTTGAA CGGCAAGCTG ACCGACACCT TCGGCCTGCC CCACCTCGAC CTTGAGGCCA TCATCAAGGA AGAGTTCCAG GAGTGCGCCA GCATCACGAA TTACAATATC GTCCGCAAGT TCGTCTTGGG CGACGCGGAG GCTATGGACT GGTGGATCGA GGGCGCGGAT TGCGAGATCC TCGAGAATTA CGAGATGTCG ATGGATCCCA GCGCGGACGT TCCCAACGGC GTCAGCTGCA TGAGCGATCC GACCATCGAC TGGGAGAGCG AGCCGCAGGC CGCTTTCCCC GAATGCCTGA ACTTCACCGA CCATCAGGCC GTGCTCGACA ACAATTTCCA GAAGGGCCTC GATGCCGGCA CGGGCTCCGA AGCGTATTTC GGCCATTTCG CCGAAGCGCT CATCCAAAAC GATGAAGGTC GCGTGGTGGG CGTGTACGCG CGCAACGCCG ATACCGGCAA GTACAAGAAG ATCAACGCTG AAAACGGCGT GTGCCTGGCA TCGGGCTGCT GTCGATCCAA CGAGGACATG GTGCGCTACT TCGCACCGAA CCTCATCTGG AACGGCAACG GCAACCCATG GCCGAACATG GACGTCGAGG GCAACCGCAC CAATACGGGC GACGGCTACA AGCTGGGGTA CTGGGCGGGT GCTTCCATCC AGCAGTACCA GTGCTCGCAG ATGCACGTGA TGGGGGGTCC GGGCGACACC GATTCGCAGA ACGATTCCAT GGGCTTCACC GGACCGCTGT TGCGCATCAA CTACAACGGC GAGCGGTTCA TGAACGAGGA CACCTGCGCC GCCGATGCCG AGTATCCCAT CGAGCTGCAG CCCAAGCACA AGTGCTTCAT GATCACCGAC AGCCACTTCG AAGAGAATGC TGCGAAGTGC GTGAACACGT TCGCGGCCAC GCTCGCCGAG TGGGACGAGC GGGTGGGCGA CGGCACCATC TTCAAGGGCC AGACGCTGCG CGAGTTGTTC GAGAGCATCG ACGGCATGGA CGTCGATGCA GCGCTGGCCA CGGTCGAGCG CTACAACGAG CTGTGCGAGA AGGGCGTCGA CGAGGACTAC GCCAAGCAGG CGAAGTACCT GATGCCCGTG ACCGACGACG GCCCGTACTA CGCGCAGCGC ATGGGCGTGG GCCTGTGCCT GGTCATGATG GGCGGCTTGG AATCCAACCA GAACGCCCAG GTGCTCGACC TCGAGCGCCA GGTGATCCCC GGCTTGTATG CGTCCGGCAA CATCCAGGGC AGCCGCTTCG CTATCAAGTA CCCCTTCCGT CTGAGCGGCC ACAGCCACGC GATGGCCATG TTCTACGGCA AGGTGGCCGG CGAGAACGCC GCAGCCGGTT TGTAG
|
Protein sequence | MNNEKQKKEL ADGLTRRGFL TLGGVAALGA GAALAGCSPQ ASSASSAGDA ATAASAADAA EGLAPSPWGD YYPWPANPPE ITDDMVEEEL DCDVAVVGLG VSGVAAFRAA SEGGAKVVGI EKGSMPQQRS SQYCYLNGKL TDTFGLPHLD LEAIIKEEFQ ECASITNYNI VRKFVLGDAE AMDWWIEGAD CEILENYEMS MDPSADVPNG VSCMSDPTID WESEPQAAFP ECLNFTDHQA VLDNNFQKGL DAGTGSEAYF GHFAEALIQN DEGRVVGVYA RNADTGKYKK INAENGVCLA SGCCRSNEDM VRYFAPNLIW NGNGNPWPNM DVEGNRTNTG DGYKLGYWAG ASIQQYQCSQ MHVMGGPGDT DSQNDSMGFT GPLLRINYNG ERFMNEDTCA ADAEYPIELQ PKHKCFMITD SHFEENAAKC VNTFAATLAE WDERVGDGTI FKGQTLRELF ESIDGMDVDA ALATVERYNE LCEKGVDEDY AKQAKYLMPV TDDGPYYAQR MGVGLCLVMM GGLESNQNAQ VLDLERQVIP GLYASGNIQG SRFAIKYPFR LSGHSHAMAM FYGKVAGENA AAGL
|
| |