Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0124 |
Symbol | |
ID | 8414407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 166763 |
End bp | 168256 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023103 |
Product | Succinate dehydrogenase |
Protein accession | YP_003180507 |
Protein GI | 257789901 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAG GAATTTCACG GCGCAACTTC ATCGGCGGTG CGGCGCTGGG CGGCGCAGCG CTCGCATTGG GCGCGCTTGC GGGCTGCGCT CCGCAAGGCG ACAAGCCCAC GGGTTCCGAG GGCACCTGGG ATGCCGAGGT CGACCTGCTG GTGTGCGGCT GCGGCGGCGC CGGCATGGCG TGCGCGGTCG AAGCCAAGGA CAATGGCGTG GAGAACGTGC TGATCATCGA AAAGGGTGAC CAGATCGGCG GCACTACGGC CATGTCCCAG GGCATGATCG CCGGTTGGGA TACCCAGCTG CAGAAGTCGC AGGGCGTCGA GCTCACCTAC GACGCGATGT ATGCGAACCT GATGAACAAC GCGTCGTATC ATCTTGACCC GGCGCTCACT AAGATCACGG TGGAGAACAG CGGCAAGACC ATCGACTGGC TGATCGACCG CGTGGGCGTC AAGTTCACGG ATCAGATCGA CATCTACTAC GGCCCCCTGC AAATGATGCA TAACGTCGAC GGCGCGGGCG GCGGTTTCGT CGCGGCTTTC ACCGCCTGCC TGGACCAGCT GGGCGTGGAG ATCCAGAAGG GCACGAAGCT GGTGGAGGTT CTGCTGGACG CCGAGGGCAA GACGGAAGGC GCGGTGGTCG AGGCGAAGGG AAAGACGCAG CGTATCAAGG CGCGCGCTAT CATGATCGCC ACCGGCGGCT ACGCCTACAA CGCCGAGCTG GCGGCGCGCT TCGATCCCGA GAAGGCGGGT ACGTTCGGCA TCGGGCACCC GAATTGCGAG GGCGAGGGCC TGGTGGCCGC TTCGAACGCA GGCGCGCTCC TGTCGCACAC CAACGACATG ATGTGCGTGG TGAAGGACTA CACCATCATG AGCGAGCACA ACGGCACCTC GGCCAGCGCT AACCTCAACG GTTTCACGAA CCTTCCCAAC ATGATCTTGG TGGGCGCCGA CGGCAAGCGC TTCGTCGACG AGGGCAAAAA AGGCTTCATG AGCCAGAACC TGAACGGCCC CATTTTCGAC AAGATCCATC GCGACGGCAT GGGCTACGTA TGGGAGATCT CCGACGAGGC CACGGTGGCA GCAGCCGGCG GCAAAGTGAA GCGCGGCGAA GGACTCGAAT ACATCAAGGG TGCCGATGCG ACTGAGCTCG CGGCCAACAT GGGCGTGGAT GCCGCAGCGC TTGCCGAGAC CATCGAGGCC TACAACGCTG CGGTGGACTC GGGCGTGGAC CGCGAGATCG GCGGCTTCCC CACGGCCAAG CTGGAGGCTC CTTTCCTGGC CGTGCCCGTC GTGCCGTGCG AGATCATCAC GTACGGCGGC GTGGCTCGCA CCGAGCAGGG CGAGGTCATC CGTGCTGATG GCGAAGTGAT GCCGGGCCTG TTCGTGGGCG GCGAGGCTAG CTGCAACTCC GCCTACATGG GATTCACGCT TTCCAACTGC TTCACGTGGG GTCGCATCGG CGCCCAGAGC GCGGCGGCGT ACCTGAAAGC CTAG
|
Protein sequence | MNEGISRRNF IGGAALGGAA LALGALAGCA PQGDKPTGSE GTWDAEVDLL VCGCGGAGMA CAVEAKDNGV ENVLIIEKGD QIGGTTAMSQ GMIAGWDTQL QKSQGVELTY DAMYANLMNN ASYHLDPALT KITVENSGKT IDWLIDRVGV KFTDQIDIYY GPLQMMHNVD GAGGGFVAAF TACLDQLGVE IQKGTKLVEV LLDAEGKTEG AVVEAKGKTQ RIKARAIMIA TGGYAYNAEL AARFDPEKAG TFGIGHPNCE GEGLVAASNA GALLSHTNDM MCVVKDYTIM SEHNGTSASA NLNGFTNLPN MILVGADGKR FVDEGKKGFM SQNLNGPIFD KIHRDGMGYV WEISDEATVA AAGGKVKRGE GLEYIKGADA TELAANMGVD AAALAETIEA YNAAVDSGVD REIGGFPTAK LEAPFLAVPV VPCEIITYGG VARTEQGEVI RADGEVMPGL FVGGEASCNS AYMGFTLSNC FTWGRIGAQS AAAYLKA
|
| |