Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0363 |
Symbol | |
ID | 8414647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 466503 |
End bp | 468206 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023340 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003180743 |
Protein GI | 257790137 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.629419 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGC CTGACAAGAG CACGCTCAGC CGCCGTTCGT TTTTGACGGG AGCGGCCGCC ACGGCCGGCC TCGCCGCCTT CGCGGGACTC TCCGGCTGCG CGCCGCAAAG CACGACGGGC GATGCCGCGA ACGCGGCAAC CGCATCCGGC GCCTCGGCGT CCGGCAGCGC CTCGGCAGGC GCCATGCAGA CCACCGACCC CGCCCAGGCC GTGTGGCCCG TGGTGGAAGA GGTGGAGGTG GGCGCCGCCG GCGAGGGCAA GATCGCCTTC GTGGCCGAGC CCATCGCAGC AAGCGATATC GTGGCCACGC ACGACGTGGA CGTGGTGGTG TGCGGCCTCG GCCCCGCCGG CGACGCGGCG GCGCTGGCCT GCGCCGAACA GGGCCTCAAG ACGGTGGCCG TGGAGAAGCA GACCACCGGC AACTACAATT CCGCCACCAT CGGCGGCACG AACTCCAAAC TGCACGAGCA TTGGGGCATG ACCTACGACA CCGATGCCTG GATCAGCGAC GCCATGATCG ACTGCGCTTT CCAGGGCGAC ATGGAGCTGT ACCGCCACTG GCTGGAGAAG AACGGCGAAG CCGTGGACTG GTACATCAGC CACTTCGACA ACCAGAACCT CGACGACTAC CCGCTGACGT TCGCCGCGGG CGACTTTCCC GACTTCCGCG ACGAATGGGA CAAGACCTCT CTGTCGCGCT CGTGGAACAC GTCGCTCAAC CTGCCCTACC CTCCGGGCGA GCTGGCGGGC ATCCTCGCCG GCATCCTCGA GCAGGCCGGC GTGGAAATCC GCTACAGCTG CCCCGCCTGC CAGCTGGTCA CCGACGACTC GGGCAAGGTG ACGGGCGTCA TCGTGCAGAG CGACCAGGGA TTTGAGCAGT ACAACTGCGC GAAGGGCGTG GTGCTGGCAA CCGGCGGCTA CGAGTTCAAC CAGCAGATGC TCAAGGAGCG CTGCCGTCCG CGCGGCGTGC CGGGCAGCTG GCTGACGGGC GCGTTCGGCA ACACCGGCGA CGGCCATCAG ATGGGCCTGG CGGTGGGCGC CGCCGAAGAC GAGTTCCCGC ACGCCATCAT GCTGGACCCC GAGCAGCTCA TGCCGTATCT GCGCGTGAAC AAGCTGGGCG AGCGCTTCAC GCCCGAGTAC GAGCCGTACG GGCATTTGGC GCTGGCCATC CAGGCGCAAC CGGGCACGTA CGACTTCTAC GTGGTGGACA GCGCCATCGG CGAGAAGATC GACAAGATCT GGACGCCCAG CTCCTCGTGC TACGGCCCGA AGGAAGTGTG GACCGCGGCC GCCATGAGCG AGAAGGCTCT CAAGGCCGAC ACGCTGGAGG AACTGGCGAA GCTCATGGAG GTGCCCGAGG CCCAGTTCGT CGCCACCATC GAGCGCTGGA ACGAGATGGC CGCCGCCGGC AAGGACGAGG ACTTCAACTT CCCTGGCGAG ATGATGATGA CCATCGACAC GCCGCCCTAC TACGCTACGA AGGAGTTCGC CGACGGTCTG TGCACGGCAG GCGGCTTGCT GGTGGACACC GAGTGCCGCG TGCTCGACAA GGATCGCCAG CCCATCGACG GCCTGTTCGC CATCGGCCTC ACGTCGGGCG GCATGTTCTT CAACACGTAT CCGCACAACC TGAACTGCCT GAGCCACACC CGAAACTGTC TTATGGGCTA CACCGTCGGC CAGGTGCTGG GCGACAAAGC GTAA
|
Protein sequence | MNQPDKSTLS RRSFLTGAAA TAGLAAFAGL SGCAPQSTTG DAANAATASG ASASGSASAG AMQTTDPAQA VWPVVEEVEV GAAGEGKIAF VAEPIAASDI VATHDVDVVV CGLGPAGDAA ALACAEQGLK TVAVEKQTTG NYNSATIGGT NSKLHEHWGM TYDTDAWISD AMIDCAFQGD MELYRHWLEK NGEAVDWYIS HFDNQNLDDY PLTFAAGDFP DFRDEWDKTS LSRSWNTSLN LPYPPGELAG ILAGILEQAG VEIRYSCPAC QLVTDDSGKV TGVIVQSDQG FEQYNCAKGV VLATGGYEFN QQMLKERCRP RGVPGSWLTG AFGNTGDGHQ MGLAVGAAED EFPHAIMLDP EQLMPYLRVN KLGERFTPEY EPYGHLALAI QAQPGTYDFY VVDSAIGEKI DKIWTPSSSC YGPKEVWTAA AMSEKALKAD TLEELAKLME VPEAQFVATI ERWNEMAAAG KDEDFNFPGE MMMTIDTPPY YATKEFADGL CTAGGLLVDT ECRVLDKDRQ PIDGLFAIGL TSGGMFFNTY PHNLNCLSHT RNCLMGYTVG QVLGDKA
|
| |