Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0532 |
Symbol | |
ID | 8414816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 689909 |
End bp | 691570 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023503 |
Product | fumarate reductase/succinate dehydrogenase flavoprotein domain protein |
Protein accession | YP_003180906 |
Protein GI | 257790300 |
COG category | [C] Energy production and conversion |
COG ID | [COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.642079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAG GCATGGAAGG CGGAGTTTCC CGTCGTTCGT TCCTGAAGGG CGCGGGCGTC GCTGCCGCCG CCGTGGCGGG TTCCGCCGCG CTGGCCGGCT GCGCGTCGGG TTCGACGGGG GCGGCTTCGG GCGATTGGAT GCCGAAGAGC TGGGACTACG AGTGCGACCT GCTGGTCATC GGCTACGGCG GCGCGGGCAT GTGGGCGTCC CTCATCGGCG CCGACGAGTG CGGCCAGGAA GTCATCGTGC TGGAGAAGGC TCCCGAGCGC GGCGGCGGCA ACAGCTCCAT CAACAACGGC GAGTGGACCA TCGTCGAAGA GAACGAGAAG GACCGCTTCG CGAAGTACAT CAAGGCCTTC ACGCACGGCA AGACGCCCGA GCCGATGATC GACGCCTGGG TGAACGAGTG CGCCCGCAAC ACCGAGTACG CCGACAAGTA CGGCATGACC TACGAGGTGT CCGAGGTCGC GCTGGCCGGC GCCATCCCCG AGTACTACTT CTTGGACGAC AACGCCTACG AGGGCTCCTG CAAGCTGTCC TCCGTCGACG GCTTCGGCAT GCTGTCGTTC CACGAGCTGG ACGCGCAGCG CGAGAAGCTG GGCGTGCAGG TCATGTTCGA CTGCCATGAC GAGCGCCTCA TCCAGAACCC GGACACCAAG GAGATCGTCG GCGCGTACAC GATGATCGGC TCCGAGGAGA AGGCGGTGAA GGCTCGCAAG GGCGTCATCC TGACGCTGGG CGGCTTCGAG TTCAACGAGG ACCTCAAGAA CGAGTACCTC AAGTGCTACC CCTTCAAGTT CGAGGGCTGG CAGTACAACA CGGGCGACGG CATCAAGATG GTCGAGGACG TGGGAGCGAA GCTGTGGCAC ATGGACATGG CGATCTCCAT GTACTCCATG TGGACGCGCG ACCCCGAGAA CGACTTCTCC ATCCTGTACT TCATGCCCGG TTTCAGCTAC TTCAACGTGA ACCGTCTGGG CAAGCGCTAC GTCAACGAGA ACAGCATGGG CTCGCCGCAC AACGGCTGGC ACACGCTGCT GTCGTTCGAC GACTCCATCG ACGACTTCGA CCGCATCCCG TCGTGGGGCA TCTTCGACCA GACCTGCTTC GACGCCGGCA AGCTGTCCAC CAGCCAGGGC GATTTCTTCG AGTGCGGCAA CTTCGCCAGC GACCTGCCCG ACGAGATCCG CGCATGGAAC GGCTGGAGCC AGGACAACAA GGCTGAGCTG GATAAGGGCT GGATCCTCAA GGGCGACACC ATCGAGGAGC TTGGCAAGAA GATCAAGGAG TTCGACCACT GGATGGACGT CGACGCCTTG AAAGCCACGT TCGACGAGTA CCAGGCATTC TGCGAGGCAA AGAAGGACGC GCGTTTCGAC CGCTCCGAGA AGACGCTCGA GAAGCTGGAC GACGGCCCGT ACTACGCCAT CTCCATCTAC CCCGGCTCCT GCTCCACCTT GGGCGGCCCG ATGAAGAACG AGCACGGCCA GGTGCTGGAT CCCGCCGAGA ACCCCATCCC GCGTCTGTAC GCGGCGGGCT GCTTCGGCAA CTTCCAGAGC CACAGCTACG GCATCACCGG CGGCAACAAC GCCGAGAACC AGGTGTGGGG CCGCATTTCC GCCCGTCACG CCGCGGGTCT CGAGCCTTGG GACGCCAAGT AG
|
Protein sequence | MSKGMEGGVS RRSFLKGAGV AAAAVAGSAA LAGCASGSTG AASGDWMPKS WDYECDLLVI GYGGAGMWAS LIGADECGQE VIVLEKAPER GGGNSSINNG EWTIVEENEK DRFAKYIKAF THGKTPEPMI DAWVNECARN TEYADKYGMT YEVSEVALAG AIPEYYFLDD NAYEGSCKLS SVDGFGMLSF HELDAQREKL GVQVMFDCHD ERLIQNPDTK EIVGAYTMIG SEEKAVKARK GVILTLGGFE FNEDLKNEYL KCYPFKFEGW QYNTGDGIKM VEDVGAKLWH MDMAISMYSM WTRDPENDFS ILYFMPGFSY FNVNRLGKRY VNENSMGSPH NGWHTLLSFD DSIDDFDRIP SWGIFDQTCF DAGKLSTSQG DFFECGNFAS DLPDEIRAWN GWSQDNKAEL DKGWILKGDT IEELGKKIKE FDHWMDVDAL KATFDEYQAF CEAKKDARFD RSEKTLEKLD DGPYYAISIY PGSCSTLGGP MKNEHGQVLD PAENPIPRLY AAGCFGNFQS HSYGITGGNN AENQVWGRIS ARHAAGLEPW DAK
|
| |