Gene Information |
Locus tag | Elen_2087 |
Symbol | |
ID | 8416405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2454630 |
End bp | 2455487 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645025070 |
Product | hypothetical protein |
Protein accession | YP_003182439 |
Protein GI | 257791833 |
COG category | [C] Energy production and conversion |
COG ID | [COG2864] Cytochrome b subunit of formate dehydrogenase |
TIGRFAM ID | [TIGR01583] formate dehydrogenase, gamma subunit |
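The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 858 bp reported) and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(2454630, 2455487)
print(length)                  # 858
print(protein_length(length))  # 285
```

Both results agree with the Gene Length (858 bp) and Protein Length (285 aa) fields in the record.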
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000659531 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000000000148039 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCTGGT TCGATCAAGC GCCCTGGCTC GTCGCCCTGG CGCCGTTTTT CGGTCTGTTC CTTTCCGCAT TCGCGAAACG AACAGACCCC TTTATCGCAG GCGACCGGGT GTACCGCCAC GACGCCCCCG CGCGCCTGTC CCACTGGACC CATGGCATCG GCACCGCCGT GTGCCTCGCG TCGGGCATCG TGCTGGGACT GAGGTTCACC CCGGCGTTCG TGGAGGACGG CCCGGCCGCC ATCCTATGGC AGAACGTCCA TTTCGCGGCC GCGATCGTGT TCCTGTTCGG GACGTTCTAC TACCTGGGCA ACACGATCAT CTCGAAATGG CGCTTGCGCG AGCACCTTCC CACGAAGAAC GTGGTCGCCT ACACGGTGCG CCACTACGGC CTGCTCGTGG GCATCAAGAA GTTCACGATG CCGCCCGAGG ACAAGTACTT CGAAAGCGAG AAGGCCGCCT ACGTGATGGC CGTGGTCACG GCCGCGTTGC TGGTGGTGAC AGGCCTGGTC AAGGCATTGG CCCACGTGGT GCTCACGCTG CCCGACGGCC TCATGAACGT CATGTTCTGG GTGCACGACA TCGCGGCCGT GCTCATGCTG CTGTTCTTGG CGGCGCACGT GTTCTTCGCC GTGATCGCGC CGTTCTCGTG GAAGACCTTC CCGTCCATGC TGATCGGATG GATGCCGCGC GGCGAGGCCG AAAAGGAACA TGCAGGTTGG ATGGAGCGGC TGGAGCGCGA GCAGCCCGAG CGCGGCATGG ACGAGTCCGC ACTCGGCGCC GGAGAGCGCG CGGCCGCGCA AGAGACGACG GGAGCCGCCG CGCGCGCAGC AGCCGCCGAC GGCACGCAAG GGAGATGA
|
Protein sequence | MPWFDQAPWL VALAPFFGLF LSAFAKRTDP FIAGDRVYRH DAPARLSHWT HGIGTAVCLA SGIVLGLRFT PAFVEDGPAA ILWQNVHFAA AIVFLFGTFY YLGNTIISKW RLREHLPTKN VVAYTVRHYG LLVGIKKFTM PPEDKYFESE KAAYVMAVVT AALLVVTGLV KALAHVVLTL PDGLMNVMFW VHDIAAVLML LFLAAHVFFA VIAPFSWKTF PSMLIGWMPR GEAEKEHAGW MERLEREQPE RGMDESALGA GERAAAQETT GAAARAAAAD GTQGR
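The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is an illustrative example, not the database's own pipeline. It assumes the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model; the gene here starts with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon table in NCBI ordering (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGCCCTGGT TCGATCAAGC GCCCTGGCTC"))  # MPWFDQAPWL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.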