Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2088 |
Symbol | |
ID | 8416406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2455493 |
End bp | 2456278 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645025071 |
Product | 4Fe-4S ferredoxin iron-sulfur binding domain protein |
Protein accession | YP_003182440 |
Protein GI | 257791834 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000857475 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000000100219 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAGA CGAATCAGCG CGCGACCGAC GCGCAGCATA AAGGCGGCCT CTCCAGGCGC CAGTTCATCG CCGGGATCGG CGGATTGGGA ATCGGCGCCG TGCTGGGAAG CGGCATCACG GCGCTGCTGT TGCCCGACGA CGTGTACGCC ATCGAGGCGA GCCAAGGCTA CCTGCTGGTG GATGCGAAGA AGTGCGCCGG CTGCGAGACG TGCGTCATCT CGTGCTCGCT CGCGCATCTG GGCCGTATCA ACACCTCGCT TTCGCGCATC CAGGTGATGA AGAACGCGCT GGGAAGCTTC CCCTCCGACG ACGTCATGCA GAACCAGTGC CGCCAATGCC CCTACCCTTC CTGCGTGGAA GCCTGCCCCG TGGGCGCCAT GCACGCCGAC CCCGAGACGG GCGTGCGCCT CGTGGACGAG GGCAAGTGCA TCGGCTGCGA GCGCTGCGTG GAGGCGTGCC CCTTCACGCC GTCGCGCGTG CAGTGGAACT TCGAGGACAA GCACGCCCAG AAGTGCGACC TGTGCAAGAA CACGCCCTTC TGGGATGAAG AGGGCGGCCC GTCCGGCAAG CAGCTGTGCG TCGAGATCTG CCCCATGAAG GCCATCGCGT TCACCAACGT GCTGCCCGTC CAAACCGACG AAGGCTACAC GGCGAACCTG CGCAACGACC ACTATCTGGA AATCGGCCTG CCCAGCGACG ACGAGGCGCG CATCCCTCCC GCGCGGCTCG GATACGGCGC AGGCGGCACG GCAGCCCAGG CGGCAGCCGG CTCGAACGAC AAATAA
|
Protein sequence | MTETNQRATD AQHKGGLSRR QFIAGIGGLG IGAVLGSGIT ALLLPDDVYA IEASQGYLLV DAKKCAGCET CVISCSLAHL GRINTSLSRI QVMKNALGSF PSDDVMQNQC RQCPYPSCVE ACPVGAMHAD PETGVRLVDE GKCIGCERCV EACPFTPSRV QWNFEDKHAQ KCDLCKNTPF WDEEGGPSGK QLCVEICPMK AIAFTNVLPV QTDEGYTANL RNDHYLEIGL PSDDEARIPP ARLGYGAGGT AAQAAAGSND K
|
| |