Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1646 |
Symbol | |
ID | 8415945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1947247 |
End bp | 1948122 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645024615 |
Product | protein of unknown function DUF558 |
Protein accession | YP_003182003 |
Protein GI | 257791397 |
COG category | [S] Function unknown |
COG ID | [COG1385] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00046] RNA methyltransferase, RsmE family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.600892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00800846 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCCTGC AGCATTTCTA CCTGCACGAC CAGGTGCTTG CCGACGAGGG GGCGCCGACT TTCCCGCTGC GTTTGTCGCC GGACGACGCG AAGCACGCCC GCGTGCTGCG GCTTGCGCCG GGCGAGCATA TCGCGGTGGT GGACGCCGCG CAGGACTACT TCGAGTGCGA GATCGCCGCT TTCGACGATG CGGTCCCGGT GGTGCGCATC GCGCAGCGCC TCGACGACGA GGAGCGCCCC CTCGTGATGC TCGTGCAGGG CCTGGCCAAG GGCGACAAGA TGGAGACGGT GATCCGCCAC GCCACCGAGC TGGGCGTTTC GGCGTTCGTT CCGATGTCGT GCGAGCGCTC CATCGTGAGG CTCGATGCGA GGAAGGCCGC GGCGAGGACG CAGCGATGGC GCGCCATCGC GAAGAGCGCG GCCATGCAGT CCGGCCAGCG CGCGTGTCCC GAGGTGGGCG AGCCGATGGC GCTGGCGGAT GTGTGCGCCT CCTTGGCGCA CGCCACCGCC GTGCTGGTGT GCTGGGAGGA GGCGCTGCTG GACGCGCGCA TCGAGAAGGC GCTCGAGCGC GGCCTCCGGC TCGACCGCGC GCTTCCCGAG AACGCGCGCA TCGCCGTGGT GGTGGGCCCC GAAGGGGGCC TTTCGCGAAG CGAGGCGGAC GCGCTCCTGG CCTGCAACCC GCGTGCGTCG CTCGTGTCGC TGGGCTCGTC CATCCTGCGA ACCGAGACGG CCGGCATCGT GGCTCCCGCG CTCGTCCTGC ACGAGCTGGG CCGCATGATG CACGACGCGC GCGAAGCGCA GGAACGCACG TCCGCCCATG TCGAAAGCCG TCGCGAGGAA GCCGGGGACG CGTGCGAGGC AGGCGGGCAA TCGTGA
|
Protein sequence | MSLQHFYLHD QVLADEGAPT FPLRLSPDDA KHARVLRLAP GEHIAVVDAA QDYFECEIAA FDDAVPVVRI AQRLDDEERP LVMLVQGLAK GDKMETVIRH ATELGVSAFV PMSCERSIVR LDARKAAART QRWRAIAKSA AMQSGQRACP EVGEPMALAD VCASLAHATA VLVCWEEALL DARIEKALER GLRLDRALPE NARIAVVVGP EGGLSRSEAD ALLACNPRAS LVSLGSSILR TETAGIVAPA LVLHELGRMM HDAREAQERT SAHVESRREE AGDACEAGGQ S
|
| |