Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1718 |
Symbol | |
ID | 8416017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2023299 |
End bp | 2024468 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024684 |
Product | hypothetical protein |
Protein accession | YP_003182072 |
Protein GI | 257791466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.473614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACG ACATCACGAT TTTGGCGGAC TTCGATTCTG CGCTCTACGA GCAGGTCGAG CCGCCCCTCG TATCGCTCTA CCTGCCCACG CACGGCAGTG CTCCCGGCGA CGAGAGCGAC CGCATCGAGT TCGAGGCCCT CGTGGAGCAG GCGCGCGCGA AGCTCGCCCA GGAGCGCGAG CGCCGCGAAT ACAAAGGCGT CGACGAGCGG CTCGCCTACG CAGCCGAGCA CTTCGACGAT CTGATGAGCC CCGCGCCCGG CGGAAGCCTC GCCGTGCTCG CAGGCAACGA CCGAACCTAC ATCTACCGGC TTGGCTACGA GGCGGGCCCG CTCGCGTTCG TGGGCGAGCG GTTCTACGTC AAGCCGCTGC TGAAGAACTT CCAGTTCGGA TCGCACTACT TCCTACTGGG GCTTTCGGCC GACCGCTTCG CGTTCGTCCA CGGCGACTTC GGCTCGCTCG AGCGCGTGGA GCTGCCCCGC GACGTGCTCG ACGCGTTCAG CGAGGAGTTC CCGCTCGTGT ACGACGGGCA CGAGGGTGCG CTGGACTACT CGTCGCTCGA GAACCATATG CCGCCCTACC ACGGCTGGAA GTCGCGCAAC GACGTGAGGA AGGAGGAGGC CGGAAAGTTC TTCCAGTACG TGAACAAGGC GGTGACCGAC TACCTCGTGG CCGGCACCGA CCTGCCGGTG ATCCTCGTGA GCCTGCCCGA GCACCAAAGC GCCTTCCGCC GCATCTCCAC CATCCCCCAT CTGCTGGACG AGGGCATCGA GAAGGACATC GGCGGCATCG AGGCCCCCGA GCTCTTGTCC GATGCGAAAG CCGTCATCGA GCATGTGCGC GAGGCGCGTG CGACCGAGCT GCTGGAGAAG TTCGGCGATG CCGAGGCGCA CGGCGGCGCG TCATCGGACC TGAAAGCCAT CGGGCTCGCT CTCGTGGAGC GCAAGGTGCG CGCCCTGTTC CTGGCCGAAG GCGCCTACAT CCCCGGCGGC TTCGACGAGC AGACGGGCGA GGTCTTCCTG TTCGAGCGAG AGCCGCACGG CCGCTTCCAG GGCCCCGAGC TGGCCGACGG CTTCGTCCGC GCGGCCCTTG CCCAGGACGC CGACGTGTTC GAGCTTCCCG CCGAGAAGAT TCCCGGCGAC TCCGGCATCG CCGCGCTGTA CCGGTACTAG
|
Protein sequence | MNDDITILAD FDSALYEQVE PPLVSLYLPT HGSAPGDESD RIEFEALVEQ ARAKLAQERE RREYKGVDER LAYAAEHFDD LMSPAPGGSL AVLAGNDRTY IYRLGYEAGP LAFVGERFYV KPLLKNFQFG SHYFLLGLSA DRFAFVHGDF GSLERVELPR DVLDAFSEEF PLVYDGHEGA LDYSSLENHM PPYHGWKSRN DVRKEEAGKF FQYVNKAVTD YLVAGTDLPV ILVSLPEHQS AFRRISTIPH LLDEGIEKDI GGIEAPELLS DAKAVIEHVR EARATELLEK FGDAEAHGGA SSDLKAIGLA LVERKVRALF LAEGAYIPGG FDEQTGEVFL FEREPHGRFQ GPELADGFVR AALAQDADVF ELPAEKIPGD SGIAALYRY
|
| |