Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3065 |
Symbol | |
ID | 8417400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3563237 |
End bp | 3564346 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645026045 |
Product | protein of unknown function DUF917 |
Protein accession | YP_003183397 |
Protein GI | 257792791 |
COG category | [S] Function unknown |
COG ID | [COG3535] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAGGA AAATCGGCAT CAAAGAGATC GAGGACATGG CGCTTGGCGC GACGGTTCTC GGCGCCGGCG GGGGCGGCGA CCCGTACGTC GGCAAGCTCA TGGCCATCGA GGCCATCAAG AAGTACGGCG AGGTGGAGCT CATCTCGCCC GACGAAGTTC CTGACGACGC CGTGGTGTGC GTGTCCCAGA TGATGGGCGC CCCCACCATC ATGGTGGAGA AGATCTGCAG CGGCCTGGAG CCCATGGCCA CGTACGACGA GCTGGTGAAG GAGCTGGGCC AGGAGCCGTA CGCCATCTAC GCGGTGGAAG CCGGCGGCGT GAACTCCACC ATCCCGTTCA TCCTGGCGGC CACGCGCCGC ATCCCCGTGG TGGACTGCGA TCTCATGGGC CGCGCGTTCC CCGAGCTGCA GATGACCACG CTGGGCATCA ACGGCGTGAA GGGACAGCCC GCCGTCATGG CCGATGAGAA GGGCAACACG GTCACGGTGC GCGCGATCGA CGACAAGTGG CTCGAGCGCA TCTCGCGCCA GGCCACGTCG GTGATGGGCG GTTACACCAT CCTGGCGTCG TATCCGTGCA CGGGGCGCCA GCTCAAGGAC TACTGCATCC CCGACACGCC TACGCTGTGC GAGGAGATCG GCCGCACGCT GCGCGAGGCG CGCGAGCAGC ATGCCGACCC CATCGAGGCC GTGCTGAACG TGACGAACGG GTTTCGCCTG TTCCGCGGCA AAGTGGTGGA CGTCGAGCGC AAGACCGACG GCATGTTCGT GCGCGGTCGC GCCGTGGTGG ACGGGCTCGA CCAGGACAAG GGCAGCCAGC TTATCATCGA GTTCCAGAAC GAGAACCTCA TCGCGCTGCG CGACGGCCAG CCGGTGACCA CGTCGCCCGA CCTCATCATG TCGCTGGACA TGGAGTCCGG CTCGCCCGTG ACTACCGAGG GCCTGAAGTA CGGCGCTCGC ATTGTGGTGG TGGGCATGCC CTGCGCGCCG CAGTGGCGCA CGCCCGAGGG CCTGGCCGTG GTAGGGCCGC GCGCGTTCGG CTACGACATC GACTACGTGC CGGTTGAGCA GCGCGTCGCC GCGATGAACA ACGAGGAGGT GCAGGCGTAA
|
Protein sequence | MRRKIGIKEI EDMALGATVL GAGGGGDPYV GKLMAIEAIK KYGEVELISP DEVPDDAVVC VSQMMGAPTI MVEKICSGLE PMATYDELVK ELGQEPYAIY AVEAGGVNST IPFILAATRR IPVVDCDLMG RAFPELQMTT LGINGVKGQP AVMADEKGNT VTVRAIDDKW LERISRQATS VMGGYTILAS YPCTGRQLKD YCIPDTPTLC EEIGRTLREA REQHADPIEA VLNVTNGFRL FRGKVVDVER KTDGMFVRGR AVVDGLDQDK GSQLIIEFQN ENLIALRDGQ PVTTSPDLIM SLDMESGSPV TTEGLKYGAR IVVVGMPCAP QWRTPEGLAV VGPRAFGYDI DYVPVEQRVA AMNNEEVQA
|
| |