Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2136 |
Symbol | |
ID | 8416458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2510674 |
End bp | 2511843 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025123 |
Product | NLP/P60 protein |
Protein accession | YP_003182488 |
Protein GI | 257791882 |
COG category | [S] Function unknown |
COG ID | [COG3883] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000118144 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.895402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC ATACAATCGG ACTGAGCAGG CGAACATTCC TCACCGGCGC GGCCGCGCTC GGCGCCCTTT CCGTTCTGGC TCCGACGACC GCGTTCGCAG AGACGGCTGC TGAGAAGCAG GCCGAGGCCG ATGCGGTGCG CAACCAGCTG ATCGGCTTGC AGGCCGATCT CGAGGCTGCC GAAATCAGCT ATTACTCCGC CCTCGACGAG CGCGATGCCG CGCAGAAGGC GATGGAGGAT CAGCAGGCCA AAATCGACGA CGCTAACAGC CAGATCAGCG ATCTGCAGGA CAAGCTGGGC ACGCGTGCCC GCAACATGTA CCGCAACGGA TCCACGAGCT TCGTCGATTT CGTCCTGGGT GCCGCCTCGT TCGAAGAGTT CACCCAGAAT TGGGATCTTC TCAACAAGAT GAACGAGAAC GACGCCGATA TGGTCGACCA GACGAAGACC TTGCGCGAAG AGCTGCAGGC TGCCAAGGAC GAATTCGCTC GCCAAGAGCA AATCGCCTCG GCCAAGGCTG CGGAAGCCAA ACAGATCCAA AGCGATGTCC AGGCCAAGGT CGACCAGGCT ACCGAGCTGG TCAGCTCCCT CGATGCCGAA GCGCAGGAGC TCCTTCAGCA GGAGCAGGCT GCGGCGGCGG CCGCTGCGGC GGCGGAGGCT GCGGCCGAGG CCGAGCGTCA GCGCCAGGCG GAGCAGGCGG TGAATCCCGG CGGCGGCGGT GGCGGCGCTA GCGGCGGTTC AGGTTCCGGC TCCGGCGGCG GAAGCGGTTC GTCCGGCGGC GGCGGTGGTG GCGGTTCTGT AGTGTATCCG TCCCGTCCGG TCGGTTCCTA CGACTCGGTC GTGGGCTACG CTATGAGCCG TATCGGCTGC CCCTACATCT GGGGTGCCGA AGGCCCCGAC TCCTTTGACT GCTCCGGCTT GGTCACGTGG GCGTACCGCC AGGTGGGCAT GTATCTGCCG CACCAGAGCG AGGCGCAGTA CGCGGCAGCC GCGCGCGTCG TATCGGTTTC CGAGGCGCGT CCGGGCGACG TGCTGTGGCG TTACGGTCAC GTCGGTATCG CGGTGAGTGC AGGCGGCTCG CACTACGTGC ACGCTCCCAC CTTCAACGCG TACGTGCGCG ACACCGATCC GCTGTCGTGG GCGCAGTTCA CGAACGCGCT GCAGTTCTAA
|
Protein sequence | MSEHTIGLSR RTFLTGAAAL GALSVLAPTT AFAETAAEKQ AEADAVRNQL IGLQADLEAA EISYYSALDE RDAAQKAMED QQAKIDDANS QISDLQDKLG TRARNMYRNG STSFVDFVLG AASFEEFTQN WDLLNKMNEN DADMVDQTKT LREELQAAKD EFARQEQIAS AKAAEAKQIQ SDVQAKVDQA TELVSSLDAE AQELLQQEQA AAAAAAAAEA AAEAERQRQA EQAVNPGGGG GGASGGSGSG SGGGSGSSGG GGGGGSVVYP SRPVGSYDSV VGYAMSRIGC PYIWGAEGPD SFDCSGLVTW AYRQVGMYLP HQSEAQYAAA ARVVSVSEAR PGDVLWRYGH VGIAVSAGGS HYVHAPTFNA YVRDTDPLSW AQFTNALQF
|
| |