Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1516 |
Symbol | |
ID | 8415814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1809827 |
End bp | 1811014 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 645024484 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_003181873 |
Protein GI | 257791267 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.205219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGAGA CGAAGGACAG TGGCGTTGAC TGGATCGGCG AGGTGCCGGT CAACTGGGAA ATCGTTCCGA TAAAGGCGGA TGTTTCAATT GGACATGGCT CCGACCCCAC AACGCCTGGT GATATTCCGG TTTGGGGCAG CGGGGGCGAA CCCTTTAAAA CCTGTGGCGA GCATAAGAAT GGGCCAGCGG TGCTTCTTGG ACGAAAGGGA ACATTAGATT GCCCCCAACT TGTTACCGGG CTCTACTGGA ACGTCGATAC AGCTTTTGAT GCGAAAATCA CAAGTAAAAA GCTCTCACTG AAGTTCTTCT ACTATGCTGC AACCTGTGTT GACATTAAGC CATATATGAC CAACACGGCT AAGCCGAGTA TGACTCAATT TGACTGGGAC AATTCGAGAA TCCCCCGCCC TCCACTTGCG GAGCAGCGAC GAATCATCTC ATACCTCGAT GAGCGCTGCG CGGCCATCGA CGAGGATGTC GCGAAGCGTC GCGATGTCAT CGGGAAGCTC AAGGAATACA AGAAGTCACT CATTGCGCAT GCCGTGACGA AGGGCCTCGA CCCGAACACA GAGATGAAGG ACAGCGGAGT CGACTGGATC GGCGAGGTGC CGGCGAATTG GCGTCTAACG AAAATCGGAC AGGTCTATGA CCTGCGCAAC ACGAAAGTCA GCGATTGCGA TTACGAGCCA TTGTCCGTAA CCATGCAAGG CATCGTTCCT CAACTTGATA GCGCTGCCAA GACCGATGCG CATGACGATA GGAAGTTGGT TATGGAGGGA GACTTCGTAA TCAACAGCAG GTCAGATAGA CGTGGATCAT GCGGAATAGC AAGACAAGAT GGCTCCGTAT CCCTTATCAA TACGGTACTT ATCCCCCGAG AACATATGGA GCCACGCTTC TATGACTGGT TGTTTCACAC GACCCTATTT GCTGATGAGT TTTATAAGAA CGGTCACGGA ATAGTAGACG ACCTTTGGAC CACAAAATGG GCAGAGATGA AGGGCATCAC CATAGTTGAA CCCCCATTTG AAACGCAAAT AACCGTCGCG AACTACCTCG ACGAGCGCTG CGCGGCCATC GACGAGGCAA TCGCCCGCCA GGAGCAGCTC ATCGAGAAGC TCGGCGAGTA CCGCAAGTCC GTCATCCACC ACGCCGTGAC CGGCAAAATC GACTGTATGG AGGCCTAA
|
Protein sequence | MRETKDSGVD WIGEVPVNWE IVPIKADVSI GHGSDPTTPG DIPVWGSGGE PFKTCGEHKN GPAVLLGRKG TLDCPQLVTG LYWNVDTAFD AKITSKKLSL KFFYYAATCV DIKPYMTNTA KPSMTQFDWD NSRIPRPPLA EQRRIISYLD ERCAAIDEDV AKRRDVIGKL KEYKKSLIAH AVTKGLDPNT EMKDSGVDWI GEVPANWRLT KIGQVYDLRN TKVSDCDYEP LSVTMQGIVP QLDSAAKTDA HDDRKLVMEG DFVINSRSDR RGSCGIARQD GSVSLINTVL IPREHMEPRF YDWLFHTTLF ADEFYKNGHG IVDDLWTTKW AEMKGITIVE PPFETQITVA NYLDERCAAI DEAIARQEQL IEKLGEYRKS VIHHAVTGKI DCMEA
|
| |