Gene Information |
Locus tag | Elen_0801 |
Symbol | |
ID | 8415091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 999484 |
End bp | 1001178 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645023767 |
Product | SMC domain protein |
Protein accession | YP_003181164 |
Protein GI | 257790558 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | |
| |
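The coordinate and length fields above agree with one another if the coordinates are read as 1-based and inclusive. The following is a minimal Python sketch of that check. The function names are hypothetical and not part of any PanDaTox tooling. It assumes an unspliced CDS that ends in a single stop codon, which is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Elen_0801.
START_BP, END_BP = 999484, 1001178

length_bp = gene_length(START_BP, END_BP)   # 1695
length_aa = protein_length(length_bp)       # 564

if __name__ == "__main__":
    print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Both computed values match the "Gene Length" and "Protein Length" rows of the record.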
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.779455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAG ATAAACTGAC AATTAAAAAC TATAGGAGCG TCCGTGATTT GGAACTCAGC CTGTCGCCGC GCATCAATGT CTTCATTGGG GCAAATAACG TTGGCAAGAG CAATATCCTC TCTGCAATGG AATATCTGCT GGGTCCGTCC TATCCAACAG CCAATCGGCT TGAGCGGTGG GACTTCTACC AGGGCGATGA GGAGCTTCCC CTCAAAATAG CCCTTGATTT CGATGACGGG GCTCACCTTT CATTCGATTC AACCTGGCAC GATGGTTATG GAAGAGAAAA ACACGGCTTA AATTACAACG GTAGCTACAT TTCGGATGAA GTGCGTAGTC GCTATATTTC AGCTTCGATT GGGCCCGACA GGCGCGTTCT CGACAATCCG GCGTCCAGCC AATGGAGCCT GTTGGGCAGG ATGCTCAAGG AATTCAACGA ACGTCTTAGC GAGGAGACGA TCTCGTCTGC CGATGGGCAT ACGGTCACCA AAGCCGAAGC GTTTAAACAG AGCATGCAGG AGATTCGTGA TCAAATACTC TTCTCCATTA CCGACCAAGA CGGTACGAAC CTTATGGGCG AGCTTAGCCG CATTATGCAG CAGGAAACTG CGAATCAGCT CAATTGCTCG CCTAACGATT TGACTGTCGA CTTGAATGCC TACGACCCGT GGAACCTGTA CAAAACACTG CAGATTTTCG TGACCGAGCA GGAGACCGGT GTTCAGATGC GGGCATCTGA CATGGGCATG GGGGTGCAGG CAAGCCTCAC TATAGCTATC CTCCGTGCCT ATTCGAAGCT CAAGTTGAAG AACCAAACGC CGCTGTTTAT CGACGAGCCA GAACTGTATT TACATCCTCA GGCAAGGCGA AAGTTTTATC GCGTGATTGA AGAGCTCGCA GATTCGGGAA CCCAGATATT CCTTACGACT CATTCCACTG AGTTCATTGA TCTGGGCAAC TTTGATCAGA TATACCTTGT GCGCAAGAAC GCCGAGCGAG GGACCTATGT TAGAAAAGCA GATCCCCAGA GTTTTGTAGA TGACCTACAA AACAGGCTCA ATATAAGAAC GGACGCAAAC AGATTGATGC TCGAATACCG CAATGCTTTC GAGAACACGG GCGACTCTCA AAAAGCTGCC GAAGGCCTTT TCGCTTCGAA GGTGTTACTA GTCGAGGGAG AGAGTGAGTC GCTTATCCTG CCGTTTTGCT TCGATAGGAT AGGCTTCGAC TACGATGGAA AAGGCATCTC CATAGTACGC TGCGGCGGCA AAAATGAGCT TGACCGTTTC TATCGCTTAT ACAGCGAATT CGGCATCCCT TGCTTCATCC TTTTCGACGG GGACTTTCAG AATTTCCAAA CCGAAGATCA AGCACACACC ATTAAAGCCA ATAAGAGCAT CCTTTCGCTC TTCGGTTGCT TGGACGATTT CCCTGACGGA AATGTGCATG AGTCATATTT TGGTTTCCGG ACGCTACTCG AGGATAATCT GGGGCTCAAC GGTATTGGCT CAAAAACGAA AGGCCTTCGG CTGTTCGTTA GGTTCAAGAA TGCCGTTTCC CGCGAGGAAG CAGCTGTTCC GTTCTGGGTT AAAGAGATTG CCGACAAGCT TGACGGTTTG CCTAACGAGG CGCGCTCCGT CCTAACTTGC AAATGTGAAC CCCTTGCATG GGATGATGAC TACATCCCTT TTTAG
|
Protein sequence | MKIDKLTIKN YRSVRDLELS LSPRINVFIG ANNVGKSNIL SAMEYLLGPS YPTANRLERW DFYQGDEELP LKIALDFDDG AHLSFDSTWH DGYGREKHGL NYNGSYISDE VRSRYISASI GPDRRVLDNP ASSQWSLLGR MLKEFNERLS EETISSADGH TVTKAEAFKQ SMQEIRDQIL FSITDQDGTN LMGELSRIMQ QETANQLNCS PNDLTVDLNA YDPWNLYKTL QIFVTEQETG VQMRASDMGM GVQASLTIAI LRAYSKLKLK NQTPLFIDEP ELYLHPQARR KFYRVIEELA DSGTQIFLTT HSTEFIDLGN FDQIYLVRKN AERGTYVRKA DPQSFVDDLQ NRLNIRTDAN RLMLEYRNAF ENTGDSQKAA EGLFASKVLL VEGESESLIL PFCFDRIGFD YDGKGISIVR CGGKNELDRF YRLYSEFGIP CFILFDGDFQ NFQTEDQAHT IKANKSILSL FGCLDDFPDG NVHESYFGFR TLLEDNLGLN GIGSKTKGLR LFVRFKNAVS REEAAVPFWV KEIADKLDGL PNEARSVLTC KCEPLAWDDD YIPF
|
| |
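The gene and protein sequences above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch for removing that spacing, computing a GC fraction, and translating the CDS. All helper names are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs from it in the start codons it permits, and this sketch does not model that.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate in frame 1; unknown codons become 'X', a trailing partial codon is ignored."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two and last two display blocks of the gene sequence in the record.
    print(translate("ATGAAGATAG ATAAACTGAC"))  # MKIDKL
    print(translate("TACATCCCTT TTTAG"))       # YIPF
```

Pasting the full "Gene sequence" value into `translate` should give a result that can be compared against the "Protein sequence" row, and `gc_content` on the same input can be compared against the "GC content" row.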