Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_20210 |
Symbol | |
ID | 7314345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2178735 |
End bp | 2180432 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 643612465 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_002509761 |
Protein GI | 220932853 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.266778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTATAGATAA TTTTCAGTTA ATCTTTGATT CCAGGGAAAA GATTCAGGAA TTAAGAAATT TAATTTTAAA TTTAGCTGTC AGAGGAAAGC TTGTGCCTCA GAACTCGGAT GATGAACCAT CATCAGTATT GATTGAAAAA ATAAAAAAAG AGAAAGAAAG ATTAATTAAA GAAAAGAAAA TTAGAAAAAA GAAACCCTTA CCACCAATCA AAGAAGAAGA AATTCCTTTT GAATTACCTG AAAGCTGGGA ATGGGTGAGG TTAGGGAATA TCGGAAGGAT AGTTGGAGGT GGAACTCCTA AAACTAAAGT ACATGCGTAT TGGGAAAATG GTAATATTGC ATGGTTAACC CCAGCAGATT TAAACGGGCT TAAATCAAAG TATATTTCTA GGGGACGGAG AAATATTACT AAATTGGGTT TGCAAAATAG TTCAGCAAAA CTTTTGCCCA AAGGAAGCGT ATTGTTTTCA AGTAGAGCTC CTATAGGGTA TGTAGCAATT GCACAAAATG ATTTAGCAAC AAATCAAGGA TTTAAATCAT GTGTTCCATA TATTATGGAT ATGAATCAAT ATATTTATTA TTTTTTGATG TATGATGCCA AAAGAATAAA TGATAATGCT TCAGGGACTA CTTTTAAAGA AGTATCTGGG AAAGAGGTCG CAAATTTTAT TTTCCCATTA CCACCTCTTA ATGAACAGAA ACGTATCGTA AACAAATTGG ATGAGTTAAT GACCTTTTGT GACCAACTTG AAGTATCATT AGAAAAAAAG GCTAACGCAA AACAATTAGT ATCAAAGAGT ATTTCAAATA GAATTCAAAA GAGTAAGAGT AAAGAAGAAT TAGATAAGAA TATAACATTT ATAATTAGAA ACCTTAAAGA AGTATATACA ACACCTGAAA ATTTAAATGA TTTAAAAGAT ATCATTCTCC AGTTGGCTAT TCAGGGTAAG CTTGTGCCTC AGGATCCGGA TGATGAACCA GCATCAGTAT TAATTGAAAA AATAAATAAA GAAAAAGAAA GATTAATTAA AGAAAAGAAA ATTAGAAAAA CCAAACCCTT ACCACCAATA AAAGAAGCTG AAATTCCCTT TGAATTACCT AAAGGCTGGG AATGGGTGAG GTTGGGAGAA ATAATGATAA TTAATCAACG AAATAAATTA AATGATAATT TAGAAGTGTC ATTTGTTCCA ATGAAGCTGA TTGAAGATGG TTATTTAAGT AAACACAGTC ATAAAAAGAA GTTATGGAAA GAAGTAAAAA AAGGTTATAC TCATTTTAAA GAAAATGATT TGGTAGTAGC TAAGATTACT CCTTGCTTTG AAAATAGAAA ATCAGCAATC ATGAAAAATT TATATTCAGG TTATGGAGCT GGGACAACTG AATTGCATGT TCTTACCAGC TACTTAAAAG AAATAGATAT GAAATTTTTC CTCTACATTG TAAAAGCAAA GAATTTTATT AATCAAGGGG TTTCAACTTT TACTGGAACA GCTGGACAGC AAAGAATTAG AAAAGATTTT ATAGAAAATT TTGTTATAGG TTTACCTCCG TTAAATGAAC AGAAACAAAT TGTCAAAAAA ATAGACAAAT TAATGGCCTT ATGTAATTTA CTAGAAAACC AAATTAATAA AAATAGAAAT AATAGTGAGT TATTAATGAA ATCTTTACAA AGGAAATTAT TTGAGTGA
|
Protein sequence | MKKIIDNFQL IFDSREKIQE LRNLILNLAV RGKLVPQNSD DEPSSVLIEK IKKEKERLIK EKKIRKKKPL PPIKEEEIPF ELPESWEWVR LGNIGRIVGG GTPKTKVHAY WENGNIAWLT PADLNGLKSK YISRGRRNIT KLGLQNSSAK LLPKGSVLFS SRAPIGYVAI AQNDLATNQG FKSCVPYIMD MNQYIYYFLM YDAKRINDNA SGTTFKEVSG KEVANFIFPL PPLNEQKRIV NKLDELMTFC DQLEVSLEKK ANAKQLVSKS ISNRIQKSKS KEELDKNITF IIRNLKEVYT TPENLNDLKD IILQLAIQGK LVPQDPDDEP ASVLIEKINK EKERLIKEKK IRKTKPLPPI KEAEIPFELP KGWEWVRLGE IMIINQRNKL NDNLEVSFVP MKLIEDGYLS KHSHKKKLWK EVKKGYTHFK ENDLVVAKIT PCFENRKSAI MKNLYSGYGA GTTELHVLTS YLKEIDMKFF LYIVKAKNFI NQGVSTFTGT AGQQRIRKDF IENFVIGLPP LNEQKQIVKK IDKLMALCNL LENQINKNRN NSELLMKSLQ RKLFE
|
| |