Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_04460 |
Symbol | |
ID | 7314425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 479792 |
End bp | 482824 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610869 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002508199 |
Protein GI | 220931291 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACCAAAA ACTGGGACGA ATACCATCGG GTTGAAAAAA GGGCACTCAA CTTATTTGAA AGGTTAGGTT ACAAGGTTTT CGATATTAAC AAGACCATGG AACGCCCTGC AAGGAATTCA GAACACGAAG TTTTGTTACT GGATAATTTA AAAAAGGGAA TCAAGAGAAT TAACCCCTGG ATTTCCGAAA ACAACCTGAA TAAAGCTGTC AATATGATTC GTCCTGCACG GATTAAGGCT ACCAATTTGC TGGAAGCCAA TGAAATTATA TATGAAAGGT TAGTTAAACA TGTTTCTGTA CTGCAGGACC TGGGCCAGGG CAAGAAAAAT CAGACGGTTA AATATATAGA TTTTGAAGAT CCGGATAATA ACGAGTTTCT GGTGATGAAC CAGTATAAGG TAAAAGGAAG AGAAAATATT ATTCCTGATA TAGTTGTCTT TATCAACGGT ATTCCGATTG GGGTTATAGA GTGCAAGGTT GATACAATTG ACGAACCAGA AGAAAAAGCC ATCGAACAGT TGAGAAGATA TCAGAATATC AGGGAATATG ACCTTGAAGA AGGTGCCGAA CAGTTATTTT ATACCAACCA GGTACTGGTG GCAGCCTGGA AGGATTCTGC TTCAGCCAGT ACCATCGGTG CCCCTGCCAG GCAGTTTAAA GCCTGGAAGG ACCCCTATCC CAGTACCATA GATAATATAA CAAAACTTGT AGGAGAAGAA CCTACAATGC AGGATATTCT GTTATACTCC ATGTTTAAGA AGGAACGCCT CTTGGACTTA ATACAGAACT TCATTGTTTT TGAGCGGGAA GGTAATGGAA TAGTTAAAAA ACTGGCCAGA TATCAGCAAT ACAGAGCTGT CTGTAAAGCT GTTGAGAGGA TCAAAAATGC CAGAAAATTA ACAGAGAGAA GCGGTACAGT CTGGCATACC CAGGGTTCGG GCAAGTCCTT AACCATGCTC TTTTTAGCTC TGAAGTTAAG GCGAATGAAA GAACTGGAGA ATCCTACTCT ATTAATTGTG ACTGACAGAA GGGATCTCGA TGAACAGATT ACAGGAACTT TTAAAAAATG TGGTTTTCCC AATCCCATCA GGGCTAAAAG TGTAAAGGAT TTAAAAGAAA AACTCAGACT TGATGCAGGC AAGACCATTA TGACTACAGT TCAGAAGTTT CAGGAAAGGG ATAATGACAA ATACCCGGTC TTAAGTGAAG ATACCAATAT CTTTGTGATG GTGGATGAGG CTCATCGTAC CCAGTATAAG GATCTGGCAG CCAATATGAG GAGGGCTTTG CCCAATGCCT GTTATCTTGG TTTTACAGGT ACGCCAATAG ATAAAAAAGC AAGGAGTACC ATCAGGACCT TTGGTACATA TATTGACACC TATACCATAG AAGAATCGGT AGAAGATGGG GCTACTTTAC CTATTTTCTA TGAAGCCAGG CTGGCTGATT TAAGGGTTGA AGGCCGGGAT CTGGATAAGT TGTTTGATAG AATTTTTAAG GATTATACTG ATGAAGAAAA GCAGAAGATT AAAGAAAAAC ATGCTACAGA AAAAGATATT GCTGAGGCCA GTTCAAGGAT TGAAAAGATA TGCCTTGATA TTATTGAACA TTATGAATCC AAAATCTACC CTTTTAAGGC CCAGATAGTA ACTGTTAGTA GAGAAGCTGC AGCTAAATAT AAGGAGACAC TTGATAATTT GAATGGTCCT GAGTCTGCAG TTATCATTAG TGGAGATAGA ACTGATAAAG GATTAATAAA AAAATATATA ACAACAGGTG ATGAAAGAAA AGAATTAATT AAAAGATTCA AAGATTATCA TGATAGTCTG AAGTTTTTAA TTGTATGTGA TATGCTACTG ACAGGCTTTG ATGCTCCTGT TGAACAGGTG ATGTATCTTG ATAAACCCTT AAAAGAATAT AATTTACTCC AGGCTATAGC CAGGGTAAAT AGGCGATATG ATAACAAAAA CTTTGGTCTG GTAGTCGATT ATTATGGAGT TTTTGATCAT TTGAAAGAAG CACTGGAAAT CTTTAATAAA AAGGATATCG AAAATGCTGT TACGCCAGTT AAAGATGAAA AGCCCAGGCT GGAAAGAAAT TACAGGGGAG TAATGAGGCT TTTTGATGGA GTTAATATGG ATAATCTGGA TCAATGCATC CTTGCCTTTA AAGAGGAAGA TCAAAGAATT AAGTTTAAAA ATGCCTTTAA AGCCTTTGCC CGCAGTATGG ACATTATCAT GCCTGACCCC ATTGCTGACC CATATCGTGA GGATTTGAAA AAGCTGGGTA AAATTTATAA GGCAGTACGC AATCATTACC GGGATAAAAA CCTGAATATT AAAGGAGTAG GGGATAAAGT TAAAAAATTA ATTGATAAGC ATATCATGGC CACTGACATT AAAATCTTAA GTGAACCTGT ATCCATTCTT GACGAAGAAA AATTTGAAGA AACCATTAAT GAAATAAGAA ATAAAGAAAC CAGGGCCAGT GAAATGGAAC ATGCCATTAG AAATGAAATT AGTATAAAAA TTGATGAAAA TCCTGCCTAT TATCAATCAT TAAAAGAAAG GCTTGAGGAG TTAATTGAAA GAAGAAAGCA GGGTATGCTT GATTTTGCAG AACAAATAGA AGAGATGAAA GAAATTATAA ATGATATAAG AAATGTTAGG TCAAAGGCTG AGAGGTTGGG GCTAAATGAG AAGGAGTTTG CCCTTTATGA ACTTCTGGTT GATGAGTTAG AACCATATTA TACAGAGGAA GTGGCTGACC CGCCGGTAAA ATATAATGCA GGCAAACAAT CACGCACTGA TATAAAAATC AATGAAAAGG TAAAAAATCT AGCTCAATCA TTAATCAACG AACTGGAGGA TATGGCTGTA TTTGAATGGT ATAAAAAGGA GATTGTATTA AAGAACATGC GTAGAAAAAT CAAACTTTCT CTGGCAGGTT ATAAGGAATT TAGGAATAAA CTTGACAGTC TTACTACAAA AATTATAAAG CTTGCTCGAA ATATTTTGAT GATGATGCTT TAG
|
Protein sequence | MTKNWDEYHR VEKRALNLFE RLGYKVFDIN KTMERPARNS EHEVLLLDNL KKGIKRINPW ISENNLNKAV NMIRPARIKA TNLLEANEII YERLVKHVSV LQDLGQGKKN QTVKYIDFED PDNNEFLVMN QYKVKGRENI IPDIVVFING IPIGVIECKV DTIDEPEEKA IEQLRRYQNI REYDLEEGAE QLFYTNQVLV AAWKDSASAS TIGAPARQFK AWKDPYPSTI DNITKLVGEE PTMQDILLYS MFKKERLLDL IQNFIVFERE GNGIVKKLAR YQQYRAVCKA VERIKNARKL TERSGTVWHT QGSGKSLTML FLALKLRRMK ELENPTLLIV TDRRDLDEQI TGTFKKCGFP NPIRAKSVKD LKEKLRLDAG KTIMTTVQKF QERDNDKYPV LSEDTNIFVM VDEAHRTQYK DLAANMRRAL PNACYLGFTG TPIDKKARST IRTFGTYIDT YTIEESVEDG ATLPIFYEAR LADLRVEGRD LDKLFDRIFK DYTDEEKQKI KEKHATEKDI AEASSRIEKI CLDIIEHYES KIYPFKAQIV TVSREAAAKY KETLDNLNGP ESAVIISGDR TDKGLIKKYI TTGDERKELI KRFKDYHDSL KFLIVCDMLL TGFDAPVEQV MYLDKPLKEY NLLQAIARVN RRYDNKNFGL VVDYYGVFDH LKEALEIFNK KDIENAVTPV KDEKPRLERN YRGVMRLFDG VNMDNLDQCI LAFKEEDQRI KFKNAFKAFA RSMDIIMPDP IADPYREDLK KLGKIYKAVR NHYRDKNLNI KGVGDKVKKL IDKHIMATDI KILSEPVSIL DEEKFEETIN EIRNKETRAS EMEHAIRNEI SIKIDENPAY YQSLKERLEE LIERRKQGML DFAEQIEEMK EIINDIRNVR SKAERLGLNE KEFALYELLV DELEPYYTEE VADPPVKYNA GKQSRTDIKI NEKVKNLAQS LINELEDMAV FEWYKKEIVL KNMRRKIKLS LAGYKEFRNK LDSLTTKIIK LARNILMMML
|
| |