Gene Hore_04460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04460 
Symbol 
ID7314425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp479792 
End bp482824 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content37% 
IMG OID643610869 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002508199 
Protein GI220931291 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCAAAA ACTGGGACGA ATACCATCGG GTTGAAAAAA GGGCACTCAA CTTATTTGAA 
AGGTTAGGTT ACAAGGTTTT CGATATTAAC AAGACCATGG AACGCCCTGC AAGGAATTCA
GAACACGAAG TTTTGTTACT GGATAATTTA AAAAAGGGAA TCAAGAGAAT TAACCCCTGG
ATTTCCGAAA ACAACCTGAA TAAAGCTGTC AATATGATTC GTCCTGCACG GATTAAGGCT
ACCAATTTGC TGGAAGCCAA TGAAATTATA TATGAAAGGT TAGTTAAACA TGTTTCTGTA
CTGCAGGACC TGGGCCAGGG CAAGAAAAAT CAGACGGTTA AATATATAGA TTTTGAAGAT
CCGGATAATA ACGAGTTTCT GGTGATGAAC CAGTATAAGG TAAAAGGAAG AGAAAATATT
ATTCCTGATA TAGTTGTCTT TATCAACGGT ATTCCGATTG GGGTTATAGA GTGCAAGGTT
GATACAATTG ACGAACCAGA AGAAAAAGCC ATCGAACAGT TGAGAAGATA TCAGAATATC
AGGGAATATG ACCTTGAAGA AGGTGCCGAA CAGTTATTTT ATACCAACCA GGTACTGGTG
GCAGCCTGGA AGGATTCTGC TTCAGCCAGT ACCATCGGTG CCCCTGCCAG GCAGTTTAAA
GCCTGGAAGG ACCCCTATCC CAGTACCATA GATAATATAA CAAAACTTGT AGGAGAAGAA
CCTACAATGC AGGATATTCT GTTATACTCC ATGTTTAAGA AGGAACGCCT CTTGGACTTA
ATACAGAACT TCATTGTTTT TGAGCGGGAA GGTAATGGAA TAGTTAAAAA ACTGGCCAGA
TATCAGCAAT ACAGAGCTGT CTGTAAAGCT GTTGAGAGGA TCAAAAATGC CAGAAAATTA
ACAGAGAGAA GCGGTACAGT CTGGCATACC CAGGGTTCGG GCAAGTCCTT AACCATGCTC
TTTTTAGCTC TGAAGTTAAG GCGAATGAAA GAACTGGAGA ATCCTACTCT ATTAATTGTG
ACTGACAGAA GGGATCTCGA TGAACAGATT ACAGGAACTT TTAAAAAATG TGGTTTTCCC
AATCCCATCA GGGCTAAAAG TGTAAAGGAT TTAAAAGAAA AACTCAGACT TGATGCAGGC
AAGACCATTA TGACTACAGT TCAGAAGTTT CAGGAAAGGG ATAATGACAA ATACCCGGTC
TTAAGTGAAG ATACCAATAT CTTTGTGATG GTGGATGAGG CTCATCGTAC CCAGTATAAG
GATCTGGCAG CCAATATGAG GAGGGCTTTG CCCAATGCCT GTTATCTTGG TTTTACAGGT
ACGCCAATAG ATAAAAAAGC AAGGAGTACC ATCAGGACCT TTGGTACATA TATTGACACC
TATACCATAG AAGAATCGGT AGAAGATGGG GCTACTTTAC CTATTTTCTA TGAAGCCAGG
CTGGCTGATT TAAGGGTTGA AGGCCGGGAT CTGGATAAGT TGTTTGATAG AATTTTTAAG
GATTATACTG ATGAAGAAAA GCAGAAGATT AAAGAAAAAC ATGCTACAGA AAAAGATATT
GCTGAGGCCA GTTCAAGGAT TGAAAAGATA TGCCTTGATA TTATTGAACA TTATGAATCC
AAAATCTACC CTTTTAAGGC CCAGATAGTA ACTGTTAGTA GAGAAGCTGC AGCTAAATAT
AAGGAGACAC TTGATAATTT GAATGGTCCT GAGTCTGCAG TTATCATTAG TGGAGATAGA
ACTGATAAAG GATTAATAAA AAAATATATA ACAACAGGTG ATGAAAGAAA AGAATTAATT
AAAAGATTCA AAGATTATCA TGATAGTCTG AAGTTTTTAA TTGTATGTGA TATGCTACTG
ACAGGCTTTG ATGCTCCTGT TGAACAGGTG ATGTATCTTG ATAAACCCTT AAAAGAATAT
AATTTACTCC AGGCTATAGC CAGGGTAAAT AGGCGATATG ATAACAAAAA CTTTGGTCTG
GTAGTCGATT ATTATGGAGT TTTTGATCAT TTGAAAGAAG CACTGGAAAT CTTTAATAAA
AAGGATATCG AAAATGCTGT TACGCCAGTT AAAGATGAAA AGCCCAGGCT GGAAAGAAAT
TACAGGGGAG TAATGAGGCT TTTTGATGGA GTTAATATGG ATAATCTGGA TCAATGCATC
CTTGCCTTTA AAGAGGAAGA TCAAAGAATT AAGTTTAAAA ATGCCTTTAA AGCCTTTGCC
CGCAGTATGG ACATTATCAT GCCTGACCCC ATTGCTGACC CATATCGTGA GGATTTGAAA
AAGCTGGGTA AAATTTATAA GGCAGTACGC AATCATTACC GGGATAAAAA CCTGAATATT
AAAGGAGTAG GGGATAAAGT TAAAAAATTA ATTGATAAGC ATATCATGGC CACTGACATT
AAAATCTTAA GTGAACCTGT ATCCATTCTT GACGAAGAAA AATTTGAAGA AACCATTAAT
GAAATAAGAA ATAAAGAAAC CAGGGCCAGT GAAATGGAAC ATGCCATTAG AAATGAAATT
AGTATAAAAA TTGATGAAAA TCCTGCCTAT TATCAATCAT TAAAAGAAAG GCTTGAGGAG
TTAATTGAAA GAAGAAAGCA GGGTATGCTT GATTTTGCAG AACAAATAGA AGAGATGAAA
GAAATTATAA ATGATATAAG AAATGTTAGG TCAAAGGCTG AGAGGTTGGG GCTAAATGAG
AAGGAGTTTG CCCTTTATGA ACTTCTGGTT GATGAGTTAG AACCATATTA TACAGAGGAA
GTGGCTGACC CGCCGGTAAA ATATAATGCA GGCAAACAAT CACGCACTGA TATAAAAATC
AATGAAAAGG TAAAAAATCT AGCTCAATCA TTAATCAACG AACTGGAGGA TATGGCTGTA
TTTGAATGGT ATAAAAAGGA GATTGTATTA AAGAACATGC GTAGAAAAAT CAAACTTTCT
CTGGCAGGTT ATAAGGAATT TAGGAATAAA CTTGACAGTC TTACTACAAA AATTATAAAG
CTTGCTCGAA ATATTTTGAT GATGATGCTT TAG
 
Protein sequence
MTKNWDEYHR VEKRALNLFE RLGYKVFDIN KTMERPARNS EHEVLLLDNL KKGIKRINPW 
ISENNLNKAV NMIRPARIKA TNLLEANEII YERLVKHVSV LQDLGQGKKN QTVKYIDFED
PDNNEFLVMN QYKVKGRENI IPDIVVFING IPIGVIECKV DTIDEPEEKA IEQLRRYQNI
REYDLEEGAE QLFYTNQVLV AAWKDSASAS TIGAPARQFK AWKDPYPSTI DNITKLVGEE
PTMQDILLYS MFKKERLLDL IQNFIVFERE GNGIVKKLAR YQQYRAVCKA VERIKNARKL
TERSGTVWHT QGSGKSLTML FLALKLRRMK ELENPTLLIV TDRRDLDEQI TGTFKKCGFP
NPIRAKSVKD LKEKLRLDAG KTIMTTVQKF QERDNDKYPV LSEDTNIFVM VDEAHRTQYK
DLAANMRRAL PNACYLGFTG TPIDKKARST IRTFGTYIDT YTIEESVEDG ATLPIFYEAR
LADLRVEGRD LDKLFDRIFK DYTDEEKQKI KEKHATEKDI AEASSRIEKI CLDIIEHYES
KIYPFKAQIV TVSREAAAKY KETLDNLNGP ESAVIISGDR TDKGLIKKYI TTGDERKELI
KRFKDYHDSL KFLIVCDMLL TGFDAPVEQV MYLDKPLKEY NLLQAIARVN RRYDNKNFGL
VVDYYGVFDH LKEALEIFNK KDIENAVTPV KDEKPRLERN YRGVMRLFDG VNMDNLDQCI
LAFKEEDQRI KFKNAFKAFA RSMDIIMPDP IADPYREDLK KLGKIYKAVR NHYRDKNLNI
KGVGDKVKKL IDKHIMATDI KILSEPVSIL DEEKFEETIN EIRNKETRAS EMEHAIRNEI
SIKIDENPAY YQSLKERLEE LIERRKQGML DFAEQIEEMK EIINDIRNVR SKAERLGLNE
KEFALYELLV DELEPYYTEE VADPPVKYNA GKQSRTDIKI NEKVKNLAQS LINELEDMAV
FEWYKKEIVL KNMRRKIKLS LAGYKEFRNK LDSLTTKIIK LARNILMMML