Gene Hore_20210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20210 
Symbol 
ID7314345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2178735 
End bp2180432 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content30% 
IMG OID643612465 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002509761 
Protein GI220932853 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.266778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTATAGATAA TTTTCAGTTA ATCTTTGATT CCAGGGAAAA GATTCAGGAA 
TTAAGAAATT TAATTTTAAA TTTAGCTGTC AGAGGAAAGC TTGTGCCTCA GAACTCGGAT
GATGAACCAT CATCAGTATT GATTGAAAAA ATAAAAAAAG AGAAAGAAAG ATTAATTAAA
GAAAAGAAAA TTAGAAAAAA GAAACCCTTA CCACCAATCA AAGAAGAAGA AATTCCTTTT
GAATTACCTG AAAGCTGGGA ATGGGTGAGG TTAGGGAATA TCGGAAGGAT AGTTGGAGGT
GGAACTCCTA AAACTAAAGT ACATGCGTAT TGGGAAAATG GTAATATTGC ATGGTTAACC
CCAGCAGATT TAAACGGGCT TAAATCAAAG TATATTTCTA GGGGACGGAG AAATATTACT
AAATTGGGTT TGCAAAATAG TTCAGCAAAA CTTTTGCCCA AAGGAAGCGT ATTGTTTTCA
AGTAGAGCTC CTATAGGGTA TGTAGCAATT GCACAAAATG ATTTAGCAAC AAATCAAGGA
TTTAAATCAT GTGTTCCATA TATTATGGAT ATGAATCAAT ATATTTATTA TTTTTTGATG
TATGATGCCA AAAGAATAAA TGATAATGCT TCAGGGACTA CTTTTAAAGA AGTATCTGGG
AAAGAGGTCG CAAATTTTAT TTTCCCATTA CCACCTCTTA ATGAACAGAA ACGTATCGTA
AACAAATTGG ATGAGTTAAT GACCTTTTGT GACCAACTTG AAGTATCATT AGAAAAAAAG
GCTAACGCAA AACAATTAGT ATCAAAGAGT ATTTCAAATA GAATTCAAAA GAGTAAGAGT
AAAGAAGAAT TAGATAAGAA TATAACATTT ATAATTAGAA ACCTTAAAGA AGTATATACA
ACACCTGAAA ATTTAAATGA TTTAAAAGAT ATCATTCTCC AGTTGGCTAT TCAGGGTAAG
CTTGTGCCTC AGGATCCGGA TGATGAACCA GCATCAGTAT TAATTGAAAA AATAAATAAA
GAAAAAGAAA GATTAATTAA AGAAAAGAAA ATTAGAAAAA CCAAACCCTT ACCACCAATA
AAAGAAGCTG AAATTCCCTT TGAATTACCT AAAGGCTGGG AATGGGTGAG GTTGGGAGAA
ATAATGATAA TTAATCAACG AAATAAATTA AATGATAATT TAGAAGTGTC ATTTGTTCCA
ATGAAGCTGA TTGAAGATGG TTATTTAAGT AAACACAGTC ATAAAAAGAA GTTATGGAAA
GAAGTAAAAA AAGGTTATAC TCATTTTAAA GAAAATGATT TGGTAGTAGC TAAGATTACT
CCTTGCTTTG AAAATAGAAA ATCAGCAATC ATGAAAAATT TATATTCAGG TTATGGAGCT
GGGACAACTG AATTGCATGT TCTTACCAGC TACTTAAAAG AAATAGATAT GAAATTTTTC
CTCTACATTG TAAAAGCAAA GAATTTTATT AATCAAGGGG TTTCAACTTT TACTGGAACA
GCTGGACAGC AAAGAATTAG AAAAGATTTT ATAGAAAATT TTGTTATAGG TTTACCTCCG
TTAAATGAAC AGAAACAAAT TGTCAAAAAA ATAGACAAAT TAATGGCCTT ATGTAATTTA
CTAGAAAACC AAATTAATAA AAATAGAAAT AATAGTGAGT TATTAATGAA ATCTTTACAA
AGGAAATTAT TTGAGTGA
 
Protein sequence
MKKIIDNFQL IFDSREKIQE LRNLILNLAV RGKLVPQNSD DEPSSVLIEK IKKEKERLIK 
EKKIRKKKPL PPIKEEEIPF ELPESWEWVR LGNIGRIVGG GTPKTKVHAY WENGNIAWLT
PADLNGLKSK YISRGRRNIT KLGLQNSSAK LLPKGSVLFS SRAPIGYVAI AQNDLATNQG
FKSCVPYIMD MNQYIYYFLM YDAKRINDNA SGTTFKEVSG KEVANFIFPL PPLNEQKRIV
NKLDELMTFC DQLEVSLEKK ANAKQLVSKS ISNRIQKSKS KEELDKNITF IIRNLKEVYT
TPENLNDLKD IILQLAIQGK LVPQDPDDEP ASVLIEKINK EKERLIKEKK IRKTKPLPPI
KEAEIPFELP KGWEWVRLGE IMIINQRNKL NDNLEVSFVP MKLIEDGYLS KHSHKKKLWK
EVKKGYTHFK ENDLVVAKIT PCFENRKSAI MKNLYSGYGA GTTELHVLTS YLKEIDMKFF
LYIVKAKNFI NQGVSTFTGT AGQQRIRKDF IENFVIGLPP LNEQKQIVKK IDKLMALCNL
LENQINKNRN NSELLMKSLQ RKLFE