Gene Hore_04450 details


Gene Information

Locus tag: Hore_04450
Symbol:
ID: 7314424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 478520
End bp: 479788
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 30%
IMG OID: 643610868
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_002508198
Protein GI: 220931290
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
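
The coordinates and lengths listed in this section can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes an unspliced, complete CDS whose last codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(478520, 479788)
print(gene_length)                           # 1269
print(expected_protein_length(gene_length))  # 422
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.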


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAGG ACGGGTATAA GGAAGTAAGA ATAGGACCCA AAAAATATCA TATTCCAAAA 
GAATGGGAGT TTAGAAATTT TGGTTTGATT TCCAAATATA TTAAAGCAGG TGGCACACCC
AAAGCAGATA AAAAAGAGTA CTATGGTGGG GAAATATTAT TTGTAAAAAT TGAAGATATG
ACAAAAAATG GTAAATACAT CTATAATACA AAAAGCACAA TTACAGAAGA TGGTTTAAAA
AACTCTTCTG CTTGGATAGT ACCAAAAAAA TCTTTGTTGT TATCAATGTA TGGGAGTTAT
GGAAAAGTGT CTATTAATAA AGTTGAATTA GCTACTAATC AAGCAATATT AGGGATTATT
CCATCTGAAG AAGTAAATTT GGATTATCTA TATTATTTTT CTTTAGGTTG CTTAAAACCT
TATTTTAAAT CATTAGTTAA AGCTACAACT CAAGCCAATT TAACAAAACA AATTGTTAAT
AATACTCCTG TATTATCTCC ACCTCTCCCA GAACAGAAAA AAATAGCTGC AATTTTGTCC
ACCGTAGATA AAGCAATCGA AAAAACAGAT GAAATAATTG AAAAAAGCAA GGAATTGAAA
AAGGGATTAA TGCAACAATT GCTGACAAAA GGGATTGGGC ATAGTGAGTT TAAGGAAGTA
AGGATAGGAA CAAAGAAAAT AAAGATTCCT GTAGTATGGA CTTTAATTAA ATTTGGAGAA
GTATTTAAAA AAAGAAATGA GAAGGCAAAT GTAGAAAAAG AATATAAATA TGTGGGTTTA
GAACATTTAG GAACAGGCGA AATCAATTTA CTTGGCTATG ATAGGAATGG TAATAATAAA
AGTAGTAAAA GGTTATTTAA GTCAGGAGAT ATTCTTTATG GAAAACTTCG TCCTTATTTA
AAAAAAGCTG CCATTACAGA TTTTGATGGC ATTTGTTCTA CGGACATAAT TCCAATATAT
GCAACTAAAA AATCTGTTAA TAATTATTTA ATTTATTTAG TTCATTCTAA AATGTTTGTT
GATTTTGCAG TTTCTACTAT GGAAGGGACC AATTTACCAA GAACATCTTG GCGAGTAATA
AAGAATTTAA TTATACCTTT ACCACCACTC CAAGAACAAA AGAAAATAGC GTCTATCCTA
TCATCAGTAG ATGAAAAAAT TCAGAAAGAG CAGGAATACA GAGAAAAGTT GGAGGAGTTA
AAGAAGGGCT TAATGCAGAA GTTGTTGACA GGTGAAGTAA GGGTTAAGGT AGAAGATGAG
GAGGTGTAG
 
Protein sequence
MIKDGYKEVR IGPKKYHIPK EWEFRNFGLI SKYIKAGGTP KADKKEYYGG EILFVKIEDM 
TKNGKYIYNT KSTITEDGLK NSSAWIVPKK SLLLSMYGSY GKVSINKVEL ATNQAILGII
PSEEVNLDYL YYFSLGCLKP YFKSLVKATT QANLTKQIVN NTPVLSPPLP EQKKIAAILS
TVDKAIEKTD EIIEKSKELK KGLMQQLLTK GIGHSEFKEV RIGTKKIKIP VVWTLIKFGE
VFKKRNEKAN VEKEYKYVGL EHLGTGEINL LGYDRNGNNK SSKRLFKSGD ILYGKLRPYL
KKAAITDFDG ICSTDIIPIY ATKKSVNNYL IYLVHSKMFV DFAVSTMEGT NLPRTSWRVI
KNLIIPLPPL QEQKKIASIL SSVDEKIQKE QEYREKLEEL KKGLMQKLLT GEVRVKVEDE
EV
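
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to generate this record: it uses the standard codon assignments, which translation table 11 shares for all non-start codons, and it ignores alternative start codons. The `translate` helper is a hypothetical name introduced for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    listing above can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the final 9 bases of the gene sequence above.
print(translate("ATGATTAAGG ACGGGTATAA GGAAGTAAGA"))  # MIKDGYKEVR
print(translate("GAGGTGTAG"))                         # EV
```

The two fragments correspond to the first ten and the last two residues of the protein sequence listed above.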