Gene Hore_05110 details


Gene Information

Locus tag: Hore_05110
Symbol:
ID: 7314490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 557074
End bp: 558234
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 34%
IMG OID: 643610934
Product: DNA methylase N-4/N-6 domain protein
Protein accession: YP_002508264
Protein GI: 220931356
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
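
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 557074
END_BP = 558234


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both results match the "Gene Length" and "Protein Length" fields of the record.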


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGACA GTAGTACCTG TAATGACAAT AATATTAACA ATTATAAAAT AATCAGGTTA 
AAACATAAAT CTAATTTGCA CTCTTCAGAA ATTAAAGCCA TTGACTATAT AAATAAATTC
TTTAACCAGG ATAACTTCTA CATTGAGGTA AATCCCGGCT CCGTAAAGAT TTATTTAAAA
AAACTCGACA TGAAAAAGCT GGGTCGGTTA ATTAAAGAAA TAAACGATAC CAGTCCTTTT
CTTCTCAAAA TATACCGTAC TATAAAAAAT ATAAAAAGCT ATGTCATTGA AAATAACAAA
AGGGTTTATG TAGACTATAA CAGAGAAAAA AAAGTTAAAA ACAGAAAGAA AAAGGAAAGA
AACAGGCAAA AATATTTTTA TGCCAGAAAT CATAATTTTA ATGTCATTAA TCAACCATTA
CCTGAAAAAT ATATAAATAA AATTATCTGT GGTGACAGTG AACAAATACT TAAAGAAATA
CCTGATAACA GTATAGATCT CATTCTTACT TCCCCGCCAT ATAACTTTGG ACTTGATTAC
AAAGATTCAC GGGATGGCTA TTACTGGAAA AGTTATTTTA GTAAGTTGTT TTCCATTTTT
AAGGAATGTA TCAGAATTCT CAAATATGGC GGCCGGATAA TCATCAACGT CCAGCCCCTC
TTTTCAGATT ATATCCCCAC CCACCACCTG ATCAGCAACT TTTTTATAAA AAATAAGATG
ATCTGGAAGG GAGAAATCCT CTGGGAAAAA AATAACTACA ACTGCAAATA TACAGCCTGG
GGTAGCTGGA AAAGCCCCTC AAGTCCTTAT TTAAAATACA CCTGGGAGTT TTTAGAGATC
TTTGCAAAGG GTAGTTTAAA GAAAAAAGGA GATAAAAAAA ATATTGATAT TACAGGAGAG
GAATTTAAAG AATGGGTTTC GGCCAGGTGG TCGATTGCCC CGGTCCGTAA TATGAAAAAA
TACCAGCACC CGGCAGTATT CCCCGAGGAA CTGGTTTATA GAGTCCTGAA GTTATTCAGT
TATAAGGGTG ACGTTATCCT CGATCCCTTT AACGGAACAG GAACCACTAC AGCAGTCGCC
CACAGACTTA AAAGGAATTA TCTGGGGATT GATATCTCAC CTGATTACTG TAATACAGCC
CGTGGCCGTC TTAATCCATA G
 
Protein sequence
MNDSSTCNDN NINNYKIIRL KHKSNLHSSE IKAIDYINKF FNQDNFYIEV NPGSVKIYLK 
KLDMKKLGRL IKEINDTSPF LLKIYRTIKN IKSYVIENNK RVYVDYNREK KVKNRKKKER
NRQKYFYARN HNFNVINQPL PEKYINKIIC GDSEQILKEI PDNSIDLILT SPPYNFGLDY
KDSRDGYYWK SYFSKLFSIF KECIRILKYG GRIIINVQPL FSDYIPTHHL ISNFFIKNKM
IWKGEILWEK NNYNCKYTAW GSWKSPSSPY LKYTWEFLEI FAKGSLKKKG DKKNIDITGE
EFKEWVSARW SIAPVRNMKK YQHPAVFPEE LVYRVLKLFS YKGDVILDPF NGTGTTTAVA
HRLKRNYLGI DISPDYCNTA RGRLNP
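
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the tool the database used: it builds the codon table from the NCBI amino-acid string for translation table 11 (whose codon-to-residue assignments are the same as the standard code; the table differs in its permitted start codons, which is not an issue here because the gene begins with ATG). The names `clean`, `translate` and `gc_percent` are hypothetical helpers, and the rounding convention behind the record's GC content figure is not known.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two 10-base groups and the last 12 bases of the gene sequence above.
print(translate("ATGAATGACA GTAGTACCTG"))  # MNDSST
print(translate("CTTAATCCATAG"))           # LNP
```

The two printed fragments agree with the start (`MNDSST...`) and end (`...LNP`) of the protein sequence; passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.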