Gene Hore_20230 details


Gene Information

Locus tag: Hore_20230
Symbol:
ID: 7314347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2181897
End bp: 2184209
Gene Length: 2313 bp
Protein Length: 770 aa
Translation table: 11
GC content: 35%
IMG OID: 643612467
Product: EcoEI R domain protein
Protein accession: YP_002509763
Protein GI: 220932855
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
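
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed in the record above.
print(cds_lengths(2181897, 2184209))  # (2313, 770)
```

The result matches the listed Gene Length (2313 bp) and Protein Length (770 aa).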


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.601124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCTCTTA CAGAAACTGA AGTTTGTATG CGGTATATTA CACCTGCTAT TCAGAATGCA 
GGCTGGGATA TAAATAAACA GGTGCTTAGA GAGTATAGCT TTACTGATGG ACGGGTGATT
GTTCGTGGTA AGCTTGTTAA TAGGGCTAAG CCCAAAAGAG CAGACTATAT TCTTCAATAT
AAAAGTAATA TACCCCTTGC AATTATTGAA GCTAAAAAAG ACACCCTTCC TATAGGTGCA
GGAATGCAAC AGGCGTTGGA GTATGCTGAA ATTTTAGATA TTCCCTTTGT TTATTCTAGC
AATGGCAAAG GTTTCATTGA GCATGACCGC AATAGTGGTA GAGAAAAAGA AATTGCTCTG
GAAGATTTTC CTTCTCCAGA GGAATTGTGG GGTCGTTTTA GTAAGCAGAA AGAATTTACC
AGAGAACAGG AGCAGTTGTA CTTACAAGAT TATTTTTATC AAATAAATTA TAAAACCCCT
AGATACTACC AGAGGGTGGC CATAAATAGG GCAGTTGAGG CAATTGCCAA AGGGCAAAAA
CGTATTCTGC TTGTTATGGC TACTGGAACC GGTAAGACCT TTACAGCCTT TCAAATTATC
CATAGATTGT GGAAAGCGGG TAAAAAGAAA AGAATCCTCT TTCTTGCAGA CCGTAACATT
TTAGTTGACC AGACAATAAC GGGTGACTTT AGTCCTTTTG GTGACAAGAT GATTAAGATT
CAAAGAGGAA ATGTTTCTCA TGCCCATGAG ATTTATCTTG CCCTATATCA GTCAATGACG
GGTCCTGAAG AATGGCAAAA GACCTTTGAG GAATACAGTC CTGATTTTTT TGATCTTGTT
GTAATAGATG AGTGTCATCG AGGAAGTGCC AGGGCAAATA GTGCCTGGAG AGTGGTTTTA
GAGTATTTTG ATCAGGCAAC ACATCTTGGA TTAACTGCTA CTCCAAAAGA GGATGCTGAT
GTATCAACTC AAGGTTATTT TGGAGAACCA ATTTATACAT ATTCGTTGAA GCAGGGGATT
GAGGATGGCT TTTTAGCACC ATATAAGGTA GTAAGAATGG GACTAGATAA GGATTTAATG
GGATTTAGAC CAATTAAAGG TCAGACTGAC AAGTATGGGC GAGTTATGGA AGATAGAGAG
TATTATGTTA ATGATTTTGA TAGAGAATTA ATTTTAGAAA AAAGACATGA ATTAGTTGCA
AAAGAAATTA CAAGGTATTT AAAAGAGGAG TTGCAAGACA GATTTGCTAA AACTATTGTG
TTTTGCCAGG ATATTGAACA TGCTGAGAAT ATGAGACAGG CATTAGTGAA TGAAAATGCA
GATCTTGTAA AGCAAAATTA TAAATATGTA ATGCGTATTA CCGGGGATAA TCCAGAGGCT
AAAATTCAGC TGGATAATTT TATTGAACCT TCTGAGACTT ATCCAGTTAT TGCGACTACA
TCTAAATTAA TGACAACTGG CGTTGATGCT CAAACCTGTA AGCTTATTGT TCTTGATATG
ACGATCAATT CTATGTCTGA GTTTAAACAA ATAATAGGGC GGGGGACACG TTTAAGGCCT
GATTATGGTA AATATTATTT TACTATTTTA GATTTTAGGG GAGCTACAAG GTTATTTGCT
GACCCTGACT TTGATGGATA TCCAATACCT GACGATGAAG ATGGTGATAA CTACCCAGGA
AATGATGTTA ATGGTGGTAA TAGAATACCT GGAAATGATG AAGGTGGGGA TGAAGAAGGA
AGAGTAAAAT ACTATGTGGA TGATGTCGAG GTTAATTTAA TTAATAAAAG GGTTCAGTAT
CTTGATGCTA ATGGTAAACT TGTTTCTGAA TCCTATGTAG ACTATACAAA GAAAAATATT
AGAAAACAAT ATGCTACACT TGACGATTTT ATAAGGAGAT GGACTGAAGA AGAAAAAAAA
CAGGTTATAT ATGATGAGTT ACTGGAACAG GGAATTATAC TTGATGAACT TAGAAAAGAG
GTAGGCAAAG AGGATATAGA TGATTTTGAT TTAATTTGTC ATATAGTTTT TGATGCCAAG
CCTTTAACAA AGTCAGAAAG AATAAATAAT GTTAAAAAGC GAAATTATTT CACTAAATAT
GGAGAGCAAG CAAGAAAAGT CCTTGAGATT CTTCTTGATA AGTATAAAAA CAGCAACATT
ACAGAAATAG AAGATATAAA AATCCTTAAA CTTGATGAAT TTAAGCAAAT AGGTATGCCA
GGAAGAATAT TTAAACTCTT TGGCGGAAAG AAAAAATATT TAGAAGCGAT TAAGGAATTA
GAAGCAGAAA TATATAAAGG AGAGGTAAGT TGA
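
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the grouping and compute a GC percentage of the kind reported in the GC content field. The helper name `gc_percent` is hypothetical, and the example runs only on the first two blocks of the sequence rather than on the full gene.

```python
def gc_percent(grouped_dna: str) -> float:
    """Return the percentage of G and C bases in a whitespace-grouped sequence."""
    dna = "".join(grouped_dna.split()).upper()  # drop spaces and line breaks
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return 100.0 * gc_count / len(dna)


# First two 10-base blocks of the gene sequence above (illustration only).
print(gc_percent("GTGGCTCTTA CAGAAACTGA"))  # 45.0
```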
 
Protein sequence
MALTETEVCM RYITPAIQNA GWDINKQVLR EYSFTDGRVI VRGKLVNRAK PKRADYILQY 
KSNIPLAIIE AKKDTLPIGA GMQQALEYAE ILDIPFVYSS NGKGFIEHDR NSGREKEIAL
EDFPSPEELW GRFSKQKEFT REQEQLYLQD YFYQINYKTP RYYQRVAINR AVEAIAKGQK
RILLVMATGT GKTFTAFQII HRLWKAGKKK RILFLADRNI LVDQTITGDF SPFGDKMIKI
QRGNVSHAHE IYLALYQSMT GPEEWQKTFE EYSPDFFDLV VIDECHRGSA RANSAWRVVL
EYFDQATHLG LTATPKEDAD VSTQGYFGEP IYTYSLKQGI EDGFLAPYKV VRMGLDKDLM
GFRPIKGQTD KYGRVMEDRE YYVNDFDREL ILEKRHELVA KEITRYLKEE LQDRFAKTIV
FCQDIEHAEN MRQALVNENA DLVKQNYKYV MRITGDNPEA KIQLDNFIEP SETYPVIATT
SKLMTTGVDA QTCKLIVLDM TINSMSEFKQ IIGRGTRLRP DYGKYYFTIL DFRGATRLFA
DPDFDGYPIP DDEDGDNYPG NDVNGGNRIP GNDEGGDEEG RVKYYVDDVE VNLINKRVQY
LDANGKLVSE SYVDYTKKNI RKQYATLDDF IRRWTEEEKK QVIYDELLEQ GIILDELRKE
VGKEDIDDFD LICHIVFDAK PLTKSERINN VKKRNYFTKY GEQARKVLEI LLDKYKNSNI
TEIEDIKILK LDEFKQIGMP GRIFKLFGGK KKYLEAIKEL EAEIYKGEVS
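
The protein sequence corresponds to the gene sequence translated with translation table 11, where the leading GTG codon is read as the initiator methionine. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the input is a complete CDS whose first codon is a valid start codon (so the first residue is always written as M), and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
# Table 11 uses the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(grouped_dna: str) -> str:
    """Translate a whitespace-grouped CDS, stopping at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Simplification: the first codon is assumed to be an initiator
        # (for example GTG here) and is therefore written as methionine.
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First line of the gene sequence above.
first_line = ("GTGGCTCTTA CAGAAACTGA AGTTTGTATG "
              "CGGTATATTA CACCTGCTAT TCAGAATGCA")
print(translate_cds(first_line))  # MALTETEVCMRYITPAIQNA
```

The output matches the first 20 residues of the protein sequence listed above.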