Gene BCAH820_0843 details

Gene Information

Locus tag: BCAH820_0843
Symbol: hsdR
ID: 7186955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011773
Strand:
Start bp: 801775
End bp: 804981
Gene Length: 3207 bp
Protein Length: 1068 aa
Translation table: 11
GC content: 37%
IMG OID: 643554255
Product: type I restriction enzyme EcoKI subunit R
Protein accession: YP_002449795
Protein GI: 218901961
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
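
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(801775, 804981)
print(length)                     # 3207
print(protein_length_aa(length))  # 1068
```

Both values agree with the Gene Length and Protein Length fields of this record.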


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 1.76009e-28
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAAAA GTAACTTTTC ATTTTTTCGA GAAAAATGGG ATGTGTTAGC TAATCTTGGT 
GAGACTGCTG AAAAGAATAT GTACTATGAT CCTCATACAA CATTGATGAA ACTTCGTTTG
TTCGGGGAAA CGTTGGCTAA GGTTATTTTA GCTATGGAAA ATATTAAAGA AGCATATAAT
ACGAGTCAGG TTGATCGTAT GCAAACATTA CGGCGTGAAG GTTTATTAGA AAAAGAGCTT
TACGATATGT TTGATGCACT TCGTAAAAAA GGAAACAACG CAGCGCATGA AGCTGGATAT
GGAACGGTTA AAGAAGCGCA AGCGTTATTG CTTATGGCAT TCCGTTTAGG TATTTGGTTT
ATGGAAGTGT ACGGTGATTG GGACTTTGAA GCACCAGAGT ATATTGAGCC TGAAAAAGAA
GAAAAAGTAG ACGTTTCCGT ATTGCAAAAG GAATATGATG AGAAGGTAAA ACAATTAGAA
ATAGAATTAG AAAAGGTTCG TAAAGAATCA CAGTATGATA CATCAGAGGA CAAGCAAAGA
CGTAGTCGTA TTTCAAAAAA ATATGTGAAT CGTTTACATT TATCCGAAGA AGAGACGAGA
ACAATTATTG ATGAAAAGTT ACGTCAAGCT GGTTGGGAAG CTGATAGTGT TAATCTCACT
TTCCAAAATG GCACACGTCC AGAAAAGGGA CGCAATATGG CGATCGCTGA GTGGACTGTT
AAAGGTGGTC GAGCAGATTA CGCCTTATTT ATTGGTAAAC AGCTTGTTGG TTTTATTGAG
GCTAAAGCAA AACATAAAAA AATTGCTTCT GTATTAGATA GCCAGACGAA GTTTTATGCC
CGTAATGTTT ATCAGCATGA AGATGAAACC TTAATGCCAA CAACGGGTGA GTACAAAGCT
CCATTTTTAT ACGCAACAAA TGGACGTCCG TATTTAAAAC AATTAAAAGA TGAATCTGGT
ATTTGGTTCT GGGATTCTCG TAAACCATTA GAGCATTCAC GCCCATTAGA AGGATGGCAT
TCTCCTATGG ACTTGCAGAT GCTGTTAGAA CAAGATGACC AGGATGCTGA TAAGAAACTA
GAAAAAGAAA GTATTGAAAA ATTTAGTTTA CGTCCTTATC AACAAAATGC CGTACTATCT
ATAGAAAGCG GCTTAAAGGA AGGTAAGCGG AGAATGTTAG TTGCGATGGC AACGGGAACG
GGAAAAACTC GTACAGCGAT CGCATTAATG TATCGTCTCA TAAAAGCGAA AAAGTGTCGC
CGAATTCTAT TTTTAGTTGA TCGTAAATCA TTGGGGACAC AAACAGAAGA CTCTTTGAAA
GATACAAAAT TTGATGGACT TGCATTTACG GATATATATG ATGTGAAAAC ATTAGAACAT
ATGTCGCCTG AGATAGAAAC GAAAGTACAT ATTGCAACCG TACAAGGTAT GGTAAAGCGT
CTGTTTTATA GTGACAATGA GAACTTACCA ACAGTTGGGC AATATGATTT TATTATTGTT
GATGAAGCAC ATCGTGGCTA TACAAGTGAT CGTGAGATGT CACAAGAGGA AATGGAATTC
CGTGATCAAA ATGATTATAT TAGTCAATAT CGCCGCGTGA TTGATTATTT TGACGCAGCA
TGCTTAGGAT TGACGGCAAC ACCAGCGCTT CATACGACAG ATATTTTCGG TATGCCGATT
TATAAGTATT CGTATAGTGA AGCAGTATTA GATGGAGCCT TAGTAGATCA TGAGCCACCG
TATGTGTTTA AAACAGAGTT AATGGAAGCA GGTATTAAGT TCGAAAAGGG TGACGAAGTA
CAAGTGTACG ATGTTGACCA ACAAGAACTA AAGTTAGAAG AGATGGAAGA TACAGTTCAA
TTTGAAGTAG AACAGTTCAA CAGAAAGGTA ATCACGGAGC CTTTTAATCG TGCTGTTTTA
AATAAGTTAA CGGATTATAT CGATCCAACT AGTAAAGAGA AAACGTTAAT TTTCGCAGTA
AATGATGCGC ATGCTGATAT GGTTGTACGT TTATTAAAAG AAGCTTATAA AGAGCGCGGT
GATGAAGTGG AAGATGACGC AATTATGAAA ATTACAGGTT ATATTCATAA ACCACTTGAT
GCGATTAAAC GCTTTAAGAA CGAGCGTTTA CCGAATGTCG TTGTTACAGT AGACTTGTTA
ACGACAGGTG TTGATGTACC AGCTATTACA AACTTAGTGT TTTTACGTCG CGTGCAATCG
CGTATTCTGT ATGATCAAAT GTTAGGACGA GCAACTCGTT TATGTACGGA TATCGGAAAA
ACACATTTCA ATATTTATGA TGCAGTTGGT ATTTATGATA ATTTAAAATC ATACACAGAC
ATGAAACCTG TTGTGAAACA ACAAAATTAT TCAATTGATG ATTTGTATCA GTCATTAGCA
AACGCTAATG ATGAAAAAGA AGCGGATTTT TATCGTGATC AACTTATTGC AAAAATACAA
CGCAAGAAAC AACGCCTGCC AGAAGAAGCG AAGCAGAAAT TTACAGAATT GACGAATGGG
AAAAGCATTG ATGATTGGGC GCATGAATTA CAATCTGTAA CACCAGAAGC AGCAAGAGAA
CAAGAACTTT TATTTGAATA TGTTTCACAA TATCGTACCC AAGGTGAAAA AATATATGTT
TCTAACCACG ATGACCGTGT AACGAAAGTA GAGCGTGGAT ACGGGGAAGG AAATAATCGT
CCTGAAGATT ATTTAGAAGG ATTTGAACAC TTTGTCAAAG AAAATATCAA CTTGATTCCT
GCGTTGCAAA TTGTGTGTAC ACGTCCAAAA GAATTAACAC GTCAAGATTT ACGGGAGTTA
ATTACGATCC TCGAAACGAA GGGTTTTAAG CAATCTCACT TGCAAACAGC TTGGAAACAA
ACAAAAAATG AAGATATTGC AGCGGACATT ATTACATTTA TCCGTCAGGC AGCTCTTGGA
GATGCATTAG TGGATCATGA GGCACGCATT AAACGTGCGA TGCAGAAGGT ATACAGCTTA
CATGATTGGA CACCACGCCA ACAAAAATGG TTAGAGCGAA TTGAAAAACA GTTACTACAA
GTACCCGTGC TAGCGCCAAC TCCAGAGGAT GCATTTTCAG AAGAGCCATT TCGTAGTCGA
GGTGGTTACA ATATGTTAAA GCGTGAATTT GGAGAAGAGA TTGACAAGAT TGTGTATACT
ATAAATGATT ACTTATATAT TAGTTAA
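
The gene sequence is printed in space-separated blocks of 10 bases, so any calculation on it has to strip the whitespace first. As a minimal sketch, the GC content field could be recomputed along these lines. The helper name `gc_fraction` is hypothetical, and the result may differ slightly from the database value depending on how that value was rounded.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    dna = "".join(seq.split()).upper()  # remove block spacing and line breaks
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# Toy input, not the gene itself: 6 of the 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as one string (line breaks included) should give a value comparable to the GC content reported in the Gene Information section.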
 
Protein sequence
MSKSNFSFFR EKWDVLANLG ETAEKNMYYD PHTTLMKLRL FGETLAKVIL AMENIKEAYN 
TSQVDRMQTL RREGLLEKEL YDMFDALRKK GNNAAHEAGY GTVKEAQALL LMAFRLGIWF
MEVYGDWDFE APEYIEPEKE EKVDVSVLQK EYDEKVKQLE IELEKVRKES QYDTSEDKQR
RSRISKKYVN RLHLSEEETR TIIDEKLRQA GWEADSVNLT FQNGTRPEKG RNMAIAEWTV
KGGRADYALF IGKQLVGFIE AKAKHKKIAS VLDSQTKFYA RNVYQHEDET LMPTTGEYKA
PFLYATNGRP YLKQLKDESG IWFWDSRKPL EHSRPLEGWH SPMDLQMLLE QDDQDADKKL
EKESIEKFSL RPYQQNAVLS IESGLKEGKR RMLVAMATGT GKTRTAIALM YRLIKAKKCR
RILFLVDRKS LGTQTEDSLK DTKFDGLAFT DIYDVKTLEH MSPEIETKVH IATVQGMVKR
LFYSDNENLP TVGQYDFIIV DEAHRGYTSD REMSQEEMEF RDQNDYISQY RRVIDYFDAA
CLGLTATPAL HTTDIFGMPI YKYSYSEAVL DGALVDHEPP YVFKTELMEA GIKFEKGDEV
QVYDVDQQEL KLEEMEDTVQ FEVEQFNRKV ITEPFNRAVL NKLTDYIDPT SKEKTLIFAV
NDAHADMVVR LLKEAYKERG DEVEDDAIMK ITGYIHKPLD AIKRFKNERL PNVVVTVDLL
TTGVDVPAIT NLVFLRRVQS RILYDQMLGR ATRLCTDIGK THFNIYDAVG IYDNLKSYTD
MKPVVKQQNY SIDDLYQSLA NANDEKEADF YRDQLIAKIQ RKKQRLPEEA KQKFTELTNG
KSIDDWAHEL QSVTPEAARE QELLFEYVSQ YRTQGEKIYV SNHDDRVTKV ERGYGEGNNR
PEDYLEGFEH FVKENINLIP ALQIVCTRPK ELTRQDLREL ITILETKGFK QSHLQTAWKQ
TKNEDIAADI ITFIRQAALG DALVDHEARI KRAMQKVYSL HDWTPRQQKW LERIEKQLLQ
VPVLAPTPED AFSEEPFRSR GGYNMLKREF GEEIDKIVYT INDYLYIS
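
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a simplified illustration, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string, whose codon assignments are the same in translation table 11, and stops at the first stop codon. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and line breaks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAGTAAAA GTAACTTTTC ATTTTTTCGA"))  # MSKSNFSFFR
```

The output matches the first block of the protein sequence, and the final codons of the gene (ATT AGT TAA) translate to the terminal residues IS followed by a stop.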