Gene GWCH70_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1642 
Symbol 
ID7976359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1718140 
End bp1721151 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content40% 
IMG OID644798523 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002949695 
Protein GI239827071 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTACAG AAGAGCAATT AGAAAATGTC GTGATTGAGT ATTTTCAAGA GCTTGGATAT 
AACTATCTAC CCGCAAGTGA GTTAAAGCGG GATGAGAAAG AGGTTTTGCT GTTTGACCGT
TTGGAAGCAG CGTTGGTGAG ACTGAATCCG AGTTTGTCTT TGGATGTCAT CCGTGAAGCA
ATTCGGAAAA TCCGTCATTT TGAAACAAAC GATGTGTTTA CGAACAATAA AGTGTTTCAT
AAGTATTTGA CGGAAACGGT GGAAGTGGCG GAGTTTGTGA ATGGGGAAAC GGTTTATCAC
CGCGTTCGGC TCATCGATTG GGAAGTACCT GAAAATAATG ATTTTCTTGT TGTCAATCAA
TTAGAGGTCG TTGAAAAAGG CCAGAAGAAA ATCCCTGACA TTGTGCTCTA TGTAAACGGA
ATCCCGCTTG TCGTGTTCGA GTTAAAAAGT ACGTCACGTG AAGAAGTCGA TATTGAGGAT
GCGTACAAAC AATTGAAAAA TTATATGAAC GTCCACATTC CTTCCTTATT CTATTACAAT
GCGTTTCTTG TGATTAGTGA TGGGGTGAAA GCCCGAGCTG GAACAATCAC AGCGCCGCTT
GATCGTTTTT TGGCATGGAA AAAGATTCAT ATCGAAGACG AGGTTGTTGA AAATCGTGAA
TTAGAAACAT TGATGTACGG ATTATTCAAT CAAAAACGCT TTTTGGATGT CATAAAAAAC
TTCACGTTGT TTACGAATGA AGCAAAAATT ATGGCTGCGT ATCACCAATA TTATGGAATG
AAAAAAGCTA TCGAGTCTAC AATACGGGCA GTTGGGAAAG ATGGACGCGC CGGGGTTATT
TGGCATACGC AAGGAAGCGG CAAAAGTTAT TCCATGGTAT TCCTTGCTGG AAACTTAGTG
AAACGGGAAG AATTGAAAAA CCCAACGATT GTCGTCATTA CCGACCGGAA TGATTTAGAC
GGACAGCTAT TCGAAACGTT CTCCGGAGCA AGTGAATTTT TGCGGCAAAC ACCACAACAG
GCGGAAACGC GGAGTCATAT AAAAGAGCTA TTGGAAAATC GCCAAACCGG TGGAATTATT
TTTTCAACGA TTCAAAAATT TGAAGAAGAA ACCGGCTTGC TTTCTGATCG GGAAAATATC
ATTGTCATGG TCGATGAAGC TCACCGCTCG CAATACGGTG TCGATCCGAA ATATGATATT
GTGACCGGTG AACAAAAGTA CGGCTATGCG AAATATTTGC GGGAAGCGTT GCCGAATGCG
ACGTATATTG CGTTTACTGG CACACCGATT GAAACAACGG ATAAATCGAC GACTGGATTG
TTCGGTGATG TCATTGATGT GTATGATATG ACACAAGCGG TTCAAGACGG GGCAACCGTG
AAAATTTATT ATGAATCCCG CTTGGCGAAA GTAAAACTAG ACGAGAAAAA AATGAATGAA
ATTGATCAAG AATATTGGAA TATGCAAGTC AACGAAGGTG TTGACGACTA TATCGTCGAA
CAAAGCCAGA AAAGCTTAAG CCGCATGGAG CAAATTATCG GCGATAAAGA CCGAATTAGA
GAAGTCGTAG CCGACATTAT TAGTCATTAC GAGGAGCGCG AAAATCTTGT TGCTGGGAAA
GCGATGATTG TTGCCTATTC GCGAAAAACA GCGTTCGCAA TGTATAAAGA AATCATGAGA
CAACGCCCCG ATTGGAAGGA AAAAGTGAAA ATTGTCATGA CGGAAAACAA CCAAGATCCG
GAAGAATTAG CGAAGCTTGT TGGGAATAAA CAAACGCGGA AGCAGCGGGA GAAAGAATTT
AAAGATGTCA ATCATCCGTT CAAAATCGTG ATTGTCGTTG ATATGTGGCT AACTGGTTTC
GACGTTCCAG CGCTTGATAC GATGTATATC GATAAGCCGA TGAAGGCGCA TAACTTGATG
CAAGCCATCG CTCGCGTCAA TCGTGTCTAT CCGGGCAAGA CGGGCGGATT GATTGTCGAC
TATATCGGTT TAAAGAGAGA CTTAATGGAA GCACTCAAAA CGTATACAAA GCGTGACCAA
GATAAAGTGC AGGAAAACGA GCAAGCTCGC GATATCGCGC TAAATATTCT TGAAGTTCTT
CGCAATATGT TTCACGAGTT TGACTACAGT GCGTTTTTCG GGGACAGCGA CAAAAAACGT
TATGAAGTCA TCCGGGATGG TGCGGAATTT GTTCAACAAA CGGAAAAAAG AAAATCACTG
TTTATGACAG AAACGAAGAA GTTAAAGGAT GTTTATAAAA TTTGTACCGG TTTGCTTTCC
AAAGAGCAAA AAGAGGAAAT TTCCTACTTT ATTGCCGTTC GTTCTTTTAT CATGAAATCT
TCGCGAAAAG GAGCACCGGA TTTAAAAGAA GTAAATGAAC GAATCTCAAA AATGTTAGAA
GAGGCCATTT TAGAAGATGA AGTGATGGTA TTAACGCAGG CTTCTTCATC CGAGAGTTTT
GACTTGTTAA ACGAAGAGAA TATAAAGAAA CTTCGCGCGC TGCCGCAAAA GAATATCGCC
GCTAATATTC TCATGCGCGT ACTAAAGGAA AAACTGCAAG ATGTGAAAAA GAAAAACATG
ACCGTCAGCC AAACGTTTTC CAAACGCTTT GAAAAAATAT TAGAAAAATA CAACAACCGT
AACGATTATA CGGATGTATA CGAAGTATTT GAAGAACTCA TTAAATTTAA AGAAGAATTG
GAAGCAGCGA TTCAAGCAGG AAAACAACTT GGTTTAACGG ATGAGGAAAA AGCATTTTTT
GATGTGTTAG GCTCAGATCC GGATATAAAA AAATTAATGG AAGATGAAAT TTTAATCAAA
ATCGCGAAAG AGCTGGCGAA AACAGTGAAA GAAAATCGCA CGCACGATTG GGATAAAAAA
GAACAAGCCC AAGCACGCAT GCGCCTGCAG ATAAAAAAAG TTCTACGTAA ATATGATTAT
CCTCCAAATA AACAGCCAAA AGCAGTGGAA GATGTATTAA TGCAAGCGAA GTTACAGTGT
CAGAATATGT GA
 
Protein sequence
MFTEEQLENV VIEYFQELGY NYLPASELKR DEKEVLLFDR LEAALVRLNP SLSLDVIREA 
IRKIRHFETN DVFTNNKVFH KYLTETVEVA EFVNGETVYH RVRLIDWEVP ENNDFLVVNQ
LEVVEKGQKK IPDIVLYVNG IPLVVFELKS TSREEVDIED AYKQLKNYMN VHIPSLFYYN
AFLVISDGVK ARAGTITAPL DRFLAWKKIH IEDEVVENRE LETLMYGLFN QKRFLDVIKN
FTLFTNEAKI MAAYHQYYGM KKAIESTIRA VGKDGRAGVI WHTQGSGKSY SMVFLAGNLV
KREELKNPTI VVITDRNDLD GQLFETFSGA SEFLRQTPQQ AETRSHIKEL LENRQTGGII
FSTIQKFEEE TGLLSDRENI IVMVDEAHRS QYGVDPKYDI VTGEQKYGYA KYLREALPNA
TYIAFTGTPI ETTDKSTTGL FGDVIDVYDM TQAVQDGATV KIYYESRLAK VKLDEKKMNE
IDQEYWNMQV NEGVDDYIVE QSQKSLSRME QIIGDKDRIR EVVADIISHY EERENLVAGK
AMIVAYSRKT AFAMYKEIMR QRPDWKEKVK IVMTENNQDP EELAKLVGNK QTRKQREKEF
KDVNHPFKIV IVVDMWLTGF DVPALDTMYI DKPMKAHNLM QAIARVNRVY PGKTGGLIVD
YIGLKRDLME ALKTYTKRDQ DKVQENEQAR DIALNILEVL RNMFHEFDYS AFFGDSDKKR
YEVIRDGAEF VQQTEKRKSL FMTETKKLKD VYKICTGLLS KEQKEEISYF IAVRSFIMKS
SRKGAPDLKE VNERISKMLE EAILEDEVMV LTQASSSESF DLLNEENIKK LRALPQKNIA
ANILMRVLKE KLQDVKKKNM TVSQTFSKRF EKILEKYNNR NDYTDVYEVF EELIKFKEEL
EAAIQAGKQL GLTDEEKAFF DVLGSDPDIK KLMEDEILIK IAKELAKTVK ENRTHDWDKK
EQAQARMRLQ IKKVLRKYDY PPNKQPKAVE DVLMQAKLQC QNM