Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1642 |
Symbol | |
ID | 7976359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1718140 |
End bp | 1721151 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644798523 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002949695 |
Protein GI | 239827071 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTACAG AAGAGCAATT AGAAAATGTC GTGATTGAGT ATTTTCAAGA GCTTGGATAT AACTATCTAC CCGCAAGTGA GTTAAAGCGG GATGAGAAAG AGGTTTTGCT GTTTGACCGT TTGGAAGCAG CGTTGGTGAG ACTGAATCCG AGTTTGTCTT TGGATGTCAT CCGTGAAGCA ATTCGGAAAA TCCGTCATTT TGAAACAAAC GATGTGTTTA CGAACAATAA AGTGTTTCAT AAGTATTTGA CGGAAACGGT GGAAGTGGCG GAGTTTGTGA ATGGGGAAAC GGTTTATCAC CGCGTTCGGC TCATCGATTG GGAAGTACCT GAAAATAATG ATTTTCTTGT TGTCAATCAA TTAGAGGTCG TTGAAAAAGG CCAGAAGAAA ATCCCTGACA TTGTGCTCTA TGTAAACGGA ATCCCGCTTG TCGTGTTCGA GTTAAAAAGT ACGTCACGTG AAGAAGTCGA TATTGAGGAT GCGTACAAAC AATTGAAAAA TTATATGAAC GTCCACATTC CTTCCTTATT CTATTACAAT GCGTTTCTTG TGATTAGTGA TGGGGTGAAA GCCCGAGCTG GAACAATCAC AGCGCCGCTT GATCGTTTTT TGGCATGGAA AAAGATTCAT ATCGAAGACG AGGTTGTTGA AAATCGTGAA TTAGAAACAT TGATGTACGG ATTATTCAAT CAAAAACGCT TTTTGGATGT CATAAAAAAC TTCACGTTGT TTACGAATGA AGCAAAAATT ATGGCTGCGT ATCACCAATA TTATGGAATG AAAAAAGCTA TCGAGTCTAC AATACGGGCA GTTGGGAAAG ATGGACGCGC CGGGGTTATT TGGCATACGC AAGGAAGCGG CAAAAGTTAT TCCATGGTAT TCCTTGCTGG AAACTTAGTG AAACGGGAAG AATTGAAAAA CCCAACGATT GTCGTCATTA CCGACCGGAA TGATTTAGAC GGACAGCTAT TCGAAACGTT CTCCGGAGCA AGTGAATTTT TGCGGCAAAC ACCACAACAG GCGGAAACGC GGAGTCATAT AAAAGAGCTA TTGGAAAATC GCCAAACCGG TGGAATTATT TTTTCAACGA TTCAAAAATT TGAAGAAGAA ACCGGCTTGC TTTCTGATCG GGAAAATATC ATTGTCATGG TCGATGAAGC TCACCGCTCG CAATACGGTG TCGATCCGAA ATATGATATT GTGACCGGTG AACAAAAGTA CGGCTATGCG AAATATTTGC GGGAAGCGTT GCCGAATGCG ACGTATATTG CGTTTACTGG CACACCGATT GAAACAACGG ATAAATCGAC GACTGGATTG TTCGGTGATG TCATTGATGT GTATGATATG ACACAAGCGG TTCAAGACGG GGCAACCGTG AAAATTTATT ATGAATCCCG CTTGGCGAAA GTAAAACTAG ACGAGAAAAA AATGAATGAA ATTGATCAAG AATATTGGAA TATGCAAGTC AACGAAGGTG TTGACGACTA TATCGTCGAA CAAAGCCAGA AAAGCTTAAG CCGCATGGAG CAAATTATCG GCGATAAAGA CCGAATTAGA GAAGTCGTAG CCGACATTAT TAGTCATTAC GAGGAGCGCG AAAATCTTGT TGCTGGGAAA GCGATGATTG TTGCCTATTC GCGAAAAACA GCGTTCGCAA TGTATAAAGA AATCATGAGA CAACGCCCCG ATTGGAAGGA AAAAGTGAAA ATTGTCATGA CGGAAAACAA CCAAGATCCG GAAGAATTAG CGAAGCTTGT TGGGAATAAA CAAACGCGGA AGCAGCGGGA GAAAGAATTT AAAGATGTCA ATCATCCGTT CAAAATCGTG ATTGTCGTTG ATATGTGGCT AACTGGTTTC GACGTTCCAG CGCTTGATAC GATGTATATC GATAAGCCGA TGAAGGCGCA TAACTTGATG CAAGCCATCG CTCGCGTCAA TCGTGTCTAT CCGGGCAAGA CGGGCGGATT GATTGTCGAC TATATCGGTT TAAAGAGAGA CTTAATGGAA GCACTCAAAA CGTATACAAA GCGTGACCAA GATAAAGTGC AGGAAAACGA GCAAGCTCGC GATATCGCGC TAAATATTCT TGAAGTTCTT CGCAATATGT TTCACGAGTT TGACTACAGT GCGTTTTTCG GGGACAGCGA CAAAAAACGT TATGAAGTCA TCCGGGATGG TGCGGAATTT GTTCAACAAA CGGAAAAAAG AAAATCACTG TTTATGACAG AAACGAAGAA GTTAAAGGAT GTTTATAAAA TTTGTACCGG TTTGCTTTCC AAAGAGCAAA AAGAGGAAAT TTCCTACTTT ATTGCCGTTC GTTCTTTTAT CATGAAATCT TCGCGAAAAG GAGCACCGGA TTTAAAAGAA GTAAATGAAC GAATCTCAAA AATGTTAGAA GAGGCCATTT TAGAAGATGA AGTGATGGTA TTAACGCAGG CTTCTTCATC CGAGAGTTTT GACTTGTTAA ACGAAGAGAA TATAAAGAAA CTTCGCGCGC TGCCGCAAAA GAATATCGCC GCTAATATTC TCATGCGCGT ACTAAAGGAA AAACTGCAAG ATGTGAAAAA GAAAAACATG ACCGTCAGCC AAACGTTTTC CAAACGCTTT GAAAAAATAT TAGAAAAATA CAACAACCGT AACGATTATA CGGATGTATA CGAAGTATTT GAAGAACTCA TTAAATTTAA AGAAGAATTG GAAGCAGCGA TTCAAGCAGG AAAACAACTT GGTTTAACGG ATGAGGAAAA AGCATTTTTT GATGTGTTAG GCTCAGATCC GGATATAAAA AAATTAATGG AAGATGAAAT TTTAATCAAA ATCGCGAAAG AGCTGGCGAA AACAGTGAAA GAAAATCGCA CGCACGATTG GGATAAAAAA GAACAAGCCC AAGCACGCAT GCGCCTGCAG ATAAAAAAAG TTCTACGTAA ATATGATTAT CCTCCAAATA AACAGCCAAA AGCAGTGGAA GATGTATTAA TGCAAGCGAA GTTACAGTGT CAGAATATGT GA
|
Protein sequence | MFTEEQLENV VIEYFQELGY NYLPASELKR DEKEVLLFDR LEAALVRLNP SLSLDVIREA IRKIRHFETN DVFTNNKVFH KYLTETVEVA EFVNGETVYH RVRLIDWEVP ENNDFLVVNQ LEVVEKGQKK IPDIVLYVNG IPLVVFELKS TSREEVDIED AYKQLKNYMN VHIPSLFYYN AFLVISDGVK ARAGTITAPL DRFLAWKKIH IEDEVVENRE LETLMYGLFN QKRFLDVIKN FTLFTNEAKI MAAYHQYYGM KKAIESTIRA VGKDGRAGVI WHTQGSGKSY SMVFLAGNLV KREELKNPTI VVITDRNDLD GQLFETFSGA SEFLRQTPQQ AETRSHIKEL LENRQTGGII FSTIQKFEEE TGLLSDRENI IVMVDEAHRS QYGVDPKYDI VTGEQKYGYA KYLREALPNA TYIAFTGTPI ETTDKSTTGL FGDVIDVYDM TQAVQDGATV KIYYESRLAK VKLDEKKMNE IDQEYWNMQV NEGVDDYIVE QSQKSLSRME QIIGDKDRIR EVVADIISHY EERENLVAGK AMIVAYSRKT AFAMYKEIMR QRPDWKEKVK IVMTENNQDP EELAKLVGNK QTRKQREKEF KDVNHPFKIV IVVDMWLTGF DVPALDTMYI DKPMKAHNLM QAIARVNRVY PGKTGGLIVD YIGLKRDLME ALKTYTKRDQ DKVQENEQAR DIALNILEVL RNMFHEFDYS AFFGDSDKKR YEVIRDGAEF VQQTEKRKSL FMTETKKLKD VYKICTGLLS KEQKEEISYF IAVRSFIMKS SRKGAPDLKE VNERISKMLE EAILEDEVMV LTQASSSESF DLLNEENIKK LRALPQKNIA ANILMRVLKE KLQDVKKKNM TVSQTFSKRF EKILEKYNNR NDYTDVYEVF EELIKFKEEL EAAIQAGKQL GLTDEEKAFF DVLGSDPDIK KLMEDEILIK IAKELAKTVK ENRTHDWDKK EQAQARMRLQ IKKVLRKYDY PPNKQPKAVE DVLMQAKLQC QNM
|
| |