Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2067 |
Symbol | |
ID | 7977302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2126254 |
End bp | 2129400 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644798881 |
Product | type III restriction protein res subunit |
Protein accession | YP_002950051 |
Protein GI | 239827427 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAGAA ATTCAAACAG AGCAACTGCT ATTCCCTTTA ACAAACGGCT TGTTTTGAAC CGTTTTATGC TTCATTTATT TGAAGCGGAT GATTTTGATG ATTTAGTAAT GGGGATGAAA GAGCCGGAAT TAGAGGGGTG GGATAGTGAT AACATATCCC ACTTCTACCA CTTTCTTTCC AATCGGACTT TTGACCGTCA CAAGTTAAGT AATGAGATGT TACGTCAGTA TGATGAAAAT ATTGTAAGAC ATACAATGCG TATCAATCTA AAGAGAAAAG AAGCGATTAA GTGGAAATAC TTCCAATATT TGTCCCTCTT GTTTACAGAA ATATATCTGG ACCGTTATTT TACGGACCCA GAAGCGTTGT TGAATGAGTT AAATAACTTC TTAGATACAT TCAATGCAAA GGAAAATGCT CAAATTGACA AGTACAGCCG AGAGGACTTA AATAAGTTAG CCTTTTGGAA CGCAACAGGT TCTGGAAAAA CGCTATTGAT GCATGTGAAC ATATTACAAT ATTTGCATTA CTTATCAAAG CATCGTAAAG AAAATGAGCT AAATCAAATT ATCCTTCTTA CTCCAAAGGA GGACTTATCC ATTCAGCATT TAAGAGAATT TGAGCTTTCT GGAATCGATG CAGAATTGTT TGATAAAGAC CAGGGTAGCC TGTTCCGTCA GCAAAGTGTG ACCATCATTA ATATCCAAAG AATCAGAGAG GAGTCCGGGG AAAAGACGGT AGCCGTTGAT ACATTCGAAG GTAACAATTT GGTTCTTGTG GATGAGGGAC ATCGTGGTTC ACGTGGACAA GAGTGGAAGA GAATGCGAGA CCGTTTAAGT GAACAAGGGT TTGCATTCGA ATATTCGGCA ACATTTGGTC AAGCAGTTTC AGGAAATAAG GACTTACAAC AAGAATACGC CAAGTGTATC TTATTTGATT ACTCCTACCG TTATTTCTAT AGGGATGGAT TCGGAAAGGA TTTTCAAATC CTGAATTTGG CTGATGATTC AAATGAGGAT ATTCGTCGTC TCTACTTAAC AGCATGTTTG GTAACATTCT ATCAACAGCT TAAGCTATAT CAAGAGCATG AGCATTCTAT ACGACCTTTT CTGATTGAAA AACCTTTATT AGTGTTTGTA GGAAGTAGTG TAAATGCAGT TAGAAGAGAA AGAGGGAAAA AGGTTTCTGA CGTAACAGAT GTTCTTTTAT TCATTGCACA ATTTGCAAAA GAAGAAAGAC GAAGTATATC ATATCTAGAG CGGTTAATGA ATGGTAATGC TGGTTTACGG GACGAAAAAG GCAGGGAAAT ATTCCGTAAT GCATTTACCT ATTTAACAAC GTTGGAAATG ACCCCGGAAG CTCTATACGC GGATATGTTA TCTACAGTCT TCCATGCAAG TCATTCTGGA GCAGTGTTAC ATGTTGTGAA TTTGAAGGGA GTAGACAATG AAATTGCTTT ACGATTGGGG GACAATAAGC CATTTGGTGT TATCAATGTA GGTGACGCCG CGGCATTATG CAAACTATGT GAAACGTATC CTGAGTTACA TGTAACGGAA GAACAATTTA GAGATTCTTT GTTTCATCGG TTAAATGAGC GTGAATCGGA CATTCATGTA CTGATTGGTT CCAAGAAGTT TACAGAAGGT TGGAGTAGCT GGCGGGTAAG TACCATGGGG CTAATGAATA TGGGGAGGGG TGAAGGTGCG GAGATTATTC AACTCTTTGG TCGTGGTGTA CGCCTTAAAG GATATGGGAT GAGTTTGAAG CGAAGTGAAG CTTTACGATT AGAAAATACA GATATTCCAC ATCATATTAA GGTTTTAGAA ACACTTAATA TTTTTGGTGT TCGAGCTGAT TATATGCAGC AGTTTAGAGA ATACCTTGAG GATGAAGGAA TAAAAACGGA TAACGATAAG GAAACCATAC ACTTAAAAGT TATTCCTGAT TACGCAAGAA GAAAGATAAA CTTAAAAACC CTTGCCCTAA AGGATATCGA TATCAACTCC TTTAAGAAAA AGGGGCCAAA ACCTTCCCTT GAACCAGCGA CCGAAGAAAT GAATTTAAAA GTAATATTGA ATTGGTACCC TAAAATTCAG AGGACAGAAA GCCGAAAACA GTTCTCTTCA CAAGCATCGG TAATACCTGA TGTAATGGAT GAGTGTAAGT TAGAGTCCAT GCACCTTGCA TTCATGGATA TAGACCGGAT ATATTTTGAA CTTCAAAAGT TTAAAAATGA ACGTTCTTGG TACAATCTCA AGCTTTCAAA AGAAAAGATT AAAGAACTAC TCAACAGACA GGATTGGTAT ATCCTTTATA TTCCGAAAGC GGAGATGGTC TTTGATTCTT TTCAAAAGGT CAAAAGATGG GAAGAGATTG CAATTTCCCT ATTGAAGAAG TATTGTGAGC AATATTATTT GTACCGAAAA AAAGAGTGGG AAGCTCCTTT TATGGGGTAC AAGGATTTAA ACGAAGAGGA TAAGAACTTT ATTCGTGAGT ATCGCGTAAC TTATGATACA TCTGAAACGA CCTTGAAAAC AAAGCTAGAG CAGCTTCAGA AGATGCTTGA ATCAAACAAC ATGAAGCCGG TTCAACATGG TACAATGGAA ATATTTGATT TTGAGCAACA TCTTTATAAA CCTCTAATTT ATCTAGAGGG CAATGTGGCT ACTGTATCAC CGAAACCACT GAACAAGGGT GAATACCAAT TTATTAATGA TTTACGAGCT TACTATGAGA AGAATAAAAT CTTCTTCCAA GACAAAGAGT TATATTTGCT ACGTAATCAA TCAAGAGGTA AAGGAATTGG ATTCTTCGAA GCGGGAAATT TCCATCCGGA TTTTATCTTA TGGATTCTCT ATCAGGGTAA GCAATACATC ACATTTGTTG ACCCCAAAGG CATAAGAAAC ATGTCTGTGT ATGATAAGAA AATCCAGTTT TACCGGACCA TCAAGGAGAA AGAAGCTGAA CTTGGGAATA GCTCCATTGT GCTGAACTCG TTCATCATTT CAAATACGGA ATATGTAAAT CTCTTAAATA CAGGAACAAA GTTAAGCAAA GAAGAGCTTG AGAACTTTAA TGTATTGTTC CAAGTAGAGG ACAAAGCAAC ATACATTGGT AAGATGATTA ACAAGATTCT AGCATAA
|
Protein sequence | MPRNSNRATA IPFNKRLVLN RFMLHLFEAD DFDDLVMGMK EPELEGWDSD NISHFYHFLS NRTFDRHKLS NEMLRQYDEN IVRHTMRINL KRKEAIKWKY FQYLSLLFTE IYLDRYFTDP EALLNELNNF LDTFNAKENA QIDKYSREDL NKLAFWNATG SGKTLLMHVN ILQYLHYLSK HRKENELNQI ILLTPKEDLS IQHLREFELS GIDAELFDKD QGSLFRQQSV TIINIQRIRE ESGEKTVAVD TFEGNNLVLV DEGHRGSRGQ EWKRMRDRLS EQGFAFEYSA TFGQAVSGNK DLQQEYAKCI LFDYSYRYFY RDGFGKDFQI LNLADDSNED IRRLYLTACL VTFYQQLKLY QEHEHSIRPF LIEKPLLVFV GSSVNAVRRE RGKKVSDVTD VLLFIAQFAK EERRSISYLE RLMNGNAGLR DEKGREIFRN AFTYLTTLEM TPEALYADML STVFHASHSG AVLHVVNLKG VDNEIALRLG DNKPFGVINV GDAAALCKLC ETYPELHVTE EQFRDSLFHR LNERESDIHV LIGSKKFTEG WSSWRVSTMG LMNMGRGEGA EIIQLFGRGV RLKGYGMSLK RSEALRLENT DIPHHIKVLE TLNIFGVRAD YMQQFREYLE DEGIKTDNDK ETIHLKVIPD YARRKINLKT LALKDIDINS FKKKGPKPSL EPATEEMNLK VILNWYPKIQ RTESRKQFSS QASVIPDVMD ECKLESMHLA FMDIDRIYFE LQKFKNERSW YNLKLSKEKI KELLNRQDWY ILYIPKAEMV FDSFQKVKRW EEIAISLLKK YCEQYYLYRK KEWEAPFMGY KDLNEEDKNF IREYRVTYDT SETTLKTKLE QLQKMLESNN MKPVQHGTME IFDFEQHLYK PLIYLEGNVA TVSPKPLNKG EYQFINDLRA YYEKNKIFFQ DKELYLLRNQ SRGKGIGFFE AGNFHPDFIL WILYQGKQYI TFVDPKGIRN MSVYDKKIQF YRTIKEKEAE LGNSSIVLNS FIISNTEYVN LLNTGTKLSK EELENFNVLF QVEDKATYIG KMINKILA
|
| |