Gene Information |
Locus tag | GWCH70_3444 |
Symbol | |
ID | 7979515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012794 |
Strand | + |
Start bp | 8223 |
End bp | 11261 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 644800206 |
Product | type III restriction protein res subunit |
Protein accession | YP_002951345 |
Protein GI | 239828722 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
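
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminating stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(8223, 11261)
print(length)                  # 3039
print(protein_length(length))  # 1012
```

Both values agree with the Gene Length (3039 bp) and Protein Length (1012 aa) rows of this record.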
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCTTTTG AAGATAGTGA AAAACGATTT GAAGAGGATA TAGAAACCTA CCTTCTTACG GAAGGCGGGT ATGTGAAAGG CGACCAATCG AATTATGATA AAGAAAGAGC CATTGATCTC AATCAGTTAA TTGATTTTAT CAAAGAAACA CAAGAAAAAG AATGGACTCG TTATGAAAAA ATTTACGGTG AAGAAGCACC AAAAAAATTA TATAAGCGTT TAAATGACGA AATCGAAACA AATGGACTGC TGCATGTGCT TCGTCATGGA ATTACTGATC GTGGCGTAAA GCTCAAAATA GCCTCCTTTC GCCCTGAAAC AACGTTAAAT GAAAAAACAA TCAGAGATTA TCAAGCTAAT AAACTGACTG TCATACGTCA ATTCGCTTAT TCTACAGAAA ATCATAATAC GTTGGATATG GTTTTATCGT TGAATGGGAT TCCTATTGTA GCATTGGAGT TAAAGAATCA GTTTAAAGGA CAGTCCGTTG AAAATGGAAA AAAACAGTTT ATGTATGATC GTGACCCAAG AGAAAAAATT TTTCAATTTA ACAAACGCAT ACTAGTTTAT TTTGTAGTTG ATTTATATGA GGTTTGGATG ACCACAAAGT TAGATGGAAA AAACACTGTT TTTTTACCGT TCAATCAAGG CTCAAACGGT GCGGGAGAAG TAGGAGGGGC TGGAAACCCT GAAAATCCAG ACGGTTATGC GACCTCTTAT TTATGGGAAA AAGTATTGCA TAAAGATAGC TTAATGAATA TATTGCAGCG ATTCATGCAT TTAGAAGTAA AAAAAGAAAA GTTTATCAAA AACGGAAAGG AAAGCGTCAA AACAAGTTCT AAGCTGATTT TCCCACGCTA TCATCAGTTA GATGTGGTCA GAAAACTTGT TGAAGATGTT CGTCAAAAAG GAAGCGGGGA AAATTATCTT ATCCAACACA GCGCAGGTTC TGGAAAGTCG AACAGCATAG CATGGTTGGC ATATCATTTA GCAAGTTTGC ACAATGAAGA TAATGAAAGC ATTTTTACAT CAGTTATCGT GGTAACAGAT CGAACAGTAC TTGATCGGCA ATTACAACAA ACAATTTCCA GTTTTGATCA TACAACAGGG CTTGTAGAAA CCATTGATGA CAAAAAGACT TCCAAAGATT TAAGAGATGC TATTAATAAC GGAAAACGAA TTATTATCAC GACACTTCAA AAGTTTCCTG TTATTTATGA AGAAGTAGAA GTCAACAAGG GAAGTCGCTT TGCCGTTATT GTAGATGAAG CACATTCCTC CCAGACAGGA AAAAGTGCGA AAAAATTAAA AGCAGCATTA GCAGATACAG AGGAAGCGTT ACGGGAATAT GCAGAACTGG AAGCAGAAAT CGAAGCTGAG CAGTTAGATT TTGAAGATGA AATCGTTCAG GAACTTCTGA CACATGGCAG ACATAAAAAT TTAAGTTTCT TTGCTTTTAC AGCTACTCCT AAAGAAAAGA CATTAGAAAT GTTTGGGACA AAACAACCAG ATGGAACGTT TAAACCTTTT CATATTTACA GTATGCGTCA AGCTATCGAA GAAGGATTTA TATTAGATGT ATTACAAAAT TATATGACCT ACAAAACATA TTATCGTATT GCCAAAAACA CATCAGAGAA TCCGGAATTA TCGACAACAC AGGGAGTAAA AGCTATCAAG CGTTATCAAT CATTACATCC ATATACTCTA CAACAAAAGA CGGCTATCAT GGTCGAACAA TTTCGGAATG TAACTAGACA TAAAATTGGC GGTAAAGCAA AAGCAATGGT TGTCACAGCT 
TCTCGTTTAC ATGCAGTCCG TTATTTCCAT GAGTTTAAGA AGTATATTAA GGAAAAAGGC TACGATGATA TAGATGTGCT GGTAGCTTTT TCAGGCGTTG TTATTGATCA AAATGAAGAA TACCGAGAAG AAACTTTGAA CAAAACAAAA GATGGAAAAC GAATTAAAGA AAGTCAATTG AAAGAAGCTT TTCATTCAGA TGATTTCAAT ATATTAATTG TTGCGGAGAA ATATCAAACA GGATTTGATG AACCATTACT TCATACAATG TTTATCGATA AGAAATTATC AGGAGTAAAA GCTGTACAAA CTTTATCTCG GTTAAACAGG ACATATCCCG GAAAAGAGGA TACATTTATT CTGGATTTTG TCAATGAAGC TGAAGATATA AAAAAAGCTT TCCAGCCTTA TTATGAAGTA ACCGAACTGG ATAAAGAGAT TGATGTAAAT CTCATATATG ATACGAGAAC AAAGCTGAGA AACTTTAAAA TTTACAATGA TCAAGACATT AAGAAACTAA CGAGAATTTA TTTTAAAAAA GGGAAACAAA CAGAAAAGGA CTTAGGAAAA ATAGCCAGCC ACTTAATTCC AATTATTAAA CGCTATGAAG AACTTGATGA AGAAACACAA TATAAATTTC GGGTTACGGT TCGTAACTTT AATAAATGGT ATTCATATAT AACGCAACTC GTCAGAATGT TTGACAAAGA GTTGCATGAA GAGTACATTT TCACATCGTA TTTAATTAAG TTTATTCCGA AAAATAGTGC AGAAAAAATC AATATCGAAG ACAAAGTTAA ATTAGAGTAT TACAAGTTAG AAGAAACATT TAAAGGGACC ATCACATTAG AATCAAACAG CCCAGAAAAT GTACTCAAAA ATTCAGATAA TGTTGATACA GGCATAAAAC CTCCAGATGA TCAAGACTTA TTAGAAAATA TTATTCAACG AGTAAACAAA AGATTTGAAG GCAAATTTAC AGAAGCTGAT CGAGTCATCG TAGAAGGAAT TTACAAAAAA ACTGTTAAAG GCAATGAAAA ATTAAGGAGA TTTGCCAGAA ACAATGACGA AGAAATGTTT AATAAAAGTA TCTTTCCTGA CGTGTTTGAA AAGGTCGCCC AAGAACTGTA TATGGAACAA ATGAACGCTT ATTCTAAATT GTTTGAGGAT CGATCCTTTT ATAATGCAGT AATGGAGGCT GTAGCGAAAG AAGTGTATAA GGAATTGAGA CGTGAATGA
|
Protein sequence | MAFEDSEKRF EEDIETYLLT EGGYVKGDQS NYDKERAIDL NQLIDFIKET QEKEWTRYEK IYGEEAPKKL YKRLNDEIET NGLLHVLRHG ITDRGVKLKI ASFRPETTLN EKTIRDYQAN KLTVIRQFAY STENHNTLDM VLSLNGIPIV ALELKNQFKG QSVENGKKQF MYDRDPREKI FQFNKRILVY FVVDLYEVWM TTKLDGKNTV FLPFNQGSNG AGEVGGAGNP ENPDGYATSY LWEKVLHKDS LMNILQRFMH LEVKKEKFIK NGKESVKTSS KLIFPRYHQL DVVRKLVEDV RQKGSGENYL IQHSAGSGKS NSIAWLAYHL ASLHNEDNES IFTSVIVVTD RTVLDRQLQQ TISSFDHTTG LVETIDDKKT SKDLRDAINN GKRIIITTLQ KFPVIYEEVE VNKGSRFAVI VDEAHSSQTG KSAKKLKAAL ADTEEALREY AELEAEIEAE QLDFEDEIVQ ELLTHGRHKN LSFFAFTATP KEKTLEMFGT KQPDGTFKPF HIYSMRQAIE EGFILDVLQN YMTYKTYYRI AKNTSENPEL STTQGVKAIK RYQSLHPYTL QQKTAIMVEQ FRNVTRHKIG GKAKAMVVTA SRLHAVRYFH EFKKYIKEKG YDDIDVLVAF SGVVIDQNEE YREETLNKTK DGKRIKESQL KEAFHSDDFN ILIVAEKYQT GFDEPLLHTM FIDKKLSGVK AVQTLSRLNR TYPGKEDTFI LDFVNEAEDI KKAFQPYYEV TELDKEIDVN LIYDTRTKLR NFKIYNDQDI KKLTRIYFKK GKQTEKDLGK IASHLIPIIK RYEELDEETQ YKFRVTVRNF NKWYSYITQL VRMFDKELHE EYIFTSYLIK FIPKNSAEKI NIEDKVKLEY YKLEETFKGT ITLESNSPEN VLKNSDNVDT GIKPPDDQDL LENIIQRVNK RFEGKFTEAD RVIVEGIYKK TVKGNEKLRR FARNNDEEMF NKSIFPDVFE KVAQELYMEQ MNAYSKLFED RSFYNAVMEA VAKEVYKELR RE
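
The gene sequence starts with TTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that behaviour, together with a GC-content helper that tolerates the space-separated 10-base blocks used in this record. All function and variable names are illustrative assumptions; the codon assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initiating table-11 start codon is read as M."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "TTGGCTTTTG AAGATAGTGA AAAACGATTT"
print(translate_cds(prefix))  # MAFEDSEKRF
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content row.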