Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3122 |
Symbol | |
ID | 7976767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3145551 |
End bp | 3149168 |
Gene Length | 3618 bp |
Protein Length | 1205 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799908 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002951047 |
Protein GI | 239828423 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.950129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAGA GAAACCAATC TAAAAAACTA CTTTCCCTAG GAATGGCTTT ATCTTTAGCA TTTGGAGGAA TTCTCCCTTA TGCGAACGTT TCTGCAGCGG AACTAAAACA ACAAGAAACC GCTAATCAAC TGTTGAATCG ATTTGGCCTT TATTCAACAA ACGAAGTGAG AGCGAGTACG GTGCAAAAGA ATAAGGAAAA ACAGTTGTTT AGCGAAAATC AGCTGATTGT TAAATATAAA AATGCACTTT CATCAACAGA ACATCGAAAA GCTGGAACGA AACTTGTCAG ACGCATTTCT TCTTTACATT ATGATGTCGT ACAAATAACA GGAAAGAAAA AATTAGAAGA TGTCGTTAAC GCTTATGCAA AAAATCCAAA TGTCATTGCC GTATCACGTA GCGCATATTT TTCGAGAGCG GCTGTCGATG ATGCAAAAAG CTACGACATG TATCATCTTA CAACATTGCA AGTGAACAAA GCTCTTGCAC TAGCCGGCAA GCATCCTGTC AAAGTAGCGG TCGTCGACAC CGGAATAGAT ACTCATCACC CTGAATTAAA GAATAAAATT ATTTCAAACT ATAACGTTAT GAATCCTATG CAAAAAGGAG CCGTTGATGT TCACGGCACG CATGTTGCCG GAATCATCGC AGGGGAGAAA GGAAACGGAG TCGGTGGATA TGGCGTGTTC CCAAATGCAC AAATTATCTC GATCGACGTG TTCAACCGTT CCTTTTTCAG TTCCGATTAC ATTGTAGCAG AAGGAATTTT AGAAGCGATT CGCCAAAAGG CACAAGTGAT CAACTTAAGC TTAAGTTCAT CTGTTCCTTC TCCAATTGTA GAGGAAGCGA TTAAAAAAGC CATTGACGCA AACATTACTG TTGTCGCCGC AGCCGGCAAC TCCGGAATCA ATACATATGA ATATCCAGCT GCTCTTGAAG GCGTAATCGG AGTGGGAGCA ACCAATGCAA AGAATGAACT CGCCGATTTC TCTACTTATG GGCCTGCGGT CGATGTCGTC GCCCCTGGTG AAGATATTTA CAGTTCCGTT TACGATATCG ACAAAAAATC AACGTTTGCG AAACTAAGCG GAACGTCCAT GGCAACGCCG ATAGTAACAG CAACGGTGGC GATGCTGCTG TCGAAAAATC CAAAGCTGAC ACCGTATGAA ATTAATTATA TTTTAAATAA GACGGCGAAA GATTTAGGAG AAAAAGGATA TGACCTCAAA TACGGTTATG GATTAGTCAA TCCAGTTGCA GCGCTGCAGT TTAATCCAAA AAATATTCCA GCGAACCCTT ATATTGCCGA AAAAGATTTG TTAAAGAAGG CAAAAAACAT TAACGTAACG TCCTATTCTG TACAAACTGG CACAATCAAG AAATTGAACC AAACGGACTG GTATCAGTTT AAAGTTCAAA AAGGAGATTA CATTCAAACA AGATTGAAAG GATCAGCCGA CTACGACTAT AAATTGGACA TCTTATTCTT CCCTGCCGGC AAAGCCGCAC CGTCTACAAA AGTAAAAGTA AATGATACGT TGCAAGGTAA AGAAGAAGGC TACTTATTCG AAGTTCCTCA AGACGGAACA CTGGTGATCG GTGTAAAAGA CGCACTCGGC AACTATAGCG AAAGCGGCAA ATCGCGCTAT ACTCTTTCCA TTGATCGAAC AAGGAACAAA CTCGATGATG GCAACAATGC AGAAAATCCT TTCTTGATTC ACTCATTGCC ATATCATTCA CAGAGTAAAC ATGGATCACT GTTCTTTACA AACGAGTTAA CAGACACAGA ATCACCAGCT AACGAAGAGG CAACAGAAAC GGAAGAACCT AACGAAGAAA AAACGCTGGA AGATACTGAA CAAGAAGAAA CGAAACCAAT GCCGGGAGAC AGCGATTACT TCCGTTTTGC CGTACCAGAA TCGGAAGATG GAATGGAGCA AACGGTTCAT GTTTCGCTAA CAGGAGTTCC TGGAATTGAT TCCACCATCA ACTTATATGC AATCGAAAAA CTGGAAGAGG AACCAAGCTC TGAAGGAGAA ACGGAGCAAA CGCCTGAAGA ACAGCCGGCG GAAAATCAAT TTATGATTGA TTCCGTCAAC ATGAAAGGGT ATGGCGAAGG CGAGGAGCTC ACATTTAACG CGATACCGGG CATGGAATAT ATGATTGAAG TGACGAACAA ACCAACCTTC GACCCATTGA TGACGTTCCT TTTTGGAAAG CCGGAAATTG ATTTAACAAG AAACTTCTCT TCTCACTTGC CTTATCAGTT AACCATTGAT GCAACCACTT TACCAACCGA CGAAGATGGT TTTCCTATGA CAGAGGAAAT ACCGGAAGAA GAGTTAATAA AAGGAAACGT AGAAGCATAT TTAGCGAAAA AAGAAGCGTT AAAAGAGAAA CTAAGTGCAG TTATCTCAGA TTCAATGATG TTCTTTAGCG GCGATGACTG GATCAAAAAC ATTCGCAAAG CAGCAATGCC TTACCATTTA GGTCAAACAG CAAGCGGATA TTTTCAGTAC AACGGAGATG AAGATTGGTT TGCCTTTACG CCTAAAGAAA ACGGAATTTA TGAGTTCCGC TTTGCTGCAG ACGAAAATAA TGATGTTCCG ATGATGAACA TATTCGCCTA CAATGAAAAA CAAAAAGATT TCTCTTACCT CGGCTCTAAC ACTTCTTACG AATTTGAACC AAAAGATCGC TATCGGATTG GACTAAAAAG CGGTGAAACT TACTATATTC AATTGAACGA AAAAAGTTAT CGTCCATCCG TGAAAGCCTA TACGTTTACG TCTACACTGC TGGCAAGCAA CATAGCAGAT AAGTTTGAGA ATAACGATGA CTTTGAACAA GCAACAAACA TCGGACTAAA AGCAATTACA GGAAACTTTG CTTCCGCTCA AGATATAGAT GTTTACTACT TCAAACCAGA ACAAAACGGG CTTTACGGAT TCGTGGTGAA ACCGTTAAAT ATCCCGGCAA AATACACCAA GTTGCCAAAA GAACTGTTAA CGCCAATTGA CCCAGTAGTC ATTGTCATTG AAGATACGAA TGGCAACAAA AAATTAGACA AGGACGAGGA AGGCAAAGAG TTGTTGACCG ACCGCGGCTT CTCAAACGAA GAAGAGCGTG GTGCCTTTAA AGCGGTGAAA ACGAAAGGAT ACTTTATCGT TACTCTTCAT TATTTCGGAG ATACTTCGCT AATGCCATAC CAATTTACAT TGGCAAAAGC AGACTTACAA GATGAAGACA AAAATTCGGT TGTGAAAAAC AATATTCCAT CAAAACCGCT TTCATTGAAA AAAGCTAATA AAAACACATT CTATAATAGC GGTTATATGA ATGTTACCAA CAATAAAGGC GATGTTGATT ATTATGTGTT TACAGCTAAT GAGGAGCGAG CATATACATT TAAGCTCGAA CTGCCTTCAG ACCTTGATGG GATTATCAGC GTTTATGATG CAAAAGGCAA ACAAGTTGGC AAAGCAGATT ACTATATGAG CGGGGACGCC GAATATTTGA AATTAAAATT AAAGAAAGGA AAATACTTTA TTAAAGTGGA AGATGCATTT GGCAATGCAA GCATTAACCC ATATAAACTC ATTGTACGAA AAGAATAA
|
Protein sequence | MKKRNQSKKL LSLGMALSLA FGGILPYANV SAAELKQQET ANQLLNRFGL YSTNEVRAST VQKNKEKQLF SENQLIVKYK NALSSTEHRK AGTKLVRRIS SLHYDVVQIT GKKKLEDVVN AYAKNPNVIA VSRSAYFSRA AVDDAKSYDM YHLTTLQVNK ALALAGKHPV KVAVVDTGID THHPELKNKI ISNYNVMNPM QKGAVDVHGT HVAGIIAGEK GNGVGGYGVF PNAQIISIDV FNRSFFSSDY IVAEGILEAI RQKAQVINLS LSSSVPSPIV EEAIKKAIDA NITVVAAAGN SGINTYEYPA ALEGVIGVGA TNAKNELADF STYGPAVDVV APGEDIYSSV YDIDKKSTFA KLSGTSMATP IVTATVAMLL SKNPKLTPYE INYILNKTAK DLGEKGYDLK YGYGLVNPVA ALQFNPKNIP ANPYIAEKDL LKKAKNINVT SYSVQTGTIK KLNQTDWYQF KVQKGDYIQT RLKGSADYDY KLDILFFPAG KAAPSTKVKV NDTLQGKEEG YLFEVPQDGT LVIGVKDALG NYSESGKSRY TLSIDRTRNK LDDGNNAENP FLIHSLPYHS QSKHGSLFFT NELTDTESPA NEEATETEEP NEEKTLEDTE QEETKPMPGD SDYFRFAVPE SEDGMEQTVH VSLTGVPGID STINLYAIEK LEEEPSSEGE TEQTPEEQPA ENQFMIDSVN MKGYGEGEEL TFNAIPGMEY MIEVTNKPTF DPLMTFLFGK PEIDLTRNFS SHLPYQLTID ATTLPTDEDG FPMTEEIPEE ELIKGNVEAY LAKKEALKEK LSAVISDSMM FFSGDDWIKN IRKAAMPYHL GQTASGYFQY NGDEDWFAFT PKENGIYEFR FAADENNDVP MMNIFAYNEK QKDFSYLGSN TSYEFEPKDR YRIGLKSGET YYIQLNEKSY RPSVKAYTFT STLLASNIAD KFENNDDFEQ ATNIGLKAIT GNFASAQDID VYYFKPEQNG LYGFVVKPLN IPAKYTKLPK ELLTPIDPVV IVIEDTNGNK KLDKDEEGKE LLTDRGFSNE EERGAFKAVK TKGYFIVTLH YFGDTSLMPY QFTLAKADLQ DEDKNSVVKN NIPSKPLSLK KANKNTFYNS GYMNVTNNKG DVDYYVFTAN EERAYTFKLE LPSDLDGIIS VYDAKGKQVG KADYYMSGDA EYLKLKLKKG KYFIKVEDAF GNASINPYKL IVRKE
|
| |