Gene GWCH70_3122 details


Gene Information

Locus tag: GWCH70_3122
Symbol:
ID: 7976767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3145551
End bp: 3149168
Gene Length: 3618 bp
Protein Length: 1205 aa
Translation table: 11
GC content: 40%
IMG OID: 644799908
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_002951047
Protein GI: 239828423
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
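
The coordinate and length fields above are related by simple arithmetic, sketched below in Python. The function names are hypothetical. The sketch assumes 1-based inclusive coordinates and a gene length that includes the stop codon; the record does not state either convention, but both are consistent with the values it reports.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3145551, 3149168)
print(length)                     # 3618
print(protein_length_aa(length))  # 1205
```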


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.950129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAAGA GAAACCAATC TAAAAAACTA CTTTCCCTAG GAATGGCTTT ATCTTTAGCA 
TTTGGAGGAA TTCTCCCTTA TGCGAACGTT TCTGCAGCGG AACTAAAACA ACAAGAAACC
GCTAATCAAC TGTTGAATCG ATTTGGCCTT TATTCAACAA ACGAAGTGAG AGCGAGTACG
GTGCAAAAGA ATAAGGAAAA ACAGTTGTTT AGCGAAAATC AGCTGATTGT TAAATATAAA
AATGCACTTT CATCAACAGA ACATCGAAAA GCTGGAACGA AACTTGTCAG ACGCATTTCT
TCTTTACATT ATGATGTCGT ACAAATAACA GGAAAGAAAA AATTAGAAGA TGTCGTTAAC
GCTTATGCAA AAAATCCAAA TGTCATTGCC GTATCACGTA GCGCATATTT TTCGAGAGCG
GCTGTCGATG ATGCAAAAAG CTACGACATG TATCATCTTA CAACATTGCA AGTGAACAAA
GCTCTTGCAC TAGCCGGCAA GCATCCTGTC AAAGTAGCGG TCGTCGACAC CGGAATAGAT
ACTCATCACC CTGAATTAAA GAATAAAATT ATTTCAAACT ATAACGTTAT GAATCCTATG
CAAAAAGGAG CCGTTGATGT TCACGGCACG CATGTTGCCG GAATCATCGC AGGGGAGAAA
GGAAACGGAG TCGGTGGATA TGGCGTGTTC CCAAATGCAC AAATTATCTC GATCGACGTG
TTCAACCGTT CCTTTTTCAG TTCCGATTAC ATTGTAGCAG AAGGAATTTT AGAAGCGATT
CGCCAAAAGG CACAAGTGAT CAACTTAAGC TTAAGTTCAT CTGTTCCTTC TCCAATTGTA
GAGGAAGCGA TTAAAAAAGC CATTGACGCA AACATTACTG TTGTCGCCGC AGCCGGCAAC
TCCGGAATCA ATACATATGA ATATCCAGCT GCTCTTGAAG GCGTAATCGG AGTGGGAGCA
ACCAATGCAA AGAATGAACT CGCCGATTTC TCTACTTATG GGCCTGCGGT CGATGTCGTC
GCCCCTGGTG AAGATATTTA CAGTTCCGTT TACGATATCG ACAAAAAATC AACGTTTGCG
AAACTAAGCG GAACGTCCAT GGCAACGCCG ATAGTAACAG CAACGGTGGC GATGCTGCTG
TCGAAAAATC CAAAGCTGAC ACCGTATGAA ATTAATTATA TTTTAAATAA GACGGCGAAA
GATTTAGGAG AAAAAGGATA TGACCTCAAA TACGGTTATG GATTAGTCAA TCCAGTTGCA
GCGCTGCAGT TTAATCCAAA AAATATTCCA GCGAACCCTT ATATTGCCGA AAAAGATTTG
TTAAAGAAGG CAAAAAACAT TAACGTAACG TCCTATTCTG TACAAACTGG CACAATCAAG
AAATTGAACC AAACGGACTG GTATCAGTTT AAAGTTCAAA AAGGAGATTA CATTCAAACA
AGATTGAAAG GATCAGCCGA CTACGACTAT AAATTGGACA TCTTATTCTT CCCTGCCGGC
AAAGCCGCAC CGTCTACAAA AGTAAAAGTA AATGATACGT TGCAAGGTAA AGAAGAAGGC
TACTTATTCG AAGTTCCTCA AGACGGAACA CTGGTGATCG GTGTAAAAGA CGCACTCGGC
AACTATAGCG AAAGCGGCAA ATCGCGCTAT ACTCTTTCCA TTGATCGAAC AAGGAACAAA
CTCGATGATG GCAACAATGC AGAAAATCCT TTCTTGATTC ACTCATTGCC ATATCATTCA
CAGAGTAAAC ATGGATCACT GTTCTTTACA AACGAGTTAA CAGACACAGA ATCACCAGCT
AACGAAGAGG CAACAGAAAC GGAAGAACCT AACGAAGAAA AAACGCTGGA AGATACTGAA
CAAGAAGAAA CGAAACCAAT GCCGGGAGAC AGCGATTACT TCCGTTTTGC CGTACCAGAA
TCGGAAGATG GAATGGAGCA AACGGTTCAT GTTTCGCTAA CAGGAGTTCC TGGAATTGAT
TCCACCATCA ACTTATATGC AATCGAAAAA CTGGAAGAGG AACCAAGCTC TGAAGGAGAA
ACGGAGCAAA CGCCTGAAGA ACAGCCGGCG GAAAATCAAT TTATGATTGA TTCCGTCAAC
ATGAAAGGGT ATGGCGAAGG CGAGGAGCTC ACATTTAACG CGATACCGGG CATGGAATAT
ATGATTGAAG TGACGAACAA ACCAACCTTC GACCCATTGA TGACGTTCCT TTTTGGAAAG
CCGGAAATTG ATTTAACAAG AAACTTCTCT TCTCACTTGC CTTATCAGTT AACCATTGAT
GCAACCACTT TACCAACCGA CGAAGATGGT TTTCCTATGA CAGAGGAAAT ACCGGAAGAA
GAGTTAATAA AAGGAAACGT AGAAGCATAT TTAGCGAAAA AAGAAGCGTT AAAAGAGAAA
CTAAGTGCAG TTATCTCAGA TTCAATGATG TTCTTTAGCG GCGATGACTG GATCAAAAAC
ATTCGCAAAG CAGCAATGCC TTACCATTTA GGTCAAACAG CAAGCGGATA TTTTCAGTAC
AACGGAGATG AAGATTGGTT TGCCTTTACG CCTAAAGAAA ACGGAATTTA TGAGTTCCGC
TTTGCTGCAG ACGAAAATAA TGATGTTCCG ATGATGAACA TATTCGCCTA CAATGAAAAA
CAAAAAGATT TCTCTTACCT CGGCTCTAAC ACTTCTTACG AATTTGAACC AAAAGATCGC
TATCGGATTG GACTAAAAAG CGGTGAAACT TACTATATTC AATTGAACGA AAAAAGTTAT
CGTCCATCCG TGAAAGCCTA TACGTTTACG TCTACACTGC TGGCAAGCAA CATAGCAGAT
AAGTTTGAGA ATAACGATGA CTTTGAACAA GCAACAAACA TCGGACTAAA AGCAATTACA
GGAAACTTTG CTTCCGCTCA AGATATAGAT GTTTACTACT TCAAACCAGA ACAAAACGGG
CTTTACGGAT TCGTGGTGAA ACCGTTAAAT ATCCCGGCAA AATACACCAA GTTGCCAAAA
GAACTGTTAA CGCCAATTGA CCCAGTAGTC ATTGTCATTG AAGATACGAA TGGCAACAAA
AAATTAGACA AGGACGAGGA AGGCAAAGAG TTGTTGACCG ACCGCGGCTT CTCAAACGAA
GAAGAGCGTG GTGCCTTTAA AGCGGTGAAA ACGAAAGGAT ACTTTATCGT TACTCTTCAT
TATTTCGGAG ATACTTCGCT AATGCCATAC CAATTTACAT TGGCAAAAGC AGACTTACAA
GATGAAGACA AAAATTCGGT TGTGAAAAAC AATATTCCAT CAAAACCGCT TTCATTGAAA
AAAGCTAATA AAAACACATT CTATAATAGC GGTTATATGA ATGTTACCAA CAATAAAGGC
GATGTTGATT ATTATGTGTT TACAGCTAAT GAGGAGCGAG CATATACATT TAAGCTCGAA
CTGCCTTCAG ACCTTGATGG GATTATCAGC GTTTATGATG CAAAAGGCAA ACAAGTTGGC
AAAGCAGATT ACTATATGAG CGGGGACGCC GAATATTTGA AATTAAAATT AAAGAAAGGA
AAATACTTTA TTAAAGTGGA AGATGCATTT GGCAATGCAA GCATTAACCC ATATAAACTC
ATTGTACGAA AAGAATAA
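
A GC content figure such as the one reported for this gene can be recomputed from the sequence block above. The helper below is a minimal sketch with a hypothetical name; it assumes GC content means the share of G and C among all bases, and it ignores the whitespace used to group the sequence into blocks of ten. How the record rounds its reported percentage is not stated.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is dropped first, so the blocked
    layout used in the record can be pasted in unchanged.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Last line of the gene sequence: 4 of its 18 bases are G or C.
print(round(gc_percent("ATTGTACGAA AAGAATAA"), 1))  # 22.2
```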
 
Protein sequence
MKKRNQSKKL LSLGMALSLA FGGILPYANV SAAELKQQET ANQLLNRFGL YSTNEVRAST 
VQKNKEKQLF SENQLIVKYK NALSSTEHRK AGTKLVRRIS SLHYDVVQIT GKKKLEDVVN
AYAKNPNVIA VSRSAYFSRA AVDDAKSYDM YHLTTLQVNK ALALAGKHPV KVAVVDTGID
THHPELKNKI ISNYNVMNPM QKGAVDVHGT HVAGIIAGEK GNGVGGYGVF PNAQIISIDV
FNRSFFSSDY IVAEGILEAI RQKAQVINLS LSSSVPSPIV EEAIKKAIDA NITVVAAAGN
SGINTYEYPA ALEGVIGVGA TNAKNELADF STYGPAVDVV APGEDIYSSV YDIDKKSTFA
KLSGTSMATP IVTATVAMLL SKNPKLTPYE INYILNKTAK DLGEKGYDLK YGYGLVNPVA
ALQFNPKNIP ANPYIAEKDL LKKAKNINVT SYSVQTGTIK KLNQTDWYQF KVQKGDYIQT
RLKGSADYDY KLDILFFPAG KAAPSTKVKV NDTLQGKEEG YLFEVPQDGT LVIGVKDALG
NYSESGKSRY TLSIDRTRNK LDDGNNAENP FLIHSLPYHS QSKHGSLFFT NELTDTESPA
NEEATETEEP NEEKTLEDTE QEETKPMPGD SDYFRFAVPE SEDGMEQTVH VSLTGVPGID
STINLYAIEK LEEEPSSEGE TEQTPEEQPA ENQFMIDSVN MKGYGEGEEL TFNAIPGMEY
MIEVTNKPTF DPLMTFLFGK PEIDLTRNFS SHLPYQLTID ATTLPTDEDG FPMTEEIPEE
ELIKGNVEAY LAKKEALKEK LSAVISDSMM FFSGDDWIKN IRKAAMPYHL GQTASGYFQY
NGDEDWFAFT PKENGIYEFR FAADENNDVP MMNIFAYNEK QKDFSYLGSN TSYEFEPKDR
YRIGLKSGET YYIQLNEKSY RPSVKAYTFT STLLASNIAD KFENNDDFEQ ATNIGLKAIT
GNFASAQDID VYYFKPEQNG LYGFVVKPLN IPAKYTKLPK ELLTPIDPVV IVIEDTNGNK
KLDKDEEGKE LLTDRGFSNE EERGAFKAVK TKGYFIVTLH YFGDTSLMPY QFTLAKADLQ
DEDKNSVVKN NIPSKPLSLK KANKNTFYNS GYMNVTNNKG DVDYYVFTAN EERAYTFKLE
LPSDLDGIIS VYDAKGKQVG KADYYMSGDA EYLKLKLKKG KYFIKVEDAF GNASINPYKL
IVRKE
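
The protein sequence follows from the gene sequence under translation table 11, where the gene's TTG start codon is read as methionine. The sketch below illustrates this on the first and last lines of the gene sequence. The names are hypothetical, and the start-codon set is limited to the three most common bacterial start codons as a simplifying assumption (table 11 permits a few others).

```python
from itertools import product

# Standard sense-codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating TTG or GTG is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: begins with TTG, translated as M.
print(translate_cds("TTGAAAAAGA GAAACCAATC TAAAAAACTA"))  # MKKRNQSKKL
# Last 18 bases of the gene: ends at the TAA stop codon.
print(translate_cds("ATTGTACGAA AAGAATAA"))               # IVRKE
```

The second call is given a fragment from the end of the gene, so its first codon (ATT) is an ordinary isoleucine codon rather than a start codon, and the function leaves it as I.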