Gene GWCH70_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3123 
Symbol 
ID7976768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3149209 
End bp3151131 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content44% 
IMG OID644799909 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002951048 
Protein GI239828424 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00188421 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA AATGGTGTCT CGTTTTTTTC AGTGTGGTTC TCGCAATGAT GAGTTTTCTA 
CCCGTTGCTT CCACTTTTGC CGCTGAAACA AAGCGGGTAA ATGTGATTAT TCTTTTTAAA
AAGCATGCCG ACGAAAAAAC AGTGGAACAG CTGCAAGGGA AAGTGTATCG ACAATTTGAA
GTGATTCCAG CTTTATCTGC CAATGTGCCG TTGGCTTCTA TAGATATTTT GCGGCAATTA
CCGGAAGTAG AATACGTGCA GCAGGATGAA ACGGTTGCGA TTGATGGACA GGTGGCAGAC
TGGGGCGTAA AGAAAACAAA AGCAACCGAC ATGCATGCTC GCGGCGTTAC GGGTAAAGGC
GTGAAGATAG CGATTTTAGA TACGGGAATC GATACGAAGC ATCCAGATTT ACGTGTATCA
GGCGGTGCCT GTATGTTATC GTATTGCCCT AATTCTTATA ATGACGATAA TGGCCACGGC
ACCCATGTGG CAGGGATTAT CGCAGCGAAA AATAATCGTA TAGGGGTGCT CGGAGTTGCT
CCGGAGGCAA GCATCTATGC AGTGAAAGTA TTAAACCGCT TCGGCGAGGG AACGGTATCG
ACTGTTTTAG CAGGGATTGA ATGGGCGATA CAAAACCATA TGGATATTAT TAACTTGAGT
CTTGCTACTC CGGAAGGTTC TCCTGTGTTA AAAGAAGCGA TTGATAAAGC GTATCAGAAA
GGAATTTTGA TTGTCGCGGC AGCTGGAAAT AACGGCAATA GAAATGGAAT AGGAGATACT
GTCGAGTATC CTGCGAAATA TGACAGTGTC ATTGGTGTCT CTGCTGTGAA CAAAAACAAC
GTTCGTATTG CCTATTCTGC GACAGGCCCA GCAGTCGAAG TCGCAGCGCC TGGAGAAGAC
ATTTACAGCA CCGTTCCAGT TGCCTATGAT TCGGACAGCA ACCCGGACGG GTATACATGG
ATGTCGGGAA CGTCCATGGC GGCTCCGTTT GTCAGCGGCA TATTGGCATT ATATAAACAG
CAGTATCCAG ATAAAACCAA TGTTGAACTT CGCCAAATGT TAAGGGCGAA CGCACTCGAT
TTAGGCGCGC CTGGAAAAGA CAATTGGTAC GGTTATGGAC TAGTACAGGC AAAACCAAAT
AAACCGCCGC AGTTAGAAGT GAAGCTGCAA TCAAATGCGG CGGGGGTAGT AAACTTTTCC
GTTACACCAC TTGCTGACAA TATTAAAGGA TATAATGTGT ATCGCAATGG AAAGCAAATA
AAAGAGCTAC AGACTAGTTC GTCATTTACA GACTACGTAC TCAAAGGGAA TTACCGTTAT
CAGTTTTCTT CCGTATATAG CGATGGAACA GAATCAGCAT TGTCTAATCC GATGACAGTA
ACTGTCTCAG CGCCAGATTA TAAGGATTTA ACAAACGACA TTTGGTACGC GCCTCCGATC
ATTTATTTAT CCAGCAGGGG AATCGTTACT GGATACAATA ATGGTACTAT CAAGCCGAAT
GATTTTGTGA CGCGTGCTGA AGCGGTAGCG ATGTTAGGAC GGGCGTTACA TTTGGATTCG
ACAAAACGGC CTACCGTTTT TAAAGATGTT GACCCTGCTA ATTTTGCATC TGGATATATT
CAGTCTGCAT ATGAGAAGGG ATGGCTGAAC GGATTTCCAG ATGGCACGTT CCGACCAAAG
CAGCCAATCA CCCGCGCGGA AACAGCTATA CTGTTAGCAA AGGCATACCA ATTTCCTAGT
GCGCCTTCTC TCGCGTTTAA AGACGTTACG GACAAAGTGA CGGGACATAA GGAAATTTAC
AAAGTGGCTG CAGCAAAAAT TACAAAGGGA TATCCTGATG GCACATTTAA ACCATATCAA
TCAGTGAAAA GGCTAGAATT TTTCGTATTC ATCGCTCGCG CTGAAAATGA CGCATTTAAA
TAA
 
Protein sequence
MMKKWCLVFF SVVLAMMSFL PVASTFAAET KRVNVIILFK KHADEKTVEQ LQGKVYRQFE 
VIPALSANVP LASIDILRQL PEVEYVQQDE TVAIDGQVAD WGVKKTKATD MHARGVTGKG
VKIAILDTGI DTKHPDLRVS GGACMLSYCP NSYNDDNGHG THVAGIIAAK NNRIGVLGVA
PEASIYAVKV LNRFGEGTVS TVLAGIEWAI QNHMDIINLS LATPEGSPVL KEAIDKAYQK
GILIVAAAGN NGNRNGIGDT VEYPAKYDSV IGVSAVNKNN VRIAYSATGP AVEVAAPGED
IYSTVPVAYD SDSNPDGYTW MSGTSMAAPF VSGILALYKQ QYPDKTNVEL RQMLRANALD
LGAPGKDNWY GYGLVQAKPN KPPQLEVKLQ SNAAGVVNFS VTPLADNIKG YNVYRNGKQI
KELQTSSSFT DYVLKGNYRY QFSSVYSDGT ESALSNPMTV TVSAPDYKDL TNDIWYAPPI
IYLSSRGIVT GYNNGTIKPN DFVTRAEAVA MLGRALHLDS TKRPTVFKDV DPANFASGYI
QSAYEKGWLN GFPDGTFRPK QPITRAETAI LLAKAYQFPS APSLAFKDVT DKVTGHKEIY
KVAAAKITKG YPDGTFKPYQ SVKRLEFFVF IARAENDAFK