Gene GWCH70_1928 details

Gene Information

Locus tag: GWCH70_1928
Symbol:
ID: 7978754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1987029
End bp: 1988387
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 43%
IMG OID: 644798758
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_002949928
Protein GI: 239827304
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
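
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1987029
END_BP = 1988387
GENE_LENGTH_BP = 1359
PROTEIN_LENGTH_AA = 452


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1359
    print(coding_length_aa(GENE_LENGTH_BP))  # 452
```

Under these assumptions, 1988387 - 1987029 + 1 gives 1359 bp, and 1359 / 3 gives 453 codons, of which 452 encode amino acids.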


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAC AAACTCTTTT CGCTATCACT GCTGGAGTTG CTATTCTTTT TATTATGATT 
GCAGCAATTG TGACGACATA TCGTCCGACA GTCCACCATG AAGAAACGAA TCAGTTAAGA
CCGAAACCAG CCCCTTATGT GGCAACAACG CCCCGTTCCC ATCCGACTGT CGTTACATTG
AATAGTCTAG CAATGGGCGA AAAAATCAAA AACAAGTTAA ACAGCCACCC GAAAATAATA
AAAATCGATC ATAACGGCCG CGACAAAAGC CATTATTTTA ATCATGAAAT TACGGTTCGG
TTTCGAAAAC TTCCTGCAGC ACAAGAGCTC CAACGAATAG AACGAGCAAT CGATGGGAAA
TTGATCAATC AGTTTGGCCG CTTTTTTATC TTTCGTTCTA ACAGCAAAAC ATATCATGAA
TTACATGATT ATTTTGAATC CATCCCGACT GTTTCCTACA GTGAACCAAA CTATATTTAT
TTACAAAACG AAATTCCAAA TGATTTATTA TATTCACGTT ATCAATGGAA TTTGCCAGCA
ATCGACACTG AGGCAGGCTG GACGTTATCG CGCGGAAAAA AAGGCGTTGC CATTGCCGTT
ATTGACAGCG GGATTGATTT AGATCATCCC GATCTCGTTC ATCGCCTGCA AAAAGGATAT
AACGTTCTCG CGGATAATGC CATTCCCGAA GATGATAACG GTCACGGTAC GCACGTAGCG
GGCATTATCG CCTCACAGCC AAACAACCGC GAAGGTGTTG CTGGAATAAC TTGGTTTAAC
CCCATTATGC CGATAAAAGC ATTAAATTCC GAAGGATACG GCACAAGTTT TGACGTGGCC
AAAGCCATTC ACTGGGCAGT TGACCACGGT GCAAAAGTCA TCAATTTAAG TCTTGGGAAT
TATCAGCCTT CAACCATGTT GGAAGAAGCT ATTCGCTATG CATATGATCG CGACGTCGTA
CTCATTGCCG CTTCTGGAAA TGATAGCACG GCGCAGCCTA GCTTTCCAGC TGCATACCCT
GAAGTTATTA GCGTTGGCGC TGTTAATCCT GATCTTTCTT TCGCTCACTA TTCTAATTAC
GGAACTTATT TAGATGTGGT AGCGCCAGGA ACAAATATTG CCAGCACTTT TTCGCAACAT
CGATACGCGG CGCTTTCTGG AACATCCATG GCTGCACCGC ACGTAACTGC ATTAGCTGGC
CTCATTCGTT CATTAAATCC ACATCTTACA AACGATGATG TAAAACAAAT CATTATCAAG
ACAGCTACCG ATCTTGGAGA AAACGGCAAA GATCCGTATT ATGGATATGG TTTAATCAAT
GTATATCGAG CGCTAGAGCT GGCAAACCAT TGGCGTTAG
 
Protein sequence
MNKQTLFAIT AGVAILFIMI AAIVTTYRPT VHHEETNQLR PKPAPYVATT PRSHPTVVTL 
NSLAMGEKIK NKLNSHPKII KIDHNGRDKS HYFNHEITVR FRKLPAAQEL QRIERAIDGK
LINQFGRFFI FRSNSKTYHE LHDYFESIPT VSYSEPNYIY LQNEIPNDLL YSRYQWNLPA
IDTEAGWTLS RGKKGVAIAV IDSGIDLDHP DLVHRLQKGY NVLADNAIPE DDNGHGTHVA
GIIASQPNNR EGVAGITWFN PIMPIKALNS EGYGTSFDVA KAIHWAVDHG AKVINLSLGN
YQPSTMLEEA IRYAYDRDVV LIAASGNDST AQPSFPAAYP EVISVGAVNP DLSFAHYSNY
GTYLDVVAPG TNIASTFSQH RYAALSGTSM AAPHVTALAG LIRSLNPHLT NDDVKQIIIK
TATDLGENGK DPYYGYGLIN VYRALELANH WR
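
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that layout, translate the nucleotide sequence, and compute a GC fraction. Only the first 60 bases and first 20 residues from this record are embedded, to keep the example short. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (the two tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid string follows the
# standard genetic code, whose codon assignments table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence and first 20 residues of the protein
# sequence, copied from this record.
GENE_PREFIX = clean(
    "ATGAATAAAC AAACTCTTTT CGCTATCACT GCTGGAGTTG CTATTCTTTT TATTATGATT"
)
PROTEIN_PREFIX = clean("MNKQTLFAIT AGVAILFIMI")

if __name__ == "__main__":
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
    print(round(gc_fraction(GENE_PREFIX), 2))
```

Pasting the full gene sequence into `clean` in place of the prefix would allow the same comparison against the complete 452-residue protein and the listed GC content.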