Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1928 |
Symbol | |
ID | 7978754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1987029 |
End bp | 1988387 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644798758 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002949928 |
Protein GI | 239827304 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC AAACTCTTTT CGCTATCACT GCTGGAGTTG CTATTCTTTT TATTATGATT GCAGCAATTG TGACGACATA TCGTCCGACA GTCCACCATG AAGAAACGAA TCAGTTAAGA CCGAAACCAG CCCCTTATGT GGCAACAACG CCCCGTTCCC ATCCGACTGT CGTTACATTG AATAGTCTAG CAATGGGCGA AAAAATCAAA AACAAGTTAA ACAGCCACCC GAAAATAATA AAAATCGATC ATAACGGCCG CGACAAAAGC CATTATTTTA ATCATGAAAT TACGGTTCGG TTTCGAAAAC TTCCTGCAGC ACAAGAGCTC CAACGAATAG AACGAGCAAT CGATGGGAAA TTGATCAATC AGTTTGGCCG CTTTTTTATC TTTCGTTCTA ACAGCAAAAC ATATCATGAA TTACATGATT ATTTTGAATC CATCCCGACT GTTTCCTACA GTGAACCAAA CTATATTTAT TTACAAAACG AAATTCCAAA TGATTTATTA TATTCACGTT ATCAATGGAA TTTGCCAGCA ATCGACACTG AGGCAGGCTG GACGTTATCG CGCGGAAAAA AAGGCGTTGC CATTGCCGTT ATTGACAGCG GGATTGATTT AGATCATCCC GATCTCGTTC ATCGCCTGCA AAAAGGATAT AACGTTCTCG CGGATAATGC CATTCCCGAA GATGATAACG GTCACGGTAC GCACGTAGCG GGCATTATCG CCTCACAGCC AAACAACCGC GAAGGTGTTG CTGGAATAAC TTGGTTTAAC CCCATTATGC CGATAAAAGC ATTAAATTCC GAAGGATACG GCACAAGTTT TGACGTGGCC AAAGCCATTC ACTGGGCAGT TGACCACGGT GCAAAAGTCA TCAATTTAAG TCTTGGGAAT TATCAGCCTT CAACCATGTT GGAAGAAGCT ATTCGCTATG CATATGATCG CGACGTCGTA CTCATTGCCG CTTCTGGAAA TGATAGCACG GCGCAGCCTA GCTTTCCAGC TGCATACCCT GAAGTTATTA GCGTTGGCGC TGTTAATCCT GATCTTTCTT TCGCTCACTA TTCTAATTAC GGAACTTATT TAGATGTGGT AGCGCCAGGA ACAAATATTG CCAGCACTTT TTCGCAACAT CGATACGCGG CGCTTTCTGG AACATCCATG GCTGCACCGC ACGTAACTGC ATTAGCTGGC CTCATTCGTT CATTAAATCC ACATCTTACA AACGATGATG TAAAACAAAT CATTATCAAG ACAGCTACCG ATCTTGGAGA AAACGGCAAA GATCCGTATT ATGGATATGG TTTAATCAAT GTATATCGAG CGCTAGAGCT GGCAAACCAT TGGCGTTAG
|
Protein sequence | MNKQTLFAIT AGVAILFIMI AAIVTTYRPT VHHEETNQLR PKPAPYVATT PRSHPTVVTL NSLAMGEKIK NKLNSHPKII KIDHNGRDKS HYFNHEITVR FRKLPAAQEL QRIERAIDGK LINQFGRFFI FRSNSKTYHE LHDYFESIPT VSYSEPNYIY LQNEIPNDLL YSRYQWNLPA IDTEAGWTLS RGKKGVAIAV IDSGIDLDHP DLVHRLQKGY NVLADNAIPE DDNGHGTHVA GIIASQPNNR EGVAGITWFN PIMPIKALNS EGYGTSFDVA KAIHWAVDHG AKVINLSLGN YQPSTMLEEA IRYAYDRDVV LIAASGNDST AQPSFPAAYP EVISVGAVNP DLSFAHYSNY GTYLDVVAPG TNIASTFSQH RYAALSGTSM AAPHVTALAG LIRSLNPHLT NDDVKQIIIK TATDLGENGK DPYYGYGLIN VYRALELANH WR
|
| |