Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3123 |
Symbol | |
ID | 7976768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3149209 |
End bp | 3151131 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799909 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002951048 |
Protein GI | 239828424 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00188421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AATGGTGTCT CGTTTTTTTC AGTGTGGTTC TCGCAATGAT GAGTTTTCTA CCCGTTGCTT CCACTTTTGC CGCTGAAACA AAGCGGGTAA ATGTGATTAT TCTTTTTAAA AAGCATGCCG ACGAAAAAAC AGTGGAACAG CTGCAAGGGA AAGTGTATCG ACAATTTGAA GTGATTCCAG CTTTATCTGC CAATGTGCCG TTGGCTTCTA TAGATATTTT GCGGCAATTA CCGGAAGTAG AATACGTGCA GCAGGATGAA ACGGTTGCGA TTGATGGACA GGTGGCAGAC TGGGGCGTAA AGAAAACAAA AGCAACCGAC ATGCATGCTC GCGGCGTTAC GGGTAAAGGC GTGAAGATAG CGATTTTAGA TACGGGAATC GATACGAAGC ATCCAGATTT ACGTGTATCA GGCGGTGCCT GTATGTTATC GTATTGCCCT AATTCTTATA ATGACGATAA TGGCCACGGC ACCCATGTGG CAGGGATTAT CGCAGCGAAA AATAATCGTA TAGGGGTGCT CGGAGTTGCT CCGGAGGCAA GCATCTATGC AGTGAAAGTA TTAAACCGCT TCGGCGAGGG AACGGTATCG ACTGTTTTAG CAGGGATTGA ATGGGCGATA CAAAACCATA TGGATATTAT TAACTTGAGT CTTGCTACTC CGGAAGGTTC TCCTGTGTTA AAAGAAGCGA TTGATAAAGC GTATCAGAAA GGAATTTTGA TTGTCGCGGC AGCTGGAAAT AACGGCAATA GAAATGGAAT AGGAGATACT GTCGAGTATC CTGCGAAATA TGACAGTGTC ATTGGTGTCT CTGCTGTGAA CAAAAACAAC GTTCGTATTG CCTATTCTGC GACAGGCCCA GCAGTCGAAG TCGCAGCGCC TGGAGAAGAC ATTTACAGCA CCGTTCCAGT TGCCTATGAT TCGGACAGCA ACCCGGACGG GTATACATGG ATGTCGGGAA CGTCCATGGC GGCTCCGTTT GTCAGCGGCA TATTGGCATT ATATAAACAG CAGTATCCAG ATAAAACCAA TGTTGAACTT CGCCAAATGT TAAGGGCGAA CGCACTCGAT TTAGGCGCGC CTGGAAAAGA CAATTGGTAC GGTTATGGAC TAGTACAGGC AAAACCAAAT AAACCGCCGC AGTTAGAAGT GAAGCTGCAA TCAAATGCGG CGGGGGTAGT AAACTTTTCC GTTACACCAC TTGCTGACAA TATTAAAGGA TATAATGTGT ATCGCAATGG AAAGCAAATA AAAGAGCTAC AGACTAGTTC GTCATTTACA GACTACGTAC TCAAAGGGAA TTACCGTTAT CAGTTTTCTT CCGTATATAG CGATGGAACA GAATCAGCAT TGTCTAATCC GATGACAGTA ACTGTCTCAG CGCCAGATTA TAAGGATTTA ACAAACGACA TTTGGTACGC GCCTCCGATC ATTTATTTAT CCAGCAGGGG AATCGTTACT GGATACAATA ATGGTACTAT CAAGCCGAAT GATTTTGTGA CGCGTGCTGA AGCGGTAGCG ATGTTAGGAC GGGCGTTACA TTTGGATTCG ACAAAACGGC CTACCGTTTT TAAAGATGTT GACCCTGCTA ATTTTGCATC TGGATATATT CAGTCTGCAT ATGAGAAGGG ATGGCTGAAC GGATTTCCAG ATGGCACGTT CCGACCAAAG CAGCCAATCA CCCGCGCGGA AACAGCTATA CTGTTAGCAA AGGCATACCA ATTTCCTAGT GCGCCTTCTC TCGCGTTTAA AGACGTTACG GACAAAGTGA CGGGACATAA GGAAATTTAC AAAGTGGCTG CAGCAAAAAT TACAAAGGGA TATCCTGATG GCACATTTAA ACCATATCAA TCAGTGAAAA GGCTAGAATT TTTCGTATTC ATCGCTCGCG CTGAAAATGA CGCATTTAAA TAA
|
Protein sequence | MMKKWCLVFF SVVLAMMSFL PVASTFAAET KRVNVIILFK KHADEKTVEQ LQGKVYRQFE VIPALSANVP LASIDILRQL PEVEYVQQDE TVAIDGQVAD WGVKKTKATD MHARGVTGKG VKIAILDTGI DTKHPDLRVS GGACMLSYCP NSYNDDNGHG THVAGIIAAK NNRIGVLGVA PEASIYAVKV LNRFGEGTVS TVLAGIEWAI QNHMDIINLS LATPEGSPVL KEAIDKAYQK GILIVAAAGN NGNRNGIGDT VEYPAKYDSV IGVSAVNKNN VRIAYSATGP AVEVAAPGED IYSTVPVAYD SDSNPDGYTW MSGTSMAAPF VSGILALYKQ QYPDKTNVEL RQMLRANALD LGAPGKDNWY GYGLVQAKPN KPPQLEVKLQ SNAAGVVNFS VTPLADNIKG YNVYRNGKQI KELQTSSSFT DYVLKGNYRY QFSSVYSDGT ESALSNPMTV TVSAPDYKDL TNDIWYAPPI IYLSSRGIVT GYNNGTIKPN DFVTRAEAVA MLGRALHLDS TKRPTVFKDV DPANFASGYI QSAYEKGWLN GFPDGTFRPK QPITRAETAI LLAKAYQFPS APSLAFKDVT DKVTGHKEIY KVAAAKITKG YPDGTFKPYQ SVKRLEFFVF IARAENDAFK
|
| |