Gene Information |
Locus tag | Bcer98_2220 |
Symbol | |
ID | 5346660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 2308875 |
End bp | 2310068 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640839739 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001375465 |
Protein GI | 152975948 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| |
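The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2308875, 2310068)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values listed for Bcer98_2220, this reproduces the reported gene length of 1194 bp and protein length of 397 aa.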
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000312805 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAGGA AAATTAAATT TTTACTGCTT ACTGTATCTC TAGTCTTTGG TTGCTTCTTT TTTCATCCTT CTTCATCCAA TGCAGCACCT TCATCTAACG AATACGTCCC AAACCAGATT ATTGTAAAAT TTAAAGATAA TACTTCCCTT TCTAAATCCC AAGAATTCCA TAAGTCAGTC GGAGCGGAAG TGGTGTCAAA AGATGATGTA TTGGGCTTTG AAGTTGTCAA ATTCACTAAA GGCACTGTCA AAGATAAAAT AAAAATGTAC CAAAATAACC CTAATGTGGA ATATGCTGAA CCAAATTATT ATTTCTATGC GTTTTGGACT CCTAATGATC CCTACTTTAA TAATCAATAT GGATTATTAA AAATTCAAGC ACCTCAAGCA TGGGATGCAC AAAGAAGTGA CCCTGGTGTA AAAATAGCAA TTATTGATAC TGGTGTACAA GGGAACCATC CTGATTTATC ATCCAAAGTT ATATACGGTT ACGATTACGT TGATAATGAT GGTCAGTCGG ATGATGGAAA TGGGCATGGT ACACATTGCG CCGGTATTGC AGGTGCGATT ACAAATAATA ATATTGGCAT TGCAGGTGTT GCTCCTCAGT CCTCCCTATA TGCGGTACGC GTACTAGATA ATCAAGGAAG TGGTACTCTT GATGCTGTGG CAAAAGGCAT TAGAGAATCT GCTGATGCTG GTGCAAAAGT GATTAGCTTA AGTCTAGGGG CTACTAATGG AGGAACTGCC TTACAACAAG CTGTACAATA TGCTTGGAAC AAAGGTGCTG TCATTGTTGC TGCAGCAGGA AATGATGGAA ACACAAGACC AAATTATCCC GCTTATTACT CTGAAGTAAT TGCAGTAGCA TCTACAGATC AAAATGATCA AAAATCTTAT TTCTCCAATT ATGGAAGTTG GGTAGATGTA GCGGCACCTG GGTCAAGTAT TTATTCTACC TATAAAGGCA GTACTTATCG CTCATTAAGC GGTACATCTA TGGCAACACC TCATGTAGCA GGTGTTGCAG GATTACTAGC AAATCAAGGA TATACGAATG TACAAATTCG CCAAATCATG GAAACTACAG CTGATAAAGT ACCAGGAACA GGAACCTATT GGAAAAACGG AAGAGTGAAC GCAAATAAAG CTGTACAATA TGGAAATACA TTAAATGAAA ACAAAGCTTC CTAA
|
Protein sequence | MKRKIKFLLL TVSLVFGCFF FHPSSSNAAP SSNEYVPNQI IVKFKDNTSL SKSQEFHKSV GAEVVSKDDV LGFEVVKFTK GTVKDKIKMY QNNPNVEYAE PNYYFYAFWT PNDPYFNNQY GLLKIQAPQA WDAQRSDPGV KIAIIDTGVQ GNHPDLSSKV IYGYDYVDND GQSDDGNGHG THCAGIAGAI TNNNIGIAGV APQSSLYAVR VLDNQGSGTL DAVAKGIRES ADAGAKVISL SLGATNGGTA LQQAVQYAWN KGAVIVAAAG NDGNTRPNYP AYYSEVIAVA STDQNDQKSY FSNYGSWVDV AAPGSSIYST YKGSTYRSLS GTSMATPHVA GVAGLLANQG YTNVQIRQIM ETTADKVPGT GTYWKNGRVN ANKAVQYGNT LNENKAS
|
| |
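The sequence fields can be cross-checked against the GC content and translation table entries with a minimal sketch like the one below. It assumes the space-separated 10-character blocks are display formatting only, and it hard-codes the codon assignments and alternative start codons of NCBI translation table 11; the helper names are hypothetical. The gene begins with TTG, which normally encodes leucine but is read as methionine when it serves as the start codon, which is why the protein begins with M.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as translation starts (all read as Met).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "TTGAAAAGGA AAATTAAATT TTTACTGCTT"  # first 30 bp of the gene
    print(translate(first_blocks))  # MKRKIKFLLL
```

Passing the full gene sequence to `gc_fraction` and `translate` allows the results to be compared with the listed GC content and protein sequence.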