Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B0761 |
Symbol | vpR |
ID | 7183806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 4300210 |
End bp | 4302963 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643552266 |
Product | minor extracellular protease VpR |
Protein accession | YP_002447935 |
Protein GI | 218899524 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00881527 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTACATCTAC ACTATTAAGC ATGGCGCTTG TCTTTTCTAG TTTTGGAGCT TTAAGCGCTC ATGCTGAATC ACTACAAAAG GAGAAGCAAT TTAGTCCACA ATTAAAAACA ACGATTGAAC AGTGGGGAGA ACATAAAATC GCTCAAAATG TTGAAACGAA AACAACAAAA GAAATATCAG TAATTGTAGA ATTACAACAT GCACCTCTCG CTGCTCAAAC TAATATTCAG CATGCTCCAG ATTTACAAAA TAGTAATGCG CAGTCTTATC ATGCTGAGCT AAAAAAAGCA CAAGAAGATA CGACTAAGAA AATAAAAGAA AAAGCACCTG GTGCAAAAAT TAAAGAAGTG TATAATACGT TATTTTCTGG ATTCTCTATT TCAGTTCCTG GAGATCAAAT TACCTCTCTT GCCTCTTTAC CTGAAGTAAA AGCAGTCTAT CCGAACTTAA CATATAAATT GCATGAAACA TCAAAAAGTG CTACTAACGA AGAAGCACCA AATACCGGTG GACTGACAAT TGGTGCACCT GAAGCGTGGA ATTTAAAAGA TCCATCTGGC AAACCGCTTG ATGGAAAAGG CATGAAAGTA GCCATTATCG ATTCTGGCGT AGACTATACA CACCCTGACT TAAAGGCAAA TTATATCGGT GGATATGACA CGGTTGATGA AGATAACGAT CCAATGGATG GTAACGTACA TGGTACTCAT GTAGCTGGAA TTATTGCAGG TAATGGAAAA ATTAAAGGCG TTGCTCCAAA CGCTTCTATT CTAGCCTATC GTGTAATGAA TGACGGTGGA ACTGGTACAA CAGATGATAT TATCCAAGGA ATTGAGCGAG CAATTCAAGA TGGTGCGGAT GTGTTAAACC TCTCCCTTGG GCAGGATTTA AATGTACCTG ATCAGCCTGT AACATTAACG TTAGAACGAG CAGCGAAACT TGGTGTTACT GCTGTCGTTT CAAATGGAAA TGACGGACCA AAACCTTGGT CTGTTGATGC TCCCGGTAAC GCAAGTAGTG TCATTTCTGT TGGAGCATCT ACGGTTTCTA TCCCGTTTCC AACATTTCAA GTAACTGGTT CCAGCAAAAC ATATCAAGGG TTACCATTAT CAAAATCCGA TTTTCCAATA GGAAATGATT CTCCTCTTGT ATATGTTGGT TATGGCAATC CAAGTGATTA TGCAAAACAA GATGTGAAAG GGAAATTCGC ACTTATTTTA CAAGGTACTT CTAGTACATT AGTAAAAGCA GAACAAGCGA AGCAAGCTGG TGCACTTGGT GTGTTATTAA TCTCTAGCGA AAAAGAAATT AATATGATGC CGGAATATTT TTCACGTGAA CATCTAGCTG TCCCAGTAAT GCAATTATCA AATACAAATG GGGAAGAATT AAAAACTTTA ATTACAAAAC GAAAGAAAAA TATAAAAATT GGACAACCAA AGCAAACAGA ACTTATCGGT AACTTTAGTT CAAGAGGACC ATCACAAGGA AGTTGGCTAA TAAAGCCTGA TGTTGTTGCA CCTGGAGTAC AAATTACTAG TACAGTACCG CGAGGCGGCT ATGAATCGCA TAACGGAACA AGTATGGCTG CTCCGCAAGT AGCTGGAGCG GTTGCCCTCT TGCGTCAAAT GCATCCTGAT TGGACGACAG AACAATTGAA AGCGGCTCTT GCTAACAATG CAAAAACATT ACATGATGTC AATGAAAATA CATACCCTGT TATGGCACAA GGATCAGGTT TAATTAACAT TCCGAAAGCA GCTCAAACAA ATGTATTAGT AAAACCTAAC AATGTCAGCT TTGGTCTTAT TAAGCCAAAT AGCGGAAAAG TAAAACTGAC GCAAAATGTT ACATTACAAA ACCTTTCTAG TAAAAAGAAA AGTTTTTCAC CTCGTGTGGA GTTACTAGAT GCAAACACAA ACACAAAAGT AAAAGCTTCT GTCCCTTCAT CGATTAGCAT TCAACCGAAT AGTAGTACCG AAAAACCATT TACTATCACT GTAGATAGCT CACTACCACA AGGTGTGTAT ACTGGAAATG TATATGTAAA AGAACAAGGG GCAAAAGAAG AAATTCGAAT TCCATTCACA TTTAGTATCG AGCCTAAAGA TTATAAACGT ATCGATGGAC TTGAAATTAT TAATTCTACT TTTAGCCCAA ATGGCGACCA CGTACTAGAT GATAATCTCA TCAACTACTA TTTAGTTGCG CCTGTGGATG ATGTAACATT GCATGCAAAT TTAGTTACGA AAGAACGTGT AACGTATCAA GGGATTATCC ATCAAGCTAA AAATGCAACT CCTGGATACA AACCTTTCAA ATGGAATGGT ACAAAAGCAG ATGGCACTCC TTTAGCTGAC GGGCTATATC AAATCGAAGC AGTTGCTTCT AATTCTGGCG GAGAAACAAA ACAAACAGCT GCTGTATTTC TTGATCGAAC TGCACCTAAG TTAACATACG AAGTTGACCA AGAAAATCTC GTAATTAGAG GAAAAGTTGA TGATATTCTA CTAGATTGGA TGTCAGAATC TGGTTGGATA GCACCTGGTA TTCCAGTGAG AATGCAATAT GAAATTAACG GAAATGGTGT ATGGGACCAG GCATTCCTGA ACCCTTGGGA GAAAAGCTAT GACATTTATT TCGACCGTAC TCAATTACAA GAAGGAAAAA ATACTATTCA CATTGTAGCA ACTGATGCAG CTGGCAATAC CTCTAATTTA ACTGTTAATT TAGAAGTGAA ATAA
|
Protein sequence | MKKTTSTLLS MALVFSSFGA LSAHAESLQK EKQFSPQLKT TIEQWGEHKI AQNVETKTTK EISVIVELQH APLAAQTNIQ HAPDLQNSNA QSYHAELKKA QEDTTKKIKE KAPGAKIKEV YNTLFSGFSI SVPGDQITSL ASLPEVKAVY PNLTYKLHET SKSATNEEAP NTGGLTIGAP EAWNLKDPSG KPLDGKGMKV AIIDSGVDYT HPDLKANYIG GYDTVDEDND PMDGNVHGTH VAGIIAGNGK IKGVAPNASI LAYRVMNDGG TGTTDDIIQG IERAIQDGAD VLNLSLGQDL NVPDQPVTLT LERAAKLGVT AVVSNGNDGP KPWSVDAPGN ASSVISVGAS TVSIPFPTFQ VTGSSKTYQG LPLSKSDFPI GNDSPLVYVG YGNPSDYAKQ DVKGKFALIL QGTSSTLVKA EQAKQAGALG VLLISSEKEI NMMPEYFSRE HLAVPVMQLS NTNGEELKTL ITKRKKNIKI GQPKQTELIG NFSSRGPSQG SWLIKPDVVA PGVQITSTVP RGGYESHNGT SMAAPQVAGA VALLRQMHPD WTTEQLKAAL ANNAKTLHDV NENTYPVMAQ GSGLINIPKA AQTNVLVKPN NVSFGLIKPN SGKVKLTQNV TLQNLSSKKK SFSPRVELLD ANTNTKVKAS VPSSISIQPN SSTEKPFTIT VDSSLPQGVY TGNVYVKEQG AKEEIRIPFT FSIEPKDYKR IDGLEIINST FSPNGDHVLD DNLINYYLVA PVDDVTLHAN LVTKERVTYQ GIIHQAKNAT PGYKPFKWNG TKADGTPLAD GLYQIEAVAS NSGGETKQTA AVFLDRTAPK LTYEVDQENL VIRGKVDDIL LDWMSESGWI APGIPVRMQY EINGNGVWDQ AFLNPWEKSY DIYFDRTQLQ EGKNTIHIVA TDAAGNTSNL TVNLEVK
|
| |