Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_4437 |
Symbol | vpR |
ID | 2750837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | + |
Start bp | 4110177 |
End bp | 4112930 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637281235 |
Product | minor extracellular protease VpR |
Protein accession | NP_980730 |
Protein GI | 42783483 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTACATCTAC ACTATTAAGT ATGGCGCTCG TCTTTTCCAG TTTTGGAGCT TTAAGCGCAC ATGCTGAATC ACTGCAAAAG GAGAAGCAAT TTAGTCCACA ACTAAAAACA ACGATTGAAC AGTGGGGAGA AAATAAAATT GCGCAAAATG TTGAAACAAA AACTGCAAAA GAAATTTCAG TCATTGTAGA ATTGCAACAT GCACCTCTCG CAGCACAAGG TAATATTCAG CATGCTCCAG ATTTACAAAG TAATCATGCA CAGTCTTATC ATGCCGAGCT AAAAAAAGCA CAAGAAGATA CGACTAAGAA AATAAAAGAA AAAGCACCTG GTGCAAAAAT TAAAGAAGTG TATAATACGT TATTTTCCGG ATTCTCTATT TCAGTTCCTG GAGATCAAAT TACTGCTCTT GCTTCTTTAC CTGAAGTAAA AGCAGTCTAT CCGAACTTAA CATATAAACT ACATGAAACA TCAAAAAGTA CTACTAATGA AGAAGCACCA AACATCGGCG GACCGACAAT TGGTGCAACT GAAGCGTGGA ATTTAAAAGA CCCATCTGGC AAACCTCTTG ATGGAAAAGG GATGAAAGTA GCAATTATCG ATTCTGGCGT AGACTATACA CATCCTGACT TAAAAGCAAA TTATATTGGT GGATATGACA CAGTCGATGA AGATAATGAT CCTATGGATG GTAACGTACA TGGTACTCAT GTAGCAGGAA TTATTGCTGG TAATGGAAAA ATTAAGGGCG TTGCTCCAAA CGCTTCTATT TTAGCCTATC GTGTCATGAA TGACGGTGGA GCTGGTACAA CAGACGATAT TATTCAAGGT ATTGAACGAG CAATTCAAGA TGGTGCTGAT GTGTTAAATC TGTCTCTTGG ACAAGATTTA AATGTACCTG ATCAACCTGT AACTTTAACG TTAGAACGTG CAGCAAAGCT TGGGGTTACT GCAGTCGTTT CAAATGGAAA TGATGGCCCA AAACCGTGGT CTGTTGATGC CCCTGCCAAT GCAAGTAGTG TTATATCAGT TGGAGCATCT ACAGTTTCTA TCCCGTTTCC AACATTCCAA GTAGCTGGTT CTACCAAAAC ATATCAAGGG TTACCGTTAT CAAAATCGGA TTTCCCAATA GGAAATGATT CTCCACTTGT ATATGTTGGC TATGGTAATC CAAGCGATTA TGCAAAACAA GATGTGAAAG GAAAGTTTGC ACTTGTTTTA CAAGGTACTT CAAATACGTT AGTAAAAGCT GAACAAGCGA AACAAGCTGG TGCAATTGGT GTACTATTAA TTTCTAACGA AAAAGAAATC AATATTATGC CTGAATACTT TGCACGAGAA AATCTAGCTC TTCCAGTTAT GCAATTATCA AATGCAAATG GTGAAGAGCT AAAAAACTTA ATTACAAAGC GCAAGAAAAA TATAAAAATT GGACAACCAG TTCCAACTGA ATTAATTGGT AACTTTAGTT CTAGAGGACC ATCACAAGGT AGTTGGCTTA TAAAACCAGA TATCGTTGCA CCAGGAGTAC AAATTACGAG TACTGTACCT AGAGGTGGCT ATGAGTCTCA CAACGGAACC AGTATGGCTG CCCCTCAAGT AGCTGGAGCT GTTGCTTTAT TACGTCAAAT GCACCCTGAT TGGACGCTAG AACAATTGAA AGCATCTCTT GCCAATACAG CAAAAACTTT AAAAGATGTA AATGAAAATA CATATCCTGT CATGGCGCAA GGATCTGGTT TAATTAACAT CCCGAAAGCA GCTCAAACAG ATGTATTAGT CAAACCTAAC AATGTGAGCT TCGGTCTTAT TAAGCCAAAC AGTGGTAAAG TAAAACTGAC GCAAAATATT ACGTTACAAA ATCTATCTAG TAAAAAGAAA AGTTTTTCAA CTCGTGTGGA ATTGCTAGAT GCAAACACAA AAACAAAAGT AAAAACTTCT GTACCTTCAT CGATTAGCAT ACAGCCGAAT AGTAGTACCG AAAAACCATT TACTATCACT GTCGATAGTT CGCTACCCCA AGGTGTATAT ACTGGAAATG TGTATGTAAA AGAACAAGGT AAGAATGAAG AAACTCGCAT TCCATTTACA TTTAGTATCG ATCCTAAAGA TTACAAACGT ATAGATGGAC TCGAAATTAT TAATTCTACT TTTAGTCCAA ATGGTGACCA AATATTAGAC GATAATCTCA TCAACTACTA TTTAGTTGCA CCTGTGGATG ATGTAACATT GCATGCAAAT TTAGTTACAA AAGAACATGT AACGTACCAA GGGATTATCC ATCAAGCTAA AAATGAAACA GCTGGATACA AACCTTTCAA ATGGAATGGT ACAAAAACAG ATGGTACTCC GTTAGCTGAC GGTCTATACC AAATCGAAGC AGTTGCCTCT AATTCTGGTA GGGAAACGAA ACAAACAGCT GCTGTATTTC TTGATCGAAC TGCACCTAAG TTAACATACG AAGTTGACCA AGAAAACCTT GTAATTACAG GAAAAGTGGA TGATATCCTG CTAGATTGGA TGACAGAATC TGGTTGGGTA GCACCTGGTA TTCCAGTGAG AATGCAATAT GAAATCAACG GAAATGGTGT ATGGGAAAGT GTGTTCCTAA ATCCTTGGGA GAAAAATTAC AGCATTTATT TCGATCGTAG TCAATTACAA GAAGGAAAGA ATACAATTCA TATTGTAGCA ACTGATGCAG CTGGAAATAC ATCTAATTTA AATGTTGATT TAGATGTGAA ATAA
|
Protein sequence | MKKTTSTLLS MALVFSSFGA LSAHAESLQK EKQFSPQLKT TIEQWGENKI AQNVETKTAK EISVIVELQH APLAAQGNIQ HAPDLQSNHA QSYHAELKKA QEDTTKKIKE KAPGAKIKEV YNTLFSGFSI SVPGDQITAL ASLPEVKAVY PNLTYKLHET SKSTTNEEAP NIGGPTIGAT EAWNLKDPSG KPLDGKGMKV AIIDSGVDYT HPDLKANYIG GYDTVDEDND PMDGNVHGTH VAGIIAGNGK IKGVAPNASI LAYRVMNDGG AGTTDDIIQG IERAIQDGAD VLNLSLGQDL NVPDQPVTLT LERAAKLGVT AVVSNGNDGP KPWSVDAPAN ASSVISVGAS TVSIPFPTFQ VAGSTKTYQG LPLSKSDFPI GNDSPLVYVG YGNPSDYAKQ DVKGKFALVL QGTSNTLVKA EQAKQAGAIG VLLISNEKEI NIMPEYFARE NLALPVMQLS NANGEELKNL ITKRKKNIKI GQPVPTELIG NFSSRGPSQG SWLIKPDIVA PGVQITSTVP RGGYESHNGT SMAAPQVAGA VALLRQMHPD WTLEQLKASL ANTAKTLKDV NENTYPVMAQ GSGLINIPKA AQTDVLVKPN NVSFGLIKPN SGKVKLTQNI TLQNLSSKKK SFSTRVELLD ANTKTKVKTS VPSSISIQPN SSTEKPFTIT VDSSLPQGVY TGNVYVKEQG KNEETRIPFT FSIDPKDYKR IDGLEIINST FSPNGDQILD DNLINYYLVA PVDDVTLHAN LVTKEHVTYQ GIIHQAKNET AGYKPFKWNG TKTDGTPLAD GLYQIEAVAS NSGRETKQTA AVFLDRTAPK LTYEVDQENL VITGKVDDIL LDWMTESGWV APGIPVRMQY EINGNGVWES VFLNPWEKNY SIYFDRSQLQ EGKNTIHIVA TDAAGNTSNL NVDLDVK
|
| |