Gene Information |
Locus tag | BCZK4101 |
Symbol | vpr |
ID | 3026087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 4206717 |
End bp | 4209470 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637548315 |
Product | minor extracellular protease |
Protein accession | YP_085680 |
Protein GI | 52141149 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
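The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Both assumptions agree with the values in this record, but the record itself does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
length_bp = gene_length(4206717, 4209470)
print(length_bp)                  # 2754
print(protein_length(length_bp))  # 917
```

Because the record reports the gene as unspliced, no intron handling is needed for this check.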
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTACATCTAC ACTATTAAGT ATGGCGCTCG TCTTTTCCAG TTTTGGAGCT TTAAGCGCAC ATGCTGAATC GCTGCAAAAG GAGAAACAGT TTAGTCCACA GTTAAAAGCG AACATTGAAC AATGGGGAGA AAACAAAATT GCGCAAAATG TTGAAACAAA AACATCAAAA GAAATTTCTG TCATTGTAGA ATTGCAACAT GCTCCACTCG CTTCACAAAG TAACATTCAG CATGCTCCAG ATTTACAAAA TAATAATGCA CAGTCTTATC ATACCGAGCT TAAAAAAGCA CAAGAAGATA CGACTAAGAA AATAAAAGAA AAAGCACCTG GTGCAAAAAT TAAAGAAGTG TATAATACGT TATTTTCTGG ATTCTCTATT TCAGTTCCTG GAGATCAAAT TACCGCTCTT GCTTCTTTAC CCGAAGTAAA AGCAGTCTAC CCGAACTTAA CATATAAATT ACATGAAACA TCAAAAAGCA CTACTAATGA AGAAGCACCA AATATCGGCG GACCGACAAT TGGTGCAACT GAAGCATGGA ATTTAAAAGA CCCATCTGGC AAACCGCTTG ATGGAAAAGG TATGAAAGTA GCGATTATCG ACTCTGGTGT AGACTATACA CATCCTGACT TAAAAGCAAA TTATATCGGT GGATATGACA CAGTTGATGA AGATAACGAT CCAATGGATG GTAACGTACA TGGTACCCAT GTAGCTGGAA TTATTGCTGG TAACGGAAAA ATTAAAGGCG TTGCTCCAAA TGCTTCTATT CTAGCCTATC GTGTAATGAA TGATGGTGGA ACTGGTACAA CAGAAGATAT TATTCAAGGT ATTGAACGAG CAATCCAAGA CGGTGCTGAC GTTTTAAATT TATCTCTTGG ACAAAGCTTA AATACACCTG ATCAGCCTGT AACATTAACA TTAGAACGTG CAGCAAAACT TGGGGTTACT GCAGTTGTTT CAAATGGAAA TGATGGCCCA CATCCTTGGT CTGTTGATGC ACCTGGAAAT GCAAGCAGCG TTATATCAGT TGGAGCATCT ACTGTTTCTA TCCCGTTTCC AACGTTCCAA GTAGCTGGTT CCAGCAAAAC ATATCAAGGG TTACCGTTAT CAAAGTCCGA TTTCCCAATA GGAAATGATT CCCCTCTTGT ATACGTTGGC TATGGTAATC CAAGTGATTA TGCAAAACAA GATGTGAAAG GGAAATTTGC ACTTGTTTTA CAAGGTACTT CTAGTACGTT AGTAAAAGCA GAACAAGCGA AACAAGCTGG TGCAGTTGGT GTACTATTCA TTTCTACAGA AAAAGAAATC AATATTATGC CAGAATATTT TATGCGTGAA AATCTAGCCC TTCCAGTTAT GCAATTATCA AATGTACACG GTGAAGAGTT GAAAAACTTA ATTACAAAGC GTAAGAAAAA TATAAAAATT GGCCAACCAA ATCCAACCGA ATTGATTGCT AACTTCAGTT CCAGAGGTCC GTCACAAGGA AGTTGGCTTA TAAAACCAGA TATAATTGCA CCTGGCGTAC AAATTATGAG TACAGTACCG AGAGGCGGCT ATGAATCTCA TAATGGTACA AGTATGGCTG CTCCACAAGT AGCTGGAGCG GTTGCCCTCT TGCGTCAAAT GCATCCTGAT TGGACGACAG AGCAATTAAA AGCATCCCTT GCCAATACCG CAAAAATTTT AAAAGATGTG AATGAAAATA CATATCCTAT TATGACACAA GGATCTGGTT TAATTAACAT CCCGAAAGCA GCTCAAACAG ATGTATTAGT CAAACCTAAC 
AATGTGAGCT TCGGTCTTAT TAAGCCAAAC AGTGGTAAAG TAAAACTGAC GCAAAATATT ACGTTACAAA ATCTATCTAG TAAAAAGAAA AGTTTTTCAA CTCGTGTGGA ATTGCTAGAT ACAAACACAA AAACAAAAGT AAAAACTTCT GTACCTTCAT CGATTAGCAT ACAGCCGAAT AATAGTACCG AAAAACCATT TACTATCACT GTCGATAGTT CGCTACCCCA AGGTGTCTAT ACTGGAAATG TGTATATAAA AGAACAAGGT AAGAAGGAAG AAACTCGCAT TCCATTTACA TTTAGTATCG ATCCTAAAGA TTACAAACGT ATAGATGGAC TCGAAATTAT TAATTCTACA TTTAGTCCAA ATGGTGACCA AATATTAGAC GATAATCTCA TCAACTACTA TTTAGTTGCA CCTGTAGATG ATGTAACATT GCATGCAAAT TTAGTTACAA AAGAACATGT AACGTACCAA GGGATTATCC ATCAAGCTAA AAATGAAACA GCTGGATACA AACCTTTCAA ATGGAATGGT ACAAAAGCAG ATGGTACTCC GTTAGCTGAC GGTCTATACC AAATCGAAGC AGTTGCCTCT AATTCTGGTG GGGAAACAAA ACAAACAGCT GCTGTATTTC TTGATAGAAC TGCACCTAAG TTAACATACG AAATTGACCA AGAAAACCTT GTAATCACAG GAAAAGTGGA TGATATCTTA CTAGATTGGA TGACAGAATC TGGTTGGGTA GCACCTGGAA TTCCAGTGAG ACTACAGTAT GAAATCAACG GAAATGGTGT ATGGGAAAGT GCATTCCTAA ATCCTTGGGA GAAAAATTAC GGCATTTATT TAGATCGTAC TCAATTACAA GAAGTTAAAA ATACAATTCA CATTGTAGCA ACAGATGCAG CTGGAAATAC ATCTAATTTA AATGTTGATT TAGATGTGAA ATAA
|
Protein sequence | MKKTTSTLLS MALVFSSFGA LSAHAESLQK EKQFSPQLKA NIEQWGENKI AQNVETKTSK EISVIVELQH APLASQSNIQ HAPDLQNNNA QSYHTELKKA QEDTTKKIKE KAPGAKIKEV YNTLFSGFSI SVPGDQITAL ASLPEVKAVY PNLTYKLHET SKSTTNEEAP NIGGPTIGAT EAWNLKDPSG KPLDGKGMKV AIIDSGVDYT HPDLKANYIG GYDTVDEDND PMDGNVHGTH VAGIIAGNGK IKGVAPNASI LAYRVMNDGG TGTTEDIIQG IERAIQDGAD VLNLSLGQSL NTPDQPVTLT LERAAKLGVT AVVSNGNDGP HPWSVDAPGN ASSVISVGAS TVSIPFPTFQ VAGSSKTYQG LPLSKSDFPI GNDSPLVYVG YGNPSDYAKQ DVKGKFALVL QGTSSTLVKA EQAKQAGAVG VLFISTEKEI NIMPEYFMRE NLALPVMQLS NVHGEELKNL ITKRKKNIKI GQPNPTELIA NFSSRGPSQG SWLIKPDIIA PGVQIMSTVP RGGYESHNGT SMAAPQVAGA VALLRQMHPD WTTEQLKASL ANTAKILKDV NENTYPIMTQ GSGLINIPKA AQTDVLVKPN NVSFGLIKPN SGKVKLTQNI TLQNLSSKKK SFSTRVELLD TNTKTKVKTS VPSSISIQPN NSTEKPFTIT VDSSLPQGVY TGNVYIKEQG KKEETRIPFT FSIDPKDYKR IDGLEIINST FSPNGDQILD DNLINYYLVA PVDDVTLHAN LVTKEHVTYQ GIIHQAKNET AGYKPFKWNG TKADGTPLAD GLYQIEAVAS NSGGETKQTA AVFLDRTAPK LTYEIDQENL VITGKVDDIL LDWMTESGWV APGIPVRLQY EINGNGVWES AFLNPWEKNY GIYLDRTQLQ EVKNTIHIVA TDAAGNTSNL NVDLDVK
|
| |
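The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, translated, and checked for GC content follows. The helper names are hypothetical. The codon table is the standard assignment, which translation table 11 shares for internal codons; the sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in this record.
print(translate("ATGAAAAAAA CTACATCTAC ACTATTAAGT"))  # MKKTTSTLLS
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be cross-checked in the same way.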