Gene BCZK4101 details


Gene Information

Locus tag: BCZK4101
Symbol: vpr
ID: 3026087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 4206717
End bp: 4209470
Gene Length: 2754 bp
Protein Length: 917 aa
Translation table: 11
GC content: 37%
IMG OID: 637548315
Product: minor extracellular protease
Protein accession: YP_085680
Protein GI: 52141149
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
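
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(4206717, 4209470)
print(length)                     # 2754
print(protein_length_aa(length))  # 917
```

Both values agree with the Gene Length and Protein Length fields of the record.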


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CTACATCTAC ACTATTAAGT ATGGCGCTCG TCTTTTCCAG TTTTGGAGCT 
TTAAGCGCAC ATGCTGAATC GCTGCAAAAG GAGAAACAGT TTAGTCCACA GTTAAAAGCG
AACATTGAAC AATGGGGAGA AAACAAAATT GCGCAAAATG TTGAAACAAA AACATCAAAA
GAAATTTCTG TCATTGTAGA ATTGCAACAT GCTCCACTCG CTTCACAAAG TAACATTCAG
CATGCTCCAG ATTTACAAAA TAATAATGCA CAGTCTTATC ATACCGAGCT TAAAAAAGCA
CAAGAAGATA CGACTAAGAA AATAAAAGAA AAAGCACCTG GTGCAAAAAT TAAAGAAGTG
TATAATACGT TATTTTCTGG ATTCTCTATT TCAGTTCCTG GAGATCAAAT TACCGCTCTT
GCTTCTTTAC CCGAAGTAAA AGCAGTCTAC CCGAACTTAA CATATAAATT ACATGAAACA
TCAAAAAGCA CTACTAATGA AGAAGCACCA AATATCGGCG GACCGACAAT TGGTGCAACT
GAAGCATGGA ATTTAAAAGA CCCATCTGGC AAACCGCTTG ATGGAAAAGG TATGAAAGTA
GCGATTATCG ACTCTGGTGT AGACTATACA CATCCTGACT TAAAAGCAAA TTATATCGGT
GGATATGACA CAGTTGATGA AGATAACGAT CCAATGGATG GTAACGTACA TGGTACCCAT
GTAGCTGGAA TTATTGCTGG TAACGGAAAA ATTAAAGGCG TTGCTCCAAA TGCTTCTATT
CTAGCCTATC GTGTAATGAA TGATGGTGGA ACTGGTACAA CAGAAGATAT TATTCAAGGT
ATTGAACGAG CAATCCAAGA CGGTGCTGAC GTTTTAAATT TATCTCTTGG ACAAAGCTTA
AATACACCTG ATCAGCCTGT AACATTAACA TTAGAACGTG CAGCAAAACT TGGGGTTACT
GCAGTTGTTT CAAATGGAAA TGATGGCCCA CATCCTTGGT CTGTTGATGC ACCTGGAAAT
GCAAGCAGCG TTATATCAGT TGGAGCATCT ACTGTTTCTA TCCCGTTTCC AACGTTCCAA
GTAGCTGGTT CCAGCAAAAC ATATCAAGGG TTACCGTTAT CAAAGTCCGA TTTCCCAATA
GGAAATGATT CCCCTCTTGT ATACGTTGGC TATGGTAATC CAAGTGATTA TGCAAAACAA
GATGTGAAAG GGAAATTTGC ACTTGTTTTA CAAGGTACTT CTAGTACGTT AGTAAAAGCA
GAACAAGCGA AACAAGCTGG TGCAGTTGGT GTACTATTCA TTTCTACAGA AAAAGAAATC
AATATTATGC CAGAATATTT TATGCGTGAA AATCTAGCCC TTCCAGTTAT GCAATTATCA
AATGTACACG GTGAAGAGTT GAAAAACTTA ATTACAAAGC GTAAGAAAAA TATAAAAATT
GGCCAACCAA ATCCAACCGA ATTGATTGCT AACTTCAGTT CCAGAGGTCC GTCACAAGGA
AGTTGGCTTA TAAAACCAGA TATAATTGCA CCTGGCGTAC AAATTATGAG TACAGTACCG
AGAGGCGGCT ATGAATCTCA TAATGGTACA AGTATGGCTG CTCCACAAGT AGCTGGAGCG
GTTGCCCTCT TGCGTCAAAT GCATCCTGAT TGGACGACAG AGCAATTAAA AGCATCCCTT
GCCAATACCG CAAAAATTTT AAAAGATGTG AATGAAAATA CATATCCTAT TATGACACAA
GGATCTGGTT TAATTAACAT CCCGAAAGCA GCTCAAACAG ATGTATTAGT CAAACCTAAC
AATGTGAGCT TCGGTCTTAT TAAGCCAAAC AGTGGTAAAG TAAAACTGAC GCAAAATATT
ACGTTACAAA ATCTATCTAG TAAAAAGAAA AGTTTTTCAA CTCGTGTGGA ATTGCTAGAT
ACAAACACAA AAACAAAAGT AAAAACTTCT GTACCTTCAT CGATTAGCAT ACAGCCGAAT
AATAGTACCG AAAAACCATT TACTATCACT GTCGATAGTT CGCTACCCCA AGGTGTCTAT
ACTGGAAATG TGTATATAAA AGAACAAGGT AAGAAGGAAG AAACTCGCAT TCCATTTACA
TTTAGTATCG ATCCTAAAGA TTACAAACGT ATAGATGGAC TCGAAATTAT TAATTCTACA
TTTAGTCCAA ATGGTGACCA AATATTAGAC GATAATCTCA TCAACTACTA TTTAGTTGCA
CCTGTAGATG ATGTAACATT GCATGCAAAT TTAGTTACAA AAGAACATGT AACGTACCAA
GGGATTATCC ATCAAGCTAA AAATGAAACA GCTGGATACA AACCTTTCAA ATGGAATGGT
ACAAAAGCAG ATGGTACTCC GTTAGCTGAC GGTCTATACC AAATCGAAGC AGTTGCCTCT
AATTCTGGTG GGGAAACAAA ACAAACAGCT GCTGTATTTC TTGATAGAAC TGCACCTAAG
TTAACATACG AAATTGACCA AGAAAACCTT GTAATCACAG GAAAAGTGGA TGATATCTTA
CTAGATTGGA TGACAGAATC TGGTTGGGTA GCACCTGGAA TTCCAGTGAG ACTACAGTAT
GAAATCAACG GAAATGGTGT ATGGGAAAGT GCATTCCTAA ATCCTTGGGA GAAAAATTAC
GGCATTTATT TAGATCGTAC TCAATTACAA GAAGTTAAAA ATACAATTCA CATTGTAGCA
ACAGATGCAG CTGGAAATAC ATCTAATTTA AATGTTGATT TAGATGTGAA ATAA
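
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAAAAAAA CTACATCTAC ACTATTAAGT ATGGCGCTCG TCTTTTCCAG TTTTGGAGCT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base fragment
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the reported GC content.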
 
Protein sequence
MKKTTSTLLS MALVFSSFGA LSAHAESLQK EKQFSPQLKA NIEQWGENKI AQNVETKTSK 
EISVIVELQH APLASQSNIQ HAPDLQNNNA QSYHTELKKA QEDTTKKIKE KAPGAKIKEV
YNTLFSGFSI SVPGDQITAL ASLPEVKAVY PNLTYKLHET SKSTTNEEAP NIGGPTIGAT
EAWNLKDPSG KPLDGKGMKV AIIDSGVDYT HPDLKANYIG GYDTVDEDND PMDGNVHGTH
VAGIIAGNGK IKGVAPNASI LAYRVMNDGG TGTTEDIIQG IERAIQDGAD VLNLSLGQSL
NTPDQPVTLT LERAAKLGVT AVVSNGNDGP HPWSVDAPGN ASSVISVGAS TVSIPFPTFQ
VAGSSKTYQG LPLSKSDFPI GNDSPLVYVG YGNPSDYAKQ DVKGKFALVL QGTSSTLVKA
EQAKQAGAVG VLFISTEKEI NIMPEYFMRE NLALPVMQLS NVHGEELKNL ITKRKKNIKI
GQPNPTELIA NFSSRGPSQG SWLIKPDIIA PGVQIMSTVP RGGYESHNGT SMAAPQVAGA
VALLRQMHPD WTTEQLKASL ANTAKILKDV NENTYPIMTQ GSGLINIPKA AQTDVLVKPN
NVSFGLIKPN SGKVKLTQNI TLQNLSSKKK SFSTRVELLD TNTKTKVKTS VPSSISIQPN
NSTEKPFTIT VDSSLPQGVY TGNVYIKEQG KKEETRIPFT FSIDPKDYKR IDGLEIINST
FSPNGDQILD DNLINYYLVA PVDDVTLHAN LVTKEHVTYQ GIIHQAKNET AGYKPFKWNG
TKADGTPLAD GLYQIEAVAS NSGGETKQTA AVFLDRTAPK LTYEIDQENL VITGKVDDIL
LDWMTESGWV APGIPVRLQY EINGNGVWES AFLNPWEKNY GIYLDRTQLQ EVKNTIHIVA
TDAAGNTSNL NVDLDVK
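
The protein sequence can be related back to the gene sequence by translating codons. The following is a rough sketch, not the database's own method. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the set of permitted start codons, which this sketch does not handle. The `translate` function and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAAAA CTACATCTAC ACTATTAAGT"))  # MKKTTSTLLS
```

The output matches the first ten residues of the protein sequence listed above.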