Gene Information |
Locus tag | BCZK4623 |
Symbol | vpr |
ID | 3023660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 4715098 |
End bp | 4719318 |
Gene Length | 4221 bp |
Protein Length | 1406 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637548835 |
Product | subtilisin-like serine protease |
Protein accession | YP_086198 |
Protein GI | 52140632 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
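The coordinate and length fields above are related by simple arithmetic, and a short check makes that relation explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `span_length` and `coding_codons` are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 4715098
END_BP = 4719318
GENE_LENGTH_BP = 4221
PROTEIN_LENGTH_AA = 1406


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 4221
print(coding_codons(GENE_LENGTH_BP))   # 1406
```

Under these assumptions both computed values agree with the Gene Length and Protein Length rows.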
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAATCGTTT TGTTACCGGA ATTGTAACGG CTGGCGTTCT ATTTTCAACT GCCCTTCCGT TCAATGTACT TGCTGAAAAT CCTATTAACC AAATTGACTC TTCAAACGCC CAGTCTTTAC TATCAAAACT TTCTAAAGAA CAGCGCCAGG CATTACAAAC GTTAGATGCC AACCCTGGTT TTACTATCTC GCCAGATATT AATACGAGTA GTTCTGAGCC TGTAAATATC ATTGTAGAAT TTCAAACAGC TCCAGTAAAG ATTAATGAAT TAAAACAAAA AGAAAAAGGG TTACAAGCTG CTCCTAAAAA TATAAAGGCA CGTATTGATC AAGAACATGA GGAATTTCAA AAAGGACTAA AACGTCTTTA TCATTTCAGC CCATCAGTAA AGTCAGGGAA TTTCCAGTCT GTACAAATTA AACGTTCTTA TAAACATGCG ATAAATGGGG TTGCGCTGAC ATTACCTGCA AATACAGTAG AAGAATTGTT GCGGATTGGT GTTGTAAAAC GTATTTGGAA AGATTACGAA GTGAAATTAA ATTTACCAAA ACAATCTGAG CAAAAAGTAC CTCAAAAACT AACAGATAGC ATTCCGCAAA TTGGGGTCGA TAAGTTACAT AGCGAAGGTA TTACCGGAAA AGGAATTAAA GTTGGTGTAT TAGATACAGG TATTGATTAC AATCATCCTG ACTTAAAAGA CGTGTATAAA GGCTATCGCG CAAAACCTGG CGAAGATTCT AGCAAGATAG ATCCAAACTC CGTAAAAGGT TGGGACTTTA TTAGTAACGA CGCAGATCCG ATGGAAACAA CTTATACAGA ATGGCAACAA TCTGGTGCTC CTGAATTTGA TGATCGTGGT TCTTCCTTCT ACACAGCCCA CGGTACTCAC GTAGCAGGTA TCGTTTCTGC TCAAAAGAAA AACAAATCAG ATTCCGCAGT AAAAGGTGTT GCACCTGACA TTGAACTATA TAACTATCGT GTACTCGGTC CATACGGAAG TGGAGCTAGT TCAGGTATTA TTGCTGCAAT CGATAAATCT ATTTCTGACG GTATGAACGT CATTAACTTA TCGTTAGGTG ATGAGAGCAA CAATCCGCTT GATCCAACGT CTATCGCTGT TAACAACGCA ATGCTTTCTG GTGTCGTTAC AGTCGTTGCC GCTGGTAACT CTGGACCAAA TCCAGCGACA CTCGGATCAC CAGGTGCTTC TCCCTTTGCG ATTACGGTTG GAGCTAGTGA TAGCTCTATT TCCTTACCGA AACTATCAAG TCATGCTGGA CAAGTACAAT TCCCTAACTT GATTTTATTT GGCAAAAACT TCACTGATAA AATAGAAGAT TTCAAGGGAC AAACGTTATC TATCGAATCG GTAGGAATCG GTACGCCTGA TGAATTTAGT AAAAAAGATG TAAAAGGAAA AATCGCTCTC GTTGCACGCG GTACACTCTC ATTTGATGAA AAAATCGCAA ATGCAAAACA AGCTGGTGCA AAAGCCGTTA TTATTTATAA CAATGTGGAT GGAGAGATTC CTTTCTATGT TGGCGAAAGT ACAAAATACA TTCCAGCATT CCGCCTAACG AAAGAAGACG GTGAAAAACT AAAAGCTCAA ATCGAACAAG GTAATACATC CTTTACTTTT GACGAACTTA GTTATATTCA AACAGAAGGC GATCATCTTG CAGACTTTAG CTCACGCGGC CCTGTTACAT CAAATGATGA TATTAAACCT GATATTACAG CTCCTGGTGT TGCTGTCCTT TCAACAGTCC CTGAATATAT TAACGACCCA 
CAAGAGGGTG AAAACTATGA CGTTTCCTAT GAGCGTATGC AAGGAACATC CATGGCTTCT CCACATATCG CTGGCGTTGC TGCTCTTATT TTACAAGAAC ATCCGGAATA CTCTCCATTC GATGTAAAAG CGTCTCTTAT GAACACAGCT GATGATTTAA AAGAAAAGTA TTCTGTATAC GAAGTTGGAG CTGGGCGAGT AGATGCTTAT AATTCTGTTC ATACGGAAAC AAGTTTTAAA GTGTTAGATA CAACAAAAAC AGTTGTAAAT GATGAAGTAA TTGAAGTACC AGAAGAAACT GGCTCTATTG CATTCGGGAA GTTTTATCAA AAAGATGGTG AAGCTCTTGA ACAAAAACGA AATATAAAAG TTGAAAACCA TAATAAACAA GAGAAGAAAG AATTTAAAAC AGAAATTTCT TATACACCAG CTTCTTCTAT TAACGATGCT ATCGCAAATG GTGTAAAAGT TTCTACACCA GAAACAATTA CTCTTGATGC TGGCAAATCA GATGAAATTG AAGCAAAAGT TAATGTCCCA GCCGGTGTGA AACAAGGCCG ATACGAAGGA TATATTCACA TTACAAATAC GAAAAATAAA GAAGAAACGT ATCAAATTCC ATTCTCTATT CGTGTATCAG AACCTGGTAT TGAAGACGCT GTTCTATCTA GAAAATCTAT TTCAACGGAT ACAACTAAAT TTAATCCTTA CCGCGAATCA TACGTGCACG GGGCATTCCA ATTAAACAGC AATCTGGAAA CGTTAGACCT TATCGTGAAA GATTCAAAAA CAAATAAAGC GCTTGGCTTC CTTGGGACGT TAAACACGGG TGGACTGCAA ACGGATGTTT ATTACTACTT AGATGCATTC TTTAGCGGAA AAGTATATCC TTTTACAAAT GATCCTGCAA AACCAATCGG TGATGAGAAA ATTAATTTAC CAGAAGGCGA TTACACAATT GAATTTGTTG GCTATGATAA AGATGGGAAA GCACGTGTGA AAGGTGATTA TGTCATCATC GATAACACAC CGCCAGAAGT GAAATTAACT GGATTAAAAC CTGGTATTTA CGAGTTAAAT GAAGAAAATT ATACAGTAGA AGATGGTAAA AAAGCACTCT GGATTAAAGG AAATGTGTAT GATTCAAACG TTGACTACTT AAAAGGAAAA GGGTTAAACA TTACGCAAGA AGCAAATGGA GTTATGTATT ATGACTATTC TCCATACTTG CATAAGTTTT TACCTATTGA TGCAAATGGT GACTTTAAAG TTGGCATTAC TCCTGAATCC TTTAAAGTGG AAGGTCCAAT GAACACAGGC GTGTATATTT TCGATTATGC AACTGCAGGT CCTGACTATC CAAGCGGTGT AAATAATTAT TGGTTTATCC AGCAAGGAGC TCAATATGCA AAGGCTAATT ATGATAAAAA AGAATTATAT AAAGGGGACG AGTTCACAGT AACCCTAAAC GCGAAACACG TAAAACAATT CGTTGCTGGG GAATTTAACG TAAAATTCCT AGAGAAGAAC TTCAAATTTG CAAATGCGAA ATTGAATCCT GCTTTTGAAA AACTCCTTTC TGAAAAAGGG GTAACAGCAA AAGTAAATGA ACCGAAACTA GAAGAAGGTT CTGTTACAGT TGGCGGATCT ATCGATGACA AAAACTTCGC TGGATTAGAT GGTGATTTCC CGTTCATTGA TGTAACGTTC AAAGTAGAAA ATGATGAGTT TTATGAGGCA AATGCTCAAT TAGAGTCACC AGTGTTTGTA TACTGGAAAC CAGGAGAAAC AGAGCCAAAT AGAATGCGTG 
TCCTTCAAGA CCAAACATTC TCAGTTATGT CTTTAAACTC ACTAGTAGAA GGAAATATTA AACCTGGTGC ATTCTTAAAT GAACGTGGCT ATTTAGATGA GAAATTCGAT TATACAAAGC TTGGCGTAAA AGTATATGCA ACTGATTCTT ACCGTCATAA GTTTGAAGGC TCACTAGACA AATACGGATA CTTCAAACTA AATGGACTTC CTGTTAATAA ACGTGACTAT AACTTATTTG TAGAAGTACC AGGGCATTTA ACAAGTCGCC TAACAACAAA ATTAGGTACT GAAAAAGATG GTAAGCTGTT TGGCCAATAT TATTATGCAC GCCTTGATGA AAACCTTGCC GGTGATGTAA ATGGAGATAA AGTAATTGAT ATAAAAGATG CTGAAATTAT CGCTAGCAAC TATGGTAAAA AAGGAGTATC CGTGAAAGAC GGTGACCTAA ACAAAGATGG TATTGTTGAC GAAAAAGATA TTCGCTTCGT TGAAAAGAAC TTCTTGAAAA AAGGTCCCGA TGCATCTAAA TCACAAACAG CTGCTGAAAA ATCAAAAAGC GGTACACTGG CAGATATTTT AAAGAAATTG GGATTAACGC CTAAAAAATA A
|
Protein sequence | MKKKNRFVTG IVTAGVLFST ALPFNVLAEN PINQIDSSNA QSLLSKLSKE QRQALQTLDA NPGFTISPDI NTSSSEPVNI IVEFQTAPVK INELKQKEKG LQAAPKNIKA RIDQEHEEFQ KGLKRLYHFS PSVKSGNFQS VQIKRSYKHA INGVALTLPA NTVEELLRIG VVKRIWKDYE VKLNLPKQSE QKVPQKLTDS IPQIGVDKLH SEGITGKGIK VGVLDTGIDY NHPDLKDVYK GYRAKPGEDS SKIDPNSVKG WDFISNDADP METTYTEWQQ SGAPEFDDRG SSFYTAHGTH VAGIVSAQKK NKSDSAVKGV APDIELYNYR VLGPYGSGAS SGIIAAIDKS ISDGMNVINL SLGDESNNPL DPTSIAVNNA MLSGVVTVVA AGNSGPNPAT LGSPGASPFA ITVGASDSSI SLPKLSSHAG QVQFPNLILF GKNFTDKIED FKGQTLSIES VGIGTPDEFS KKDVKGKIAL VARGTLSFDE KIANAKQAGA KAVIIYNNVD GEIPFYVGES TKYIPAFRLT KEDGEKLKAQ IEQGNTSFTF DELSYIQTEG DHLADFSSRG PVTSNDDIKP DITAPGVAVL STVPEYINDP QEGENYDVSY ERMQGTSMAS PHIAGVAALI LQEHPEYSPF DVKASLMNTA DDLKEKYSVY EVGAGRVDAY NSVHTETSFK VLDTTKTVVN DEVIEVPEET GSIAFGKFYQ KDGEALEQKR NIKVENHNKQ EKKEFKTEIS YTPASSINDA IANGVKVSTP ETITLDAGKS DEIEAKVNVP AGVKQGRYEG YIHITNTKNK EETYQIPFSI RVSEPGIEDA VLSRKSISTD TTKFNPYRES YVHGAFQLNS NLETLDLIVK DSKTNKALGF LGTLNTGGLQ TDVYYYLDAF FSGKVYPFTN DPAKPIGDEK INLPEGDYTI EFVGYDKDGK ARVKGDYVII DNTPPEVKLT GLKPGIYELN EENYTVEDGK KALWIKGNVY DSNVDYLKGK GLNITQEANG VMYYDYSPYL HKFLPIDANG DFKVGITPES FKVEGPMNTG VYIFDYATAG PDYPSGVNNY WFIQQGAQYA KANYDKKELY KGDEFTVTLN AKHVKQFVAG EFNVKFLEKN FKFANAKLNP AFEKLLSEKG VTAKVNEPKL EEGSVTVGGS IDDKNFAGLD GDFPFIDVTF KVENDEFYEA NAQLESPVFV YWKPGETEPN RMRVLQDQTF SVMSLNSLVE GNIKPGAFLN ERGYLDEKFD YTKLGVKVYA TDSYRHKFEG SLDKYGYFKL NGLPVNKRDY NLFVEVPGHL TSRLTTKLGT EKDGKLFGQY YYARLDENLA GDVNGDKVID IKDAEIIASN YGKKGVSVKD GDLNKDGIVD EKDIRFVEKN FLKKGPDASK SQTAAEKSKS GTLADILKKL GLTPKK
|
| |
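The Gene sequence and Protein sequence rows can be related with a small translation routine. The sketch below is illustrative only: it uses the first 30 nucleotides of the gene sequence as listed in the table, and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It assumes the amino-acid assignments of translation table 11 in the usual compact TCAG ordering, and it does not special-case alternative start codons. The GC fraction of this short fragment is not expected to match the 37% reported for the whole gene.

```python
from itertools import product

# Codon table in TCAG order (assumed amino-acid assignments for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First three 10-base blocks of the gene sequence, as printed in the table.
FRAGMENT = "ATGAAAAAGA AAAATCGTTT TGTTACCGGA"


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate(FRAGMENT))  # MKKKNRFVTG
```

The translated fragment matches the first ten residues of the Protein sequence row. Applying the same functions to the full gene sequence would require joining the wrapped sequence lines first.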