Gene BCZK5056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK5056 
Symbol 
ID3022987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp5168746 
End bp5170146 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content39% 
IMG OID637549289 
Productaminopeptidase 
Protein accessionYP_086626 
Protein GI52140205 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.418047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT CTTTGAAACA AAAAATAGTA AGCTCCTTGC TTGCTGTATC ACTCGCTGTT 
AGCTTAGCTC CGATTGGACA AGCTAAAGCT GATTCCACGT CAGAAATCAA GCAGACTTCA
TCTATCACAA AACAAGTTGA TGCAAGCCGC GCTATCGAAC ACATCCGTTT CTTATCCGAA
ACAATTGGTC CTCGACCTGG CGGGACAAAA TCAGAAGAAT GGGCTTCCCG CTACGTTGGT
ATGCAGCTTA AATCAATGGG CTACGAAGTA GAATATCAAC CATTCCAAGT GCCGGATCAA
TACGTTGGAT TTATTGAATC ACCATTATCC ACAAAGCGTA ATTGGCAAGC TGGTGCTGCC
CCTAACGCAC TAATTTCTAC AGAATCTGTT ACAGCTCCTC TTATCTTTGT TCAAGGTGGG
ACAAAATTAG AGGATATCCC AAATGAAGTA AATGGAAAAA TTGTTCTATT CGAAAGAGGA
ACAACAGTAG CTGACTATAA TAAACAAGTT GAAAATGCTG TTAGCAAAGG AGCAAAAGGT
GTTCTTTTAT ACAGTTTAAT TGGTGGACGT GGAAACTACG GACAAACTTT CAATCCCCGC
CTAACGAAAA AGCAATCTAT CCCTGTCTTT GGTCTTGCTT ATGCGCAAGG AAATGCATTT
AAAGAAGAAA TCGCTAAAAA AGGAACAACA ATTCTTTCCC TAAAAGCGAG ACATGAATCT
AATTTAACAT CATTAAACGT CATCGCTAAA AAGAAACCAA AAAACAGTAC AGGTAATGAA
AAAGCTGTCG TTGTAAGTTC ACACTACGAT AGTGTCGTTG GAGCACCTGG AGCAAATGAT
AATGCTTCTG GTACAGGATT AGTATTAGAA TTAGCTCGTG CTTTTCAAAA TGTAGAAACT
GATAAGGAAA TTCGTTTTAT TGCTTTTGGT TCTGAAGAGA CTGGCTTACT TGGCTCCGAT
TATTACGTTA ATAGTTTATC CCAAAAAGAA CGCGATCGAA TTTTAGGTGT CTTTAACGCA
GACATGGTCG CAACAAATTA CGATAAAGCA AAGAATCTAT ATGCTATGAC GCCTAACGGT
TCTCCAAACC TTGTAACAGA CGCAGCCTTA CAAGCAGGTA AACAGTTAAA TAATGACCTT
GTACTGCAAG GAAAATTCGG CTCTAGTGAT CACGTCCCAT TTGCTGAAGT TGGCATTCCT
GCCGCTCTAT TTATTTGGAT GGGTGTCGAT AGCTGGAATC CATTAATCTA TCATATCGAG
AAGGTATATC ACACACCTCA AGATAACGTA TTTGAGAACA TTTCACCTGA ACGTATGAAA
ATGGCACTAG AAGTAATCGG AACTGGTGTT TATAACACTC TTCAAAAACC TGTTACGCAA
ACCGAACAGA AAGCTGCTTA A
 
Protein sequence
MKKSLKQKIV SSLLAVSLAV SLAPIGQAKA DSTSEIKQTS SITKQVDASR AIEHIRFLSE 
TIGPRPGGTK SEEWASRYVG MQLKSMGYEV EYQPFQVPDQ YVGFIESPLS TKRNWQAGAA
PNALISTESV TAPLIFVQGG TKLEDIPNEV NGKIVLFERG TTVADYNKQV ENAVSKGAKG
VLLYSLIGGR GNYGQTFNPR LTKKQSIPVF GLAYAQGNAF KEEIAKKGTT ILSLKARHES
NLTSLNVIAK KKPKNSTGNE KAVVVSSHYD SVVGAPGAND NASGTGLVLE LARAFQNVET
DKEIRFIAFG SEETGLLGSD YYVNSLSQKE RDRILGVFNA DMVATNYDKA KNLYAMTPNG
SPNLVTDAAL QAGKQLNNDL VLQGKFGSSD HVPFAEVGIP AALFIWMGVD SWNPLIYHIE
KVYHTPQDNV FENISPERMK MALEVIGTGV YNTLQKPVTQ TEQKAA