Gene BCZK4355 details

Gene Information

Locus tag: BCZK4355
Symbol: pepQ
ID: 3026915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 4471846
End bp: 4472943
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 39%
IMG OID: 637548571
Product: Xaa-Pro dipeptidase (proline dipeptidase)
Protein accession: YP_085934
Protein GI: 52140895
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
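
The start, end, and length fields above are consistent with each other. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(4471846, 4472943)
print(gene_len, protein_len)  # 1098 365
```

The result matches the listed Gene Length (1098 bp) and Protein Length (365 aa).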


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGTG 
TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA
AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAGCCTA TTTTAATTTG TCCTAAAATG
GAAGAAGGTC AAGCGCGTAA CGCGGGATGG GCACATGAAA TTATCGGATT TACTGATACT
GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGTATCCA TGCAAATGCA
GTTGCAATTG AAAAAGAACA TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAACTATTC
CCAAATGCAG CTTTCACATC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAT
GAAAAAGAGC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT
GGTGTAAATG CAATTAAAGA AGATCGCAGC GAACTAGAAG TATTAGCAAT TATTGAACAC
GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA
AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAAACGCGG CGATTTCATA
CTATTTGATT TAGGCGTAAT CATTGACGGT TATTGCTCTG ACATTACACG TACAGTGGCA
TTTGGCGAGA TTTCTGAAGA ACAAACTCGC ATTTACAACA CTGTACTTGC TGGACAACTA
CAAGCAGTTG AAGCATGTAA ACCAGGTATT ACACTTGGCG CAATCGACAA CGCTGCTCGT
TCTGTTATCG CAGATGCAGG TTATGGTGAC TTTTTCCCGC ACCGCCTTGG TCACGGACTT
GGAATTAGCG TACACGAATA TCCAGATGTA AAAGCTGGTA ACGAGTCTCC ATTAAAAGAA
GGTATGGTCT TCACAATTGA ACCAGGTATT TACGTACCAA ACGTAGGTGG CGTTCGTATT
GAAGATGATA TTTATATCAC AAAAGACGGA TCAGAAATTT TAACGAAGTT CCCGAAAGAA
TTACAATTTG TAAAATAA
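
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of a GC-content calculation on such a block-formatted string. The helper name `gc_percent` is hypothetical, and the database may compute its reported GC content differently (for example with different rounding).

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used as a small example.
print(gc_percent("ATGAATGCTA GATTAGAAAA"))  # 25.0
```

Pasting the full sequence block into `gc_percent` gives the value to compare against the GC content field.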
 
Protein sequence
MNARLENLMQ WLKEKNVEAV FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM 
EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGIHANA VAIEKEHLNV ERYEELTKLF
PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKEDRS ELEVLAIIEH
ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMKRGDFI LFDLGVIIDG YCSDITRTVA
FGEISEEQTR IYNTVLAGQL QAVEACKPGI TLGAIDNAAR SVIADAGYGD FFPHRLGHGL
GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE
LQFVK
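
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and it assumes the sequence contains only A, C, G, and T and starts with ATG, as this gene does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of the
# standard code; table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAATGCTA GATTAGAAAA TTTAATGCAA"))  # MNARLENLMQ
print(translate("TTACAATTTG TAAAATAA"))               # LQFVK
```

Both outputs match the first and last blocks of the protein sequence listed above.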