Gene BCAH187_A4742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH187_A4742 
SymbolpepQ2 
ID7077802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH187 
KingdomBacteria 
Replicon accessionNC_011658 
Strand
Start bp4400415 
End bp4401512 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content39% 
IMG OID643453154 
ProductX-Pro dipeptidase 
Protein accessionYP_002340665 
Protein GI217962095 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.928976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGCG 
TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA
AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAACCTA TTTTAATTTG CCCTAAAATG
GAAGAAGGCC AAGCACGTAA CGCTGGCTGG GCACATGAAA TTATCGGATT TACTGATACT
GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGTATCAA TGCAAATGCA
GTTGCAATTG AAAAAGAACA TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAATTATTC
CCAAATGCAG CTTTCACGTC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAC
GAAAAAGAAC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT
GGTGTAAATG CAATTAAAGA AGATCGTAGC GAACTAGAAG TATTAGCAAT TATTGAACAC
GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA
AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAAACGCGG CGATTTCGTA
CTATTTGATT TAGGCGTAAT CATTGATGGC TATTGCTCTG ACATTACACG TACAGTAGCA
TTCGGCGAGA TTTCTGAAGA ACAAACTCGT ATTTACAACA CTGTACTTGC TGGACAACTA
CAAGCAGTTG AAGCATGTAA ACCAGGTGTT ACGCTTGGCA CAATCGACAA CGCTGCTCGT
TCTGTTATCG CAGATGCAGG TTACGGCGAC TTCTTCCCGC ACCGTCTTGG TCACGGACTT
GGAATTAGCG TACATGAATA TCCAGATGTA AAAGCTGGCA ACGAATCTCC ATTAAAAGAA
GGTATGGTCT TCACAATCGA GCCAGGTATT TACGTACCAA ACGTAGGTGG TGTTCGTATT
GAAGATGATA TTTATATCAC AAAAGACGGA TCAGAAATTT TAACGAAATT CCCGAAAGAA
TTACAATTTG TAAAATAA
 
Protein sequence
MNARLENLMQ WLKEKNVEAA FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM 
EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGINANA VAIEKEHLNV ERYEELTKLF
PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKEDRS ELEVLAIIEH
ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMKRGDFV LFDLGVIIDG YCSDITRTVA
FGEISEEQTR IYNTVLAGQL QAVEACKPGV TLGTIDNAAR SVIADAGYGD FFPHRLGHGL
GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE
LQFVK