Gene BCZK3951 details

Gene Information

Locus tag: BCZK3951
Symbol: pepQ
ID: 3025866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 4072325
End bp: 4073386
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 39%
IMG OID: 637548165
Product: proline dipeptidase, Xaa-Pro dipeptidase
Protein accession: YP_085531
Protein GI: 52141299
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
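
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4072325, 4073386)
print(length)                  # 1062
print(protein_length(length))  # 353
```

Both results match the Gene Length and Protein Length fields of this record.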


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00282404
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTGTTTTG 
TTAACAAATG AACATAGTCG TAGATATATG GCTAATTTCA CAGGAACAGC TGGTGTTGTA
CTGATTTCGA AAAAACGCGC TCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT
AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA
AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT
AGTTCTTATT CAGCTCATAA AGAAGTGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT
GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA
CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA
ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG
TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA
AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GGGCTTATTA CAAAGGATAT
TGCTCTGATA TTACTCGTAC GATCGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT
TATAATGTCG TTTTAGAAGC ACAACTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT
GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC
TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA
TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT
ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT
GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
 
Protein sequence
MEKIERLRSA FDEAGIDGVL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS 
KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEVI DAEFIPTSGL
VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS
FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI
YNVVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA
FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL
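
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration and not the pipeline used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper names `translate` and `gc_content` are hypothetical. Only the first 60 nucleotides of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 nt).
excerpt = "ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTGTTTTG"
print(translate(excerpt))  # MEKIERLRSAFDEAGIDGVL
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.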