Gene BCAH820_4218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4218 
SymbolpepQ1 
ID7188023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4023796 
End bp4024857 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content39% 
IMG OID643557629 
ProductX-Pro dipeptidase 
Protein accessionYP_002453168 
Protein GI218905334 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones141 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTATTTTG 
TTAACAAATG AACATAGTCG TAGATATATG GCTAACTTCA CAGGAACAGC TGGTGTTGTA
CTGATTTCGA AAAAACGCGC CCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT
AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA
AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT
AGTTCTTATT CAGCTCATAA AGAAGCGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT
GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA
CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA
ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG
TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA
AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GCGCTTATTA CAAAGGATAT
TGCTCTGATA TTACTCGTAC GATTGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT
TATAATATCG TTTTAGAAGC ACAACTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT
GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC
TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA
TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT
ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT
GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
 
Protein sequence
MEKIERLRSA FDEAGIDGIL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS 
KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEAI DAEFIPTSGL
VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS
FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI
YNIVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA
FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL