Gene Information |
Locus tag | BCZK3951 |
Symbol | pepQ |
ID | 3025866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 4072325 |
End bp | 4073386 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637548165 |
Product | proline dipeptidase, Xaa-Pro dipeptidase |
Protein accession | YP_085531 |
Protein GI | 52141299 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
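The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4072325, 4073386)
print(length, protein_length(length))  # prints "1062 353"
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields listed in the record.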
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00282404 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTGTTTTG TTAACAAATG AACATAGTCG TAGATATATG GCTAATTTCA CAGGAACAGC TGGTGTTGTA CTGATTTCGA AAAAACGCGC TCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT AGTTCTTATT CAGCTCATAA AGAAGTGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GGGCTTATTA CAAAGGATAT TGCTCTGATA TTACTCGTAC GATCGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT TATAATGTCG TTTTAGAAGC ACAACTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
Protein sequence | MEKIERLRSA FDEAGIDGVL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEVI DAEFIPTSGL VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI YNVVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL
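The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how the blocks might be joined and how a GC fraction and a terminal stop codon could be checked. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a short demo.
demo = clean_sequence("ATGGAGAAAA TCGAAAGATT")
print(demo, gc_fraction(demo))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.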