Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK4355 |
Symbol | pepQ |
ID | 3026915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 4471846 |
End bp | 4472943 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637548571 |
Product | Xaa-Pro dipeptidase (proline dipeptidase) |
Protein accession | YP_085934 |
Protein GI | 52140895 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGTG TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAGCCTA TTTTAATTTG TCCTAAAATG GAAGAAGGTC AAGCGCGTAA CGCGGGATGG GCACATGAAA TTATCGGATT TACTGATACT GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGTATCCA TGCAAATGCA GTTGCAATTG AAAAAGAACA TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAACTATTC CCAAATGCAG CTTTCACATC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAT GAAAAAGAGC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT GGTGTAAATG CAATTAAAGA AGATCGCAGC GAACTAGAAG TATTAGCAAT TATTGAACAC GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAAACGCGG CGATTTCATA CTATTTGATT TAGGCGTAAT CATTGACGGT TATTGCTCTG ACATTACACG TACAGTGGCA TTTGGCGAGA TTTCTGAAGA ACAAACTCGC ATTTACAACA CTGTACTTGC TGGACAACTA CAAGCAGTTG AAGCATGTAA ACCAGGTATT ACACTTGGCG CAATCGACAA CGCTGCTCGT TCTGTTATCG CAGATGCAGG TTATGGTGAC TTTTTCCCGC ACCGCCTTGG TCACGGACTT GGAATTAGCG TACACGAATA TCCAGATGTA AAAGCTGGTA ACGAGTCTCC ATTAAAAGAA GGTATGGTCT TCACAATTGA ACCAGGTATT TACGTACCAA ACGTAGGTGG CGTTCGTATT GAAGATGATA TTTATATCAC AAAAGACGGA TCAGAAATTT TAACGAAGTT CCCGAAAGAA TTACAATTTG TAAAATAA
|
Protein sequence | MNARLENLMQ WLKEKNVEAV FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGIHANA VAIEKEHLNV ERYEELTKLF PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKEDRS ELEVLAIIEH ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMKRGDFI LFDLGVIIDG YCSDITRTVA FGEISEEQTR IYNTVLAGQL QAVEACKPGI TLGAIDNAAR SVIADAGYGD FFPHRLGHGL GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE LQFVK
|
| |