Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK3858 |
Symbol | |
ID | 3026634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 3985690 |
End bp | 3986688 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637548072 |
Product | prolyl aminopeptidase |
Protein accession | YP_085438 |
Protein GI | 52141392 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTTC GTAGCTATAC CCCGCAGTTC TATAATGAAA ATAAACACCC CATTCCAAAT AGTATCGCTA CGATGGAAAG CGTTATGATT AACAACCGAA AACAAACTCT CCTTATTCGC GGGCAAAACG TAGAGCAGCC TATTTTATTA TGCTGTCACG GTGGACCCGG TATGGCACAA ATCGGATTTA TTCGTCATTT TCAAAAAGAA TTGGAGAAAC ACTTCATCGT AATTAATTGG GATCAGCGCG GGGCAGGTAA ATCCTTTTCA ATGAAAGATT TTGGAGCAAA TTTTACAATC GAACAATTCA TTTCCGATGC AAAAGAAGTG ATTCAATATG TACTGAAAAA GTTCAGTAAA CAGAAACTAT TTCTCGCTGG TCATTCTTGG GGCAGCATTA TCGGACTTAA CATAGCACAC CAATATCCAC AATATATCGA GGCTTATATC GGTATTGGGC AAATTGTACA TATGAAACAA AACGAAGAAT TACTATATCA GCATTTAATT CGTTCTGCGA AAAAACATGA TCATAAAAAA GCATTAGCCT CCCTTTTAAA ATTAGGTAAA CCGCCATTTT TAGATACGAG ACGTCTTATT ATTCAAAGAA AGTGGCTTGG CACATTCGGA GGAGCAATCC AAAACGGATC TTCCTTTTCG TTCATACGAA AAGGTTTCTT TTCTCCTGAA TATACGCTAT TAGATTGGTT CAAGTTTCTA GCAGGAAATT TGAAATCTGG CGTTTTATGG GAAGAAATGT TGACAATCGA TTTCTTTTCT TCTATTACGA GTTTATCTAT CCCTGTTTAT TTTTGTTCTG GTCGCTATGA TTATCAAACT CCTTATGCAC TTGTTCAGGA ATATTGTGAT GTGATAGAAG CACCCATAAA AAAGATGATT TGGTTCCCAA ATTCAGCACA TTCTCCTGAT TTAGAAGAAC CAGAATTATT CGCTCATTCT TTACAATCAA TTAAACAAGA GCTAGCTTTT CAGCATTAA
|
Protein sequence | MLFRSYTPQF YNENKHPIPN SIATMESVMI NNRKQTLLIR GQNVEQPILL CCHGGPGMAQ IGFIRHFQKE LEKHFIVINW DQRGAGKSFS MKDFGANFTI EQFISDAKEV IQYVLKKFSK QKLFLAGHSW GSIIGLNIAH QYPQYIEAYI GIGQIVHMKQ NEELLYQHLI RSAKKHDHKK ALASLLKLGK PPFLDTRRLI IQRKWLGTFG GAIQNGSSFS FIRKGFFSPE YTLLDWFKFL AGNLKSGVLW EEMLTIDFFS SITSLSIPVY FCSGRYDYQT PYALVQEYCD VIEAPIKKMI WFPNSAHSPD LEEPELFAHS LQSIKQELAF QH
|
| |