Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK4102 |
Symbol | phhA |
ID | 3026076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 4210360 |
End bp | 4212114 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637548316 |
Product | phenylalanine 4-monooxygenase |
Protein accession | YP_085681 |
Protein GI | 52141148 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3186] Phenylalanine-4-hydroxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TCGTATCCAC ACAGCATTAT GATCAATACA CACCGGTGAA CCACGCTGTA TGGCGTTACA TTATGAGACA AAATCATAGT TTCTTAAAAG ACGTTGCTCA TCCAGCCTAT GTGAATGGAC TACAATCATC TGGTATTAAT ATAGAGGCAA TCCCAAAGGT AGAAGAAATG AATGAATGTT TGGCGTCAAG CGGCTGGGGC GCTGTAACGA TTGACGGACT TATTCCTGGC GTCGCATTTT TCGATTTTCA AGGGCACGGA TTGCTACCAA TCGCAACAGA TATTCGCAAA GTAGAAAACA TTGAGTATAC ACCTGCTCCA GATATTGTAC ACGAAGCAGC AGGACACGCA CCTATTTTAC TTGATCCTAC ATATGCAAAA TATGTGAAAC GATTTGGACA AATTGGTGCA AAAGCTTTCT CTACAAAAGA AGAACATGAT GCATTCGAAG CTGTTCGTAC ATTAACAATT GTAAAGGAAA GCCCTACTTC TACTCCTGAT GAAGTTACAG CTGCTGAAAA TAATGTACTT GAAAAACAAA AGTTAGTTTC TGGCTTATCA GAAGCAGAAC AAATTTCACG TCTTTTCTGG TGGACAGTAG AATATGGATT AATCGGAAAT ATAGATGCTC CAAAAATATA TGGTGCTGGT CTCCTTTCTT CTGTTGGCGA AAGCAAACAT TGCTTAACAG ACGCTGTAGA AAAGGTTCCA TTCTCTATAG AGACATGTAC AAGTACAACT TATGACGTAA CAAAAATGCA GCCACAACTA TTTGTTTGCG AATCATTTGA AGAATTAACA GAAGCACTTG AGAAATTCTC TGAAACGATG GCCTTTAAAA CAGGTGGTAA AGAAGGATTA GAAAAAGCAA TTCGCTCTGA AAATCATGCA ACTGCTGAAT TAAATAGCGG ATTACAAATT ACAGGCACAT TTACCGAGAC AATTGAAAAC GATGCAGGTG AATTGATTTA CATGCGAACG AGCTCGCCAA CAGCATTAGC AATTCACAAT AAACAACTAG CGAATCATTC TACGTCTGTA CACAGTGACG GGTTCGGAAC ACCAATTGGA TTACTCACTG AAAATATTGC ATTAGAAAAT TGTACAGATG AACAATTACA ATCATTAGGA ATTACAATCG GAAATAAAGC AGCATTTACT TTTGCAAGTG GTATTCATGT AAAAGGAACA GTAACAGATA TTGTAAAAAA CGATAAAAAA ATTGCGCTTA TTTCCTTTAT CAATTGCACA GTTACTTATA ACGACCGCGT TTTATTTGAT GCTTCATGGG GCTCATTTGA TATGGCTGTT GGTTCAACAA TTACTTCGGT ATTCCCAGGT GCCGCAGATG CTGCAGCATT TTTCCCAATG GATGAAGAAA TCCAAGAAAT TCCTGCTCCA CTTGTACTGA ATGAGCTTGA ACGTATGTAT CAAACAGTTC GCGATATCCG AAATGAAGGT ATTTTACACG ACGCGCATAT CGAGCAATTA GTAGCAATTC AAGAAGTATT AAATAAATTC TATACGAAAG AATGGTTACT TCGCCTCGAA ATATTAGAAT TGCTTTTAGA GCATAACAAA GGGCACGAAA CATCAGCAGC ATTATTACAA CAACTTTCTA CTTTCACAAC GGATGAAGCT GTAACACGTC TTATTAACAA TGGTCTTACG CTACTTCCAG TAAAGGATGT GAAAAATGAT GCTACGATTA ACTGA
|
Protein sequence | MTKKTEIPSH LKPFVSTQHY DQYTPVNHAV WRYIMRQNHS FLKDVAHPAY VNGLQSSGIN IEAIPKVEEM NECLASSGWG AVTIDGLIPG VAFFDFQGHG LLPIATDIRK VENIEYTPAP DIVHEAAGHA PILLDPTYAK YVKRFGQIGA KAFSTKEEHD AFEAVRTLTI VKESPTSTPD EVTAAENNVL EKQKLVSGLS EAEQISRLFW WTVEYGLIGN IDAPKIYGAG LLSSVGESKH CLTDAVEKVP FSIETCTSTT YDVTKMQPQL FVCESFEELT EALEKFSETM AFKTGGKEGL EKAIRSENHA TAELNSGLQI TGTFTETIEN DAGELIYMRT SSPTALAIHN KQLANHSTSV HSDGFGTPIG LLTENIALEN CTDEQLQSLG ITIGNKAAFT FASGIHVKGT VTDIVKNDKK IALISFINCT VTYNDRVLFD ASWGSFDMAV GSTITSVFPG AADAAAFFPM DEEIQEIPAP LVLNELERMY QTVRDIRNEG ILHDAHIEQL VAIQEVLNKF YTKEWLLRLE ILELLLEHNK GHETSAALLQ QLSTFTTDEA VTRLINNGLT LLPVKDVKND ATIN
|
| |