Gene BCZK4102 details


Gene Information

Locus tag: BCZK4102
Symbol: phhA
ID: 3026076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 4210360
End bp: 4212114
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 38%
IMG OID: 637548316
Product: phenylalanine 4-monooxygenase
Protein accession: YP_085681
Protein GI: 52141148
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID:
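
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4210360, 4212114)
print(length, protein_length(length))  # 1755 584
```

Both values match the Gene Length and Protein Length fields of this record.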


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TCGTATCCAC ACAGCATTAT 
GATCAATACA CACCGGTGAA CCACGCTGTA TGGCGTTACA TTATGAGACA AAATCATAGT
TTCTTAAAAG ACGTTGCTCA TCCAGCCTAT GTGAATGGAC TACAATCATC TGGTATTAAT
ATAGAGGCAA TCCCAAAGGT AGAAGAAATG AATGAATGTT TGGCGTCAAG CGGCTGGGGC
GCTGTAACGA TTGACGGACT TATTCCTGGC GTCGCATTTT TCGATTTTCA AGGGCACGGA
TTGCTACCAA TCGCAACAGA TATTCGCAAA GTAGAAAACA TTGAGTATAC ACCTGCTCCA
GATATTGTAC ACGAAGCAGC AGGACACGCA CCTATTTTAC TTGATCCTAC ATATGCAAAA
TATGTGAAAC GATTTGGACA AATTGGTGCA AAAGCTTTCT CTACAAAAGA AGAACATGAT
GCATTCGAAG CTGTTCGTAC ATTAACAATT GTAAAGGAAA GCCCTACTTC TACTCCTGAT
GAAGTTACAG CTGCTGAAAA TAATGTACTT GAAAAACAAA AGTTAGTTTC TGGCTTATCA
GAAGCAGAAC AAATTTCACG TCTTTTCTGG TGGACAGTAG AATATGGATT AATCGGAAAT
ATAGATGCTC CAAAAATATA TGGTGCTGGT CTCCTTTCTT CTGTTGGCGA AAGCAAACAT
TGCTTAACAG ACGCTGTAGA AAAGGTTCCA TTCTCTATAG AGACATGTAC AAGTACAACT
TATGACGTAA CAAAAATGCA GCCACAACTA TTTGTTTGCG AATCATTTGA AGAATTAACA
GAAGCACTTG AGAAATTCTC TGAAACGATG GCCTTTAAAA CAGGTGGTAA AGAAGGATTA
GAAAAAGCAA TTCGCTCTGA AAATCATGCA ACTGCTGAAT TAAATAGCGG ATTACAAATT
ACAGGCACAT TTACCGAGAC AATTGAAAAC GATGCAGGTG AATTGATTTA CATGCGAACG
AGCTCGCCAA CAGCATTAGC AATTCACAAT AAACAACTAG CGAATCATTC TACGTCTGTA
CACAGTGACG GGTTCGGAAC ACCAATTGGA TTACTCACTG AAAATATTGC ATTAGAAAAT
TGTACAGATG AACAATTACA ATCATTAGGA ATTACAATCG GAAATAAAGC AGCATTTACT
TTTGCAAGTG GTATTCATGT AAAAGGAACA GTAACAGATA TTGTAAAAAA CGATAAAAAA
ATTGCGCTTA TTTCCTTTAT CAATTGCACA GTTACTTATA ACGACCGCGT TTTATTTGAT
GCTTCATGGG GCTCATTTGA TATGGCTGTT GGTTCAACAA TTACTTCGGT ATTCCCAGGT
GCCGCAGATG CTGCAGCATT TTTCCCAATG GATGAAGAAA TCCAAGAAAT TCCTGCTCCA
CTTGTACTGA ATGAGCTTGA ACGTATGTAT CAAACAGTTC GCGATATCCG AAATGAAGGT
ATTTTACACG ACGCGCATAT CGAGCAATTA GTAGCAATTC AAGAAGTATT AAATAAATTC
TATACGAAAG AATGGTTACT TCGCCTCGAA ATATTAGAAT TGCTTTTAGA GCATAACAAA
GGGCACGAAA CATCAGCAGC ATTATTACAA CAACTTTCTA CTTTCACAAC GGATGAAGCT
GTAACACGTC TTATTAACAA TGGTCTTACG CTACTTCCAG TAAAGGATGT GAAAAATGAT
GCTACGATTA ACTGA
 
Protein sequence
MTKKTEIPSH LKPFVSTQHY DQYTPVNHAV WRYIMRQNHS FLKDVAHPAY VNGLQSSGIN 
IEAIPKVEEM NECLASSGWG AVTIDGLIPG VAFFDFQGHG LLPIATDIRK VENIEYTPAP
DIVHEAAGHA PILLDPTYAK YVKRFGQIGA KAFSTKEEHD AFEAVRTLTI VKESPTSTPD
EVTAAENNVL EKQKLVSGLS EAEQISRLFW WTVEYGLIGN IDAPKIYGAG LLSSVGESKH
CLTDAVEKVP FSIETCTSTT YDVTKMQPQL FVCESFEELT EALEKFSETM AFKTGGKEGL
EKAIRSENHA TAELNSGLQI TGTFTETIEN DAGELIYMRT SSPTALAIHN KQLANHSTSV
HSDGFGTPIG LLTENIALEN CTDEQLQSLG ITIGNKAAFT FASGIHVKGT VTDIVKNDKK
IALISFINCT VTYNDRVLFD ASWGSFDMAV GSTITSVFPG AADAAAFFPM DEEIQEIPAP
LVLNELERMY QTVRDIRNEG ILHDAHIEQL VAIQEVLNKF YTKEWLLRLE ILELLLEHNK
GHETSAALLQ QLSTFTTDEA VTRLINNGLT LLPVKDVKND ATIN
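
The gene and protein sequences above are related by translation table 11. The sketch below shows one way to strip the block spacing, measure GC content, and translate codons using only the Python standard library. It is a minimal illustration, not the method used to build this record: the helper names are hypothetical, and only the first and last lines of the gene sequence are included as sample input. Table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; any trailing partial codon is ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TCGTATCCAC ACAGCATTAT"
last_line = "GCTACGATTA ACTGA"

print(translate(first_line))  # MTKKTEIPSHLKPFVSTQHY
print(translate(last_line))   # ATIN*
```

Passing the full 1755 bp gene sequence to translate() should give the 584-residue protein followed by a single '*', and gc_fraction() on the same input can be compared with the GC content field of the record.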