Gene BCAH820_4438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4438 
Symbol 
ID7189833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4196834 
End bp4198588 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content38% 
IMG OID643557849 
Productphenylalanine 4-monooxygenase 
Protein accessionYP_002453387 
Protein GI218905553 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3186] Phenylalanine-4-hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones238 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TCGTATCCAC ACAGCATTAT 
GATCAATACA CACCGGTGAA CCACGCTGTA TGGCGTTACA TTATGAGACA AAATCATAGT
TTCTTAAAAG ACGTTGCCCA TCCAGCCTAT GTGAATGGAC TACAATCATC TGGTATTAAT
ATAGAGGCAA TCCCAAAGGT AGAAGAAATG AATGAATGTT TGGCGTCAAG CGGCTGGGGC
GCTGTAACGA TTGACGGACT TATTCCTGGC GTCGCATTTT TCGATTTTCA AGGGCACGGA
TTGCTACCAA TCGCAACAGA TATTCGCAAA GTAGAAAACA TCGAGTATAC ACCTGCTCCA
GATATTGTAC ACGAAGCAGC AGGACACGCA CCTATTTTAC TTGATCCTAC ATATGCAAAA
TATGTAAAAC GATTTGGGCA AATTGGTGCA AAAGCTTTTT CTACAAAAGA AGAACATGAT
GCATTCGAGG CTGTTCGTAC ATTAACGATT GTAAAAGAAA GCCCTACTTC TACTCCTGAT
GAAGTTACAG CTGCTGAAAA TAATGTAATT GAAAAACAAA ACTTAGTTTC TGGCTTATCA
GAAGCGGAAC AAATTTCACG TCTTTTCTGG TGGACAGTGG AATACGGATT AATTGGAGAT
ATAGACAATC CAAAAATATA TGGTGCTGGT CTCCTTTCTT CTGTTGGCGA AAGCAAACAT
TGCTTAACAG ACGCTGTAGA AAAGGTTCCA TTCTCTATAG AGGCATGTAC AAGTACAACT
TATGACGTAA CAAAAATGCA ACCACAACTA TTTGTTTGTA AATCATTTGA AGAATTAACA
GAAGCACTTG AGAAATTTGC TGAAACGATG GCCTTTAAAA CAGGTGGTAA AGAAGGCTTA
GAAAAAGCAA TTCGCTCTGA AAACCATGCA ACTGCTGAAT TAAATAGCGG ATTACAAATT
ACAGGTACAT TCACCGAGAC AATTGAAAAC GATGCGGGTG AATTGATTTA TATGCGAACA
AGTTCGCCAA CAGCATTAGC CATTCACAAT AAAGAGCTAG CGAATCATTC TACGGCTGTA
CACAGTGACG GCTTTGGAAC ACCGATTGGA TTACTCACTG AAAATATTGC ATTAGAAAAT
TGTACAGATG AACAACTACA AGCATTAGGA ATTACAATTG GAAATATTGC TGAGTTTACT
TTTGAAAGTG ATATTCATGT AAAAGGAACA GTAACAGATA TTGTAAAAAA CGATAATAAA
ATCGCTCTTA TTTCCTTTAT CAATTGTACA GTTACTTATA ACGACCGCGT TTTATTTGAT
GCTTCATGGG GCGCATTTGA TATGGCTGTT GGTTCAACAA TCACTTCCGT ATTCCCAGGT
GCCGCAGATG CTGCAGCATT TTTCCCAATG GATGAAGAAA TTCAAGAAAT TCCTGCTCCA
CTTGTACTGA ATGAACTTGA ACGTATGTAT CAAACAGTTC GTGATATCCG AAATGAAGGT
ATTTTACACG ACGCACATAT CGAGCAATTA GTAGCAATTC AAGAGGTATT AAATAAATTC
TATACGAAAG AATGGTTACT TCGCCTTGAA ATATTAGAAT TGCTTTTAGA ACATAACAAA
GGGCACGAAA CATCAGCAGC ATTATTACAA CAACTTTCTA CTTTCACAAC TGATGAAGCT
GTAACACGCC TTATTAACAA TGGTCTTACG CTACTTCCAG TAAAGGGTGT GAAAAATGAT
GCTACGATTA ACTGA
 
Protein sequence
MTKKTEIPSH LKPFVSTQHY DQYTPVNHAV WRYIMRQNHS FLKDVAHPAY VNGLQSSGIN 
IEAIPKVEEM NECLASSGWG AVTIDGLIPG VAFFDFQGHG LLPIATDIRK VENIEYTPAP
DIVHEAAGHA PILLDPTYAK YVKRFGQIGA KAFSTKEEHD AFEAVRTLTI VKESPTSTPD
EVTAAENNVI EKQNLVSGLS EAEQISRLFW WTVEYGLIGD IDNPKIYGAG LLSSVGESKH
CLTDAVEKVP FSIEACTSTT YDVTKMQPQL FVCKSFEELT EALEKFAETM AFKTGGKEGL
EKAIRSENHA TAELNSGLQI TGTFTETIEN DAGELIYMRT SSPTALAIHN KELANHSTAV
HSDGFGTPIG LLTENIALEN CTDEQLQALG ITIGNIAEFT FESDIHVKGT VTDIVKNDNK
IALISFINCT VTYNDRVLFD ASWGAFDMAV GSTITSVFPG AADAAAFFPM DEEIQEIPAP
LVLNELERMY QTVRDIRNEG ILHDAHIEQL VAIQEVLNKF YTKEWLLRLE ILELLLEHNK
GHETSAALLQ QLSTFTTDEA VTRLINNGLT LLPVKGVKND ATIN