Gene BCG9842_B0759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B0759 
Symbol 
ID7186478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp4303854 
End bp4305608 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content38% 
IMG OID643552268 
Productphenylalanine 4-monooxygenase 
Protein accessionYP_002447937 
Protein GI218899526 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3186] Phenylalanine-4-hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TTGTATCCAC ACAACATTAT 
GATCAATACA CACCGGTGAA TCACGCTGTG TGGCGTTATA TTATGAGACA AAATCATAGC
TTCTTAAAAG ACGTTGCTCA TCCAGCCTAT GTGAACGGAC TACAATCATC TGGTATTAAT
ATAGATGCAA TTCCAAAAGT AGAAGAAATG AATGAGTGTT TGGCACCAAG CGGCTGGGGC
GCTGTAACAA TTGACGGACT TATTCCTGGC GTAGCATTCT TCGATTTTCA AGGACACGGA
TTACTACCGA TTGCAACAGA TATCCGGAAA GTAGAAAATA TCGAGTACAC ACCAGCTCCA
GATATCGTAC ACGAAGCCGC AGGACACGCA CCGATTTTAC TTGATCCTAC ATATGCAAAA
TATGTGAAAC GTTTTGGGCA AATTGGTGCA AAAGCTTTCT CTACAAAAGA AGAACATGAT
GCATTTGAAG CTGTTCGTAC ATTAACGATA GTAAAAGAAA GCCCTACTTC TACTCCTGAT
GAAGTTAAGG CTGCTGAAAA TGCTGTAATT GAAAAACAAA ACTTAGTTTC TGGTTTATCG
GAAGCTGAAC AAATTTCACG TCTTTTCTGG TGGACAGTAG AATATGGATT GATCGGAAAT
ATAGATGATC CAAAGATATA TGGTGCTGGT CTCCTTTCTT CTGTTGGCGA AAGCAAACAT
TGCTTAACAG ACGCTGTAGA AAAGGTTCCA TTTTCTATCG AAGCATGTAC AGGGACAACT
TATGACGTAA CAAAAATGCA ACCACAACTA TTTGTTTGTG AATCCTTTGA AGAATTAACA
GATGCGCTTG AAACATTTTC TAAAACAATG GCCTTTAAAA CAGGTGGAAA AGAAGGTTTA
GAAAAAGCAA TTCGCTCTGA GAACTATGCA ACAGCTGAGC TAAATAGTGG ATTACAAATT
ACAGGTACAT TTAGCGAGAC AATTGAAAAC GATGCAGGTG AATTAATTTA CATGCGAACA
AATTCGCCAA CGGCATTAGC GCTTCATAAT AAACAGTTAG CGAATCATTC TACTTCTGTA
CACAGTGACG GATTTGGAAC ACCGATTGGA TTACTCACTG AAAATATTGC ATTAGAAAAT
TGTACAGACG AACAACTACA ATCATTAGGA ATTACAATTG GAACTATCGC TGAGTTTACT
TTCGCAAGTG GTATTCATGT AAAAGGAACA GTAACAGATA TTGTGAAAAA CGATAAGAAA
ATTGCTCTTA TTTCCTTTAT CGATTGCACA GTTACTTATA ACGCCCGCGT TTTATTTGAT
GCTTCATGGG GCGCATTTGA TATGGCTGTT GGCTCACAAA TCACTTCAGT ATTCCCAGGT
GCCGCAGATG CCGCAGCATT TTTCCCAATG GATGAAGAAG TTCAAGGACT TCCTGCTCCA
CTTGTACTGA ATGAACTTGA ACGTATGTAT CAAACAGTGC GAGATATTCG AAGTGAGGGG
ATTTTACACA ACGCGCATAT CGATCAATTA GTAGCAATTC AAGAAGTATT AAATAAATTT
TATGCGAAAG AATGGTTGCT GCGCCTTGAA ATATTAGAGT TACTTTTAGA GCATAACAAA
GGGCATGAAG CATCGGCAAC ATTACTACAA CAACTTTCTA CTTTCACAAC TGATGAAGCT
GTAACACGCC TTATTAACAA TGGTCTTGCT TTACTTCCAG TAAAGGATGT GAAAAATGAT
GCTAAGATTA ACTGA
 
Protein sequence
MTKKTEIPSH LKPFVSTQHY DQYTPVNHAV WRYIMRQNHS FLKDVAHPAY VNGLQSSGIN 
IDAIPKVEEM NECLAPSGWG AVTIDGLIPG VAFFDFQGHG LLPIATDIRK VENIEYTPAP
DIVHEAAGHA PILLDPTYAK YVKRFGQIGA KAFSTKEEHD AFEAVRTLTI VKESPTSTPD
EVKAAENAVI EKQNLVSGLS EAEQISRLFW WTVEYGLIGN IDDPKIYGAG LLSSVGESKH
CLTDAVEKVP FSIEACTGTT YDVTKMQPQL FVCESFEELT DALETFSKTM AFKTGGKEGL
EKAIRSENYA TAELNSGLQI TGTFSETIEN DAGELIYMRT NSPTALALHN KQLANHSTSV
HSDGFGTPIG LLTENIALEN CTDEQLQSLG ITIGTIAEFT FASGIHVKGT VTDIVKNDKK
IALISFIDCT VTYNARVLFD ASWGAFDMAV GSQITSVFPG AADAAAFFPM DEEVQGLPAP
LVLNELERMY QTVRDIRSEG ILHNAHIDQL VAIQEVLNKF YAKEWLLRLE ILELLLEHNK
GHEASATLLQ QLSTFTTDEA VTRLINNGLA LLPVKDVKND AKIN