Gene BAS4253 details

Gene Information

Locus tag: BAS4253
Symbol: phhA
ID: 2853174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4167276
End bp: 4169030
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 38%
IMG OID: 637507489
Product: phenylalanine 4-monooxygenase
Protein accession: YP_030501
Protein GI: 49187249
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID:
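
The coordinate and length fields above are related by simple arithmetic. The short Python sketch below shows that relation. It assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that adds no residue. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4167276
end_bp = 4169030

# Inclusive span: both the start and the end base are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1755 584
```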


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAGA AAACAGAAAT TCCATCGCAT TTAAAACCAT TCGTATCCAC ACAGCATTAT 
GATCAATACA CACCGGTGAA CCACGCTGTA TGGCGTTACA TTATGAGACA AAATCATAGT
TTCTTAAAAG ACGTTGCCCA TCCAGCCTAT GTGAATGGAC TACAATCATC TGGTATTAAT
ATAGAGGCAA TCCCAAAGGT AGAAGAAATG AATGAATGTT TGGCGTCAAG CGGCTGGGGC
GCTGTAACGA TTGACGGACT TATTCCTGGC GTCGCATTTT TCGATTTTCA AGGGCACGGA
TTGCTACCAA TCGCAACAGA TATTCGCAAA GTAGAAAACA TCGAGTATAC ACCTGCTCCA
GATATTGTAC ACGAAGCAGC AGGACACGCA CCTATTTTAC TTGATCCTAC ATATGCAAAA
TATGTAAAAC GATTTGGGCA AATTGGTGCA AAAGCTTTTT CTACAAAAGA AGAACATGAT
GCATTCGAGG CTGTTCGTAC ATTAACGATT GTAAAAGAAA GCCCTACTTC TACTCCTGAT
GAAGTTACAG CTGCTGAAAA TAATGTAATT GAAAAACAAA ACTTAGTTTC TGGCTTATCA
GAAGCGGAAC AAATTTCACG TCTTTTCTGG TGGACAGTGG AATACGGATT GATTGGAGAT
ATAGACAATC CAAAAATATA TGGTGCTGGT CTCCTTTCTT CTGTTGGCGA AAGCAAACAT
TGCTTAACAG ACGCTGTAGA AAAGGTTCCA TTCTCTATAG AGGCATGTAC AAGTACAACT
TATGACGTAA CAAAAATGCA ACCACAACTA TTTGTTTGTA AATCATTTGA AGAATTAACA
GAAGCACTTG AGAAATTTGC TGAAACGATG GCCTTTAAAA CAGGTGGTAA AGAAGGCTTA
GAAAAAGCAA TTCGCTCTGA AAACCATGCA ACTGCTGAAT TAAATAGCGG ATTACAAATT
ACAGGTACAT TCACCGAGAC AATTGAAAAC GATGCGGGTG AATTGATTTA TATGCGAACA
AGTTCGCCAA CAGCATTAGC CATTCACAAT AAAGAGCTAG CGAATCATTC TACGGCTGTA
CACAGTGACG GCTTTGGAAC ACCGATTGGA TTACTCACTG AAAATATTGC ATTAGAAAAT
TGTACAGATG AACAACTACA AGCATTAGGA ATTACAATTG GAAATATTGC TGAGTTTACT
TTTGAAAGTG ATATTCATGT AAAAGGAACA GTAACAGATA TTGTAAAAAA CGATAATAAA
ATCGCTCTTA TTTCCTTTAT CAATTGTACA GTTACTTATA ACGACCGCGT TTTATTTGAT
GCTTCATGGG GCGCATTTGA TATGGCTGTT GGTTCAACAA TCACTTCCGT ATTCCCAGGT
GCCGCAGATG CTGCAGCATT TTTCCCAATG GATGAAGAAA TTCAAGAAAT TCCTGCTCCA
CTTGTACTGA ATGAACTTGA ACGTATGTAT CAAACAGTTC GTGATATCCG AAATGAAGGT
ATTTTACACG ACGCACATAT CGAGCAATTA GTAGCAATTC AAGAGGTATT AAATAAATTC
TATACGAAAG AATGGTTGCT TCGCCTTGAA ATATTAGAAT TGCTTTTAGA ACATAACAAA
GGGCACGAAA CATCAGCAGC ATTATTACAA CAACTTTCTA CTTTCACAAC TGATGAAGCT
GTAACACGCC TTATTAACAA TGGTCTTACG CTACTTCCAG TAAAGGGTGT GAAAAATGAT
GCTACGATTA ACTGA
 
Protein sequence
MTKKTEIPSH LKPFVSTQHY DQYTPVNHAV WRYIMRQNHS FLKDVAHPAY VNGLQSSGIN 
IEAIPKVEEM NECLASSGWG AVTIDGLIPG VAFFDFQGHG LLPIATDIRK VENIEYTPAP
DIVHEAAGHA PILLDPTYAK YVKRFGQIGA KAFSTKEEHD AFEAVRTLTI VKESPTSTPD
EVTAAENNVI EKQNLVSGLS EAEQISRLFW WTVEYGLIGD IDNPKIYGAG LLSSVGESKH
CLTDAVEKVP FSIEACTSTT YDVTKMQPQL FVCKSFEELT EALEKFAETM AFKTGGKEGL
EKAIRSENHA TAELNSGLQI TGTFTETIEN DAGELIYMRT SSPTALAIHN KELANHSTAV
HSDGFGTPIG LLTENIALEN CTDEQLQALG ITIGNIAEFT FESDIHVKGT VTDIVKNDNK
IALISFINCT VTYNDRVLFD ASWGAFDMAV GSTITSVFPG AADAAAFFPM DEEIQEIPAP
LVLNELERMY QTVRDIRNEG ILHDAHIEQL VAIQEVLNKF YTKEWLLRLE ILELLLEHNK
GHETSAALLQ QLSTFTTDEA VTRLINNGLT LLPVKGVKND ATIN