Gene Phep_3273 details


Gene Information

Locus tag: Phep_3273
Symbol:
ID: 8254392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3885300
End bp: 3887099
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 44%
IMG OID: 644936926
Product: phenylalanine 4-monooxygenase
Protein accession: YP_003093530
Protein GI: 255533158
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID:
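
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3885300, 3887099)
print(length, protein_length(length))  # prints: 1800 599
```

Under these assumptions the results agree with the Gene Length (1800 bp) and Protein Length (599 aa) fields in the record.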


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.90398
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.367732
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCAA TGAATGATTT TAATGACTTT AACAATAAGC AGGTAGCTAA TCTGCCAAGG 
CATTTACGAC AATTTATTGT AGAACAGCAT TACGAAAAGT ATACGCCGGT AGACCAGGCC
GTATGGCGTT ATGTGATGCG CCAGAATTAC AGCTATTTAA AGCATGTTGC TTTTTATCCT
TATATCAAAG GCCTGCAGCG CGCGGGACTG AGTATTGAAT ATATCCCTGA CCTGCAAACA
ATGAACGACA ACCTGGGCAA GATTGGCTGG GGTGCAGTTA CCGTGGATGG TTTTATACCG
CCTGCAGCCT TTATGGAGTA CCAGGCTTAC CGGGTACTGG TTATTGCAGC AGATATCCGT
CAGATCAATC ATATTGAATA TACTCCTGCA CCAGATATTA TCCACGAATC GGCAGGTCAT
GCCCCAATTA TTGCCGACGC GGACTATAAC AATTACCTGA GCTATTTTGG TTCTATAGGT
GCCAAGGCCA TGTTCTCATC AAAAGATTTT GAGCTTTATG AGGCCATCAG GAACCTTTCG
ATCTTAAAAG AGGCTGTGGA TGCCAATGAA GAAGAGATCG CAAAGGCAGA ACGCCTGTTA
CAGCAGATCT CGGAAAATAT GGGTGAACCA TCGGAAATGG CTTTATTGAG CCGCCTGCAT
TGGTGGACAG TTGAATATGG CCTGATCGGC ACACTGGAAG ATCCAAAGAT CTATGGTGCC
GGTTTGCTTT CTTCGATCGG TGAAAGTTCG AGCTGTATGA AACCTGAGGT GCAGAAACTA
TGGTATAACA TTGATACCAT TAACTACAGT TACGACATTA CCAAACCCCA GCCCCAGTTG
TTTGTAACTG AAACGTTCCA GAACCTGATT GACGTACTTG AAGTTTTTGC CGATACCATG
GCCTTCAGGA AAGGTGGCAC CGAAAGTATT GTAAAGGCTA TTGAATGTAA AAACCCGGCA
ACTGCAGTTT ACAGTTCAGG CTTGCAGGTT ACCGGCGTGT TTACAGATAT GGGACTGGAT
GGCAATGATG CGCTTACTTT TATCAGAACA ACCGGGCCTT CTGCTTTGGC GATTGGTAAT
AAGCAACTGG AAGGACATGG TAAACATTTC CATAAAGATG GTTTTTCATC TCCTGTGGGT
AAACTGAAGG GTATTGCTAC GCCGCTGGAA GACATGGACA TGCTCCAATT GCTGAATTGT
GGGATCAAAC CGCTTAACCT GGCTATCCTG GAGTTTGAAA GTGGCATTAC GGTAAAGGGT
ACAGTCAGGA CCATCCATCA GCAAAATGAA AAAACTTTTC TGATTACCTT TGACAATTGT
ACCGTTAAGG AGCGTAACGG GAATATACTT TTTCAGCCAG ACTGGGGCAT GTATGACATG
GCTGTTGGCG AAAAGATCGT TTCCGTTTAC AATGGTGCAG CTGATAAGGA TGCTTATGAA
GAAATTACCC ATATCAGTAA TAAACAAACC CACAAAGTGG CTTACGACGA GAAGACCCAA
AAACTGCATG CCATTTATAA AGCGGTACGA CAGATAAGGG AAAGCGGAAC AGGATATGAA
CAATTGCCAG TCCTTTTTGG GGCATTAAAA AATGAACACC GCTACGACTG GCTGTCGGCG
ATGGAAATTC TGGAGATCTT ATACCATAAA CAGCTTTATC CTGAACTGGA AAAAGAGCTG
CGTATTTACC TGGAACTTAA ATCGGCCAGT GAAAGCGAAC ACACAAAACT GATTAATGAC
GGTTTACATG TTATTGCAAA CCCGGTTACC AAATTGATTA CAGAAGAAGA AGCACATTAA
 
Protein sequence
MDAMNDFNDF NNKQVANLPR HLRQFIVEQH YEKYTPVDQA VWRYVMRQNY SYLKHVAFYP 
YIKGLQRAGL SIEYIPDLQT MNDNLGKIGW GAVTVDGFIP PAAFMEYQAY RVLVIAADIR
QINHIEYTPA PDIIHESAGH APIIADADYN NYLSYFGSIG AKAMFSSKDF ELYEAIRNLS
ILKEAVDANE EEIAKAERLL QQISENMGEP SEMALLSRLH WWTVEYGLIG TLEDPKIYGA
GLLSSIGESS SCMKPEVQKL WYNIDTINYS YDITKPQPQL FVTETFQNLI DVLEVFADTM
AFRKGGTESI VKAIECKNPA TAVYSSGLQV TGVFTDMGLD GNDALTFIRT TGPSALAIGN
KQLEGHGKHF HKDGFSSPVG KLKGIATPLE DMDMLQLLNC GIKPLNLAIL EFESGITVKG
TVRTIHQQNE KTFLITFDNC TVKERNGNIL FQPDWGMYDM AVGEKIVSVY NGAADKDAYE
EITHISNKQT HKVAYDEKTQ KLHAIYKAVR QIRESGTGYE QLPVLFGALK NEHRYDWLSA
MEILEILYHK QLYPELEKEL RIYLELKSAS ESEHTKLIND GLHVIANPVT KLITEEEAH
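
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used to build this record. It assumes the sequence is given in the coding orientation, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ only in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGATGCAA TGAATGATTT TAATGACTTT"))  # prints: MDAMNDFNDF
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after applying `clean` to it) gives a quick consistency check, and `gc_content` on the full gene sequence can be compared with the GC content field.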