Gene Saro_3187 details


Gene Information

Locus tag: Saro_3187
Symbol: phhA
ID: 3917445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3407491
End bp: 3408429
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 66%
IMG OID: 640445971
Product: phenylalanine 4-monooxygenase
Protein accession: YP_498456
Protein GI: 87201199
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID: [TIGR01267] phenylalanine-4-hydroxylase, monomeric form
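
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3407491, 3408429

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 939
print(protein_length_aa(length))  # 312
```

Both results agree with the Gene Length (939 bp) and Protein Length (312 aa) fields listed in the record.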


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGACAA CGACCGCACC CGCTTTCGAC TATGCCTGCC TACCGGAAAT GCCGGAGGGC 
GTGTTCACCG CGCCCTTGCG CCGCCCGGAC CGCGTGGGCG AGGACTGGCT CGAGCCTGCG
CAACACCGCT ACACCGCGCA GGAGCACGCG ATCTGGGATG AGCTTTACGC CCGCCAGATG
GAACTTCTGC CCGGCAGGGC CTGCAGCGCC TTCCTGCAGG GCCTGGAGCG GCTCGACCTC
GGGCGCGGAG GCGTGCCCGA CTTCGCACGG CTTTCGTCCG AGCTTGGCGC GCTGACTGGC
TGGAGCGTCG TGCCCGTGCC GATGCTGATC CCCGATCACG TGTTCTTCTG GCACCTGGCG
AACCGCCGCT TTCCCGCAGG CAACTTCATC CGCACGCGCG AGACGTTCGA TTACATCCAG
GAGCCCGACG TCTTCCACGA TGTCTTCGGC CACGTACCGA TGCTGACCGA CCCGACTTAT
GCCGACTACA TGCAGGAGTA TGGCCGCGCC GGGTGGAAGG CGATGCGTTA CAACCGGCTC
AAGGCGCTGG GCGCGCTCTA CTGGTACACG GTGGAGTTCG GGCTGGTGAT CGAGGACGGC
GCGCCCAAGG TCTATGGTGC GGGGATCCTC TCCGGCCCGC GCGAGGCGGT GTTCGCGCTG
GAGGGGCAGT CGCCCAACCG CATCATGCTC AACGTCGACC GGGTCATGCG CACGGATTAC
GTGATCGACG ATCTCCAGCC GACCTATTTC GTGATCGAGA GCTTCGCGGA CCTCTATCAC
CAGACGGTCG AGCGCGATTT CGACCGGCTC TACCGCGCGC TCGGCGCCGG GTTCACTTAT
GCCAACACTG CGGTGATCGA CGTGGACGAC GTGCTGCACC GGGGCACGCT GGAATACCAC
CTGCGGGGCG GGCGCGGATC GGGCGCAATT CCGGTCTGA
 
Protein sequence
MLTTTAPAFD YACLPEMPEG VFTAPLRRPD RVGEDWLEPA QHRYTAQEHA IWDELYARQM 
ELLPGRACSA FLQGLERLDL GRGGVPDFAR LSSELGALTG WSVVPVPMLI PDHVFFWHLA
NRRFPAGNFI RTRETFDYIQ EPDVFHDVFG HVPMLTDPTY ADYMQEYGRA GWKAMRYNRL
KALGALYWYT VEFGLVIEDG APKVYGAGIL SGPREAVFAL EGQSPNRIML NVDRVMRTDY
VIDDLQPTYF VIESFADLYH QTVERDFDRL YRALGAGFTY ANTAVIDVDD VLHRGTLEYH
LRGGRGSGAI PV
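
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction helper and a stop-codon check; the helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the last line of the gene sequence is used here, to keep the example short.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Last line of the gene sequence, copied from the record above.
last_line = "CTGCGGGGCG GGCGCGGATC GGGCGCAATT CCGGTCTGA"
tail = clean_sequence(last_line)

print(len(tail))             # 39
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.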