Gene Saro_0597 details


Gene Information

Locus tag: Saro_0597
Symbol:
ID: 3915609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 642174
End bp: 643286
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 65%
IMG OID: 640443327
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_495878
Protein GI: 87198621
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
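
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 642174
end_bp = 643286

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1113 370
```

Both results agree with the Gene Length and Protein Length fields of the record.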


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCATCC AGACAAAGGG CCAGACCGAC CTGTTCGAAA ACCCGCTCGG CCTCGACGGC 
TTCGAATTCA TCGAGTTCTC CGCACCGGAA GCGGGCAGCC TCGAGCCAGT CTTCGCGCGC
ATGGGCTTCA CCCACATCGC CAACCATCGC TCCAAGCAGG TCCAGCTCTG GCGCCAGGGC
GGCATCAACC TCATCGCCAA CCACGAACCG CGCAGCCCTG CGGCCTACTT CGCCGCCGAG
CACGGCCCAT CGGCCTGCGG CATGGGCTGG CGCGTGCGCA ACGCGGCCAA GGCCTATGCC
ACCGCTATCG AGCGCGGGGC CGAGCCGGTC GAGGTCCGCA CCGGACCGAT GGAACTGCGC
CTGCCCGCGA TCCGCGGCAT CGGCGGTTCG ATCATCTATC TCATCGACCG CTACGAAGGC
GGCCAGTTCG GCGACCTCTC GATCTACGAC ATCGACTTCG AATATCTCCC CGGAGTGGAC
CGGCATCCCG TCGGCGCGGG TTTCCACACC ATCGACCACC TGACCCACAA CGTCTACGGC
GGGCGCATGG CGCACTGGGC TTCGTTCTAC GAGCGCGTGT TCGGCTTCCG CGAGATCCGC
TACTTCGACA TCAAGGGCGA ATATACCGGC CTCACCAGCC GAGCGATGAC CGCGCCCGAT
GGCAAGATCC GCATTCCGCT GAACGAGGAA GCCAAGGCAG GCGGCGGCCA GATCGAGGAG
TTCCTGCGCG CCTACAACGG CGAGGGCATC CAGCACATCG CCTTCGCCTG CGACGACCTG
CTCGGCGGGT GGGACAAGCT CAAGGCCATT GGCACCCCCT TCATGTCCCC GCCGCCCGCG
ACCTACTACG AAATGCTCGA CGAACGCCTG CCCGGCCATG GCGAGCCGGT GGCGGAACTG
CAGAAGCGCG GCATCCTGCT CGACGGCTCC ACCGAAGAGG GCGATCCCCG CCTCCTGCTG
CAGATCTTCT CCGAAACCCA GATCGGCCCG GTGTTCTTCG AGTTCATCCA GCGCAAGCGC
GACGAAGGCT TTGGCGAAGG CAATTTCACC GCCCTGTTCA AGTCGATGGA ACGCGACCAG
CTTCGTCGCG GCGTGCTCAA GACCGAAGCC TGA
 
Protein sequence
MSIQTKGQTD LFENPLGLDG FEFIEFSAPE AGSLEPVFAR MGFTHIANHR SKQVQLWRQG 
GINLIANHEP RSPAAYFAAE HGPSACGMGW RVRNAAKAYA TAIERGAEPV EVRTGPMELR
LPAIRGIGGS IIYLIDRYEG GQFGDLSIYD IDFEYLPGVD RHPVGAGFHT IDHLTHNVYG
GRMAHWASFY ERVFGFREIR YFDIKGEYTG LTSRAMTAPD GKIRIPLNEE AKAGGGQIEE
FLRAYNGEGI QHIAFACDDL LGGWDKLKAI GTPFMSPPPA TYYEMLDERL PGHGEPVAEL
QKRGILLDGS TEEGDPRLLL QIFSETQIGP VFFEFIQRKR DEGFGEGNFT ALFKSMERDQ
LRRGVLKTEA
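
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the tool used to produce this record. The function name `translate` and the constant names are illustrative, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order. Translation table 11 uses the
# same assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna):
    """Translate a coding sequence, reading an initiating start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the first codon initiates translation
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence from this record.
first_row = "GTGAGCATCC AGACAAAGGG CCAGACCGAC CTGTTCGAAA ACCCGCTCGG CCTCGACGGC"
last_row = "CTTCGTCGCG GCGTGCTCAA GACCGAAGCC TGA"

print(translate(first_row))  # MSIQTKGQTDLFENPLGLDG
print(translate(last_row))   # LRRGVLKTEA
```

The first row reproduces the first 20 residues of the protein sequence, and the last row reproduces its final 10 residues. Note that the internal GTG in the last row is read as V, since only the initiating codon is translated as M.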