Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0597 |
Symbol | |
ID | 3915609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 642174 |
End bp | 643286 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443327 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_495878 |
Protein GI | 87198621 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCATCC AGACAAAGGG CCAGACCGAC CTGTTCGAAA ACCCGCTCGG CCTCGACGGC TTCGAATTCA TCGAGTTCTC CGCACCGGAA GCGGGCAGCC TCGAGCCAGT CTTCGCGCGC ATGGGCTTCA CCCACATCGC CAACCATCGC TCCAAGCAGG TCCAGCTCTG GCGCCAGGGC GGCATCAACC TCATCGCCAA CCACGAACCG CGCAGCCCTG CGGCCTACTT CGCCGCCGAG CACGGCCCAT CGGCCTGCGG CATGGGCTGG CGCGTGCGCA ACGCGGCCAA GGCCTATGCC ACCGCTATCG AGCGCGGGGC CGAGCCGGTC GAGGTCCGCA CCGGACCGAT GGAACTGCGC CTGCCCGCGA TCCGCGGCAT CGGCGGTTCG ATCATCTATC TCATCGACCG CTACGAAGGC GGCCAGTTCG GCGACCTCTC GATCTACGAC ATCGACTTCG AATATCTCCC CGGAGTGGAC CGGCATCCCG TCGGCGCGGG TTTCCACACC ATCGACCACC TGACCCACAA CGTCTACGGC GGGCGCATGG CGCACTGGGC TTCGTTCTAC GAGCGCGTGT TCGGCTTCCG CGAGATCCGC TACTTCGACA TCAAGGGCGA ATATACCGGC CTCACCAGCC GAGCGATGAC CGCGCCCGAT GGCAAGATCC GCATTCCGCT GAACGAGGAA GCCAAGGCAG GCGGCGGCCA GATCGAGGAG TTCCTGCGCG CCTACAACGG CGAGGGCATC CAGCACATCG CCTTCGCCTG CGACGACCTG CTCGGCGGGT GGGACAAGCT CAAGGCCATT GGCACCCCCT TCATGTCCCC GCCGCCCGCG ACCTACTACG AAATGCTCGA CGAACGCCTG CCCGGCCATG GCGAGCCGGT GGCGGAACTG CAGAAGCGCG GCATCCTGCT CGACGGCTCC ACCGAAGAGG GCGATCCCCG CCTCCTGCTG CAGATCTTCT CCGAAACCCA GATCGGCCCG GTGTTCTTCG AGTTCATCCA GCGCAAGCGC GACGAAGGCT TTGGCGAAGG CAATTTCACC GCCCTGTTCA AGTCGATGGA ACGCGACCAG CTTCGTCGCG GCGTGCTCAA GACCGAAGCC TGA
|
Protein sequence | MSIQTKGQTD LFENPLGLDG FEFIEFSAPE AGSLEPVFAR MGFTHIANHR SKQVQLWRQG GINLIANHEP RSPAAYFAAE HGPSACGMGW RVRNAAKAYA TAIERGAEPV EVRTGPMELR LPAIRGIGGS IIYLIDRYEG GQFGDLSIYD IDFEYLPGVD RHPVGAGFHT IDHLTHNVYG GRMAHWASFY ERVFGFREIR YFDIKGEYTG LTSRAMTAPD GKIRIPLNEE AKAGGGQIEE FLRAYNGEGI QHIAFACDDL LGGWDKLKAI GTPFMSPPPA TYYEMLDERL PGHGEPVAEL QKRGILLDGS TEEGDPRLLL QIFSETQIGP VFFEFIQRKR DEGFGEGNFT ALFKSMERDQ LRRGVLKTEA
|
| |