Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1586 |
Symbol | |
ID | 3918694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1643707 |
End bp | 1645572 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444326 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_496860 |
Protein GI | 87199603 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.435654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCT CGATCGCCAC CGTCTCGATC AGCGGCACGC TCGATGCCAA GCTGCAAGCC ATTGCCGACG CTGGCTATGA TGGCGCGGAA ATCTTCGAGA ACGACCTGCT CTCCACCCAT CTCTCCGCAC GCGAGATCGG CGCAATGATG CGCGACCTCG GCCTCGCCTG CACGATGTTC CAACCCTTCC GCGACCTGGA AGGGCTTCCC GCCGAACAAC GCGCCCGCGC CTTCGACCGG CTCGAACGCA AGTTCGACGT GATGGAGGAA CTCGGCACCG ACCTGCTCCT CGCTTGCTCG TCATGCTCGC CCATTGCGGA GGGGAACCGT GAACGCATCG TTGCCGACCT TGCCGACGCG GGAGAACGCG CCGCGCGGCG TGGGCTTCGC ATCGGCTATG AAGCGCTGGC CTGGGGTCGA CACATCAATG ACCACCGCGA TGCGTGGAGC ATCGTGCGCG ACGTGGATCA TCCGGCCATC GGGTTGGTCC TCGACAGCTT CCACTCGCTT TCGCGCCGCA TCCCGTCGAG CAGCATCGGC GATATCCGCC CCGAAAAGCT TTTCATCGTC CAGGTGGCCG ATGCTCCGGT CCTCAACATG GACCTCCTGC AATGGAGCCG CCATTTCCGC TCCATGCCCG GCCAGGGCGA TTTCCCGCTG GACGACTGGG CTGAAGCGAT CCGCCGCATC GGCTATGATG GCTACTGGAG CCTCGAGATA TTCAACGACC GGTTCCGCGC CGGTTCGGCG CACGGCGTTG CGCTCGACGG ATATCGCTCG CTCCGCCTGA TGCAGGGCGG TATCGCCCGG GCCGCCAGCG GTGCCTCGGC GCTGCCCGCG AAGGCAAGGC CGCTCGGCGT CGAGTTCATC GAGTTTGCCG CCAGCCACGA AGAGGCCGAG GCCCTTGGCG GCATGCTGCG CCCGCTAGGC TTCCGTCCCA CCGCGCGGCA CCGCACCAAG GATGTCACTC GCTGGCAGCA AGGCGGAATC AACATCGTCG TCAACTGTGA ACCGGAAGGC CTCGCCCACA GCTTCGACGT CGTTCACGGC GCGTCGGTCT GCGCGATCGG CCTGGCCGTC GAGGACGTGC CCGCCGCACT GGCCCGCGCC GAATTCCTGC GCGTTCCCCG CTTCGAACAG GCCGTCGCCC CCGGCGAGTG GCCAATTCCG AGCGTTCGCG GGGTGGGCGG AAGCCTGCTC TACTTCGTCG ATGCCGCGAC CCGCGAAGCG ATGTGGGCGC ATGAATTCCC CCACGCGCTC GAACCCTTGC CCGAAGCCCC CCTACTCACT TCGATCGACC ACATCGCACA GACGATGCAG TACGAGGAGT TCCTGAGCTG GCTTCTGTTC TACGTCGCGC TGTTCGACCT GGAAAAGACT CCGCAACTCG AGATCGCCGA TCCCATGGGC CTCGTGCAGA GCCAGGCGGT GGAGAGCGCC GACCGTTCGG TGCGCTTCAC CCTCAACGGG TCGCTCGCGG CGCAGTCGCT CACTTCGCGG TTCGTGCAGA ACTATTTCGG CGCGGGCGTG CAGCATGTCG CCTTTGCCAC CGGCGACGCC TTTGCAGCTT CGGAAAGCGC GGCATCCAAA GGACTTGAGC GGCTCGCGAT CCCCCGCAAC TACCACGACG ATCTCGAGGC GCGGTGGGGG CTGGAAGGCA ACCTGGCCGA CCGGATGGCC GCCGACGATC TGCTATACGA CCGCGACGGC GAGGCCGAGT ACTTCCAGTT CTACAGCCGC GCCTTTGCGC GGCGCGTGTT CTTCGAGGTG GTCGAACGGC GCGGATACGA GGGTTACGGC GCGGCGAACG CCCCGATCCG TCTCGCTGCG CAGGCTCGGC ACAAGCCTGA CCTGAACGGC CTCTAA
|
Protein sequence | MKTSIATVSI SGTLDAKLQA IADAGYDGAE IFENDLLSTH LSAREIGAMM RDLGLACTMF QPFRDLEGLP AEQRARAFDR LERKFDVMEE LGTDLLLACS SCSPIAEGNR ERIVADLADA GERAARRGLR IGYEALAWGR HINDHRDAWS IVRDVDHPAI GLVLDSFHSL SRRIPSSSIG DIRPEKLFIV QVADAPVLNM DLLQWSRHFR SMPGQGDFPL DDWAEAIRRI GYDGYWSLEI FNDRFRAGSA HGVALDGYRS LRLMQGGIAR AASGASALPA KARPLGVEFI EFAASHEEAE ALGGMLRPLG FRPTARHRTK DVTRWQQGGI NIVVNCEPEG LAHSFDVVHG ASVCAIGLAV EDVPAALARA EFLRVPRFEQ AVAPGEWPIP SVRGVGGSLL YFVDAATREA MWAHEFPHAL EPLPEAPLLT SIDHIAQTMQ YEEFLSWLLF YVALFDLEKT PQLEIADPMG LVQSQAVESA DRSVRFTLNG SLAAQSLTSR FVQNYFGAGV QHVAFATGDA FAASESAASK GLERLAIPRN YHDDLEARWG LEGNLADRMA ADDLLYDRDG EAEYFQFYSR AFARRVFFEV VERRGYEGYG AANAPIRLAA QARHKPDLNG L
|
| |