Gene Saro_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1586 
Symbol 
ID3918694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1643707 
End bp1645572 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content66% 
IMG OID640444326 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_496860 
Protein GI87199603 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.435654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACCT CGATCGCCAC CGTCTCGATC AGCGGCACGC TCGATGCCAA GCTGCAAGCC 
ATTGCCGACG CTGGCTATGA TGGCGCGGAA ATCTTCGAGA ACGACCTGCT CTCCACCCAT
CTCTCCGCAC GCGAGATCGG CGCAATGATG CGCGACCTCG GCCTCGCCTG CACGATGTTC
CAACCCTTCC GCGACCTGGA AGGGCTTCCC GCCGAACAAC GCGCCCGCGC CTTCGACCGG
CTCGAACGCA AGTTCGACGT GATGGAGGAA CTCGGCACCG ACCTGCTCCT CGCTTGCTCG
TCATGCTCGC CCATTGCGGA GGGGAACCGT GAACGCATCG TTGCCGACCT TGCCGACGCG
GGAGAACGCG CCGCGCGGCG TGGGCTTCGC ATCGGCTATG AAGCGCTGGC CTGGGGTCGA
CACATCAATG ACCACCGCGA TGCGTGGAGC ATCGTGCGCG ACGTGGATCA TCCGGCCATC
GGGTTGGTCC TCGACAGCTT CCACTCGCTT TCGCGCCGCA TCCCGTCGAG CAGCATCGGC
GATATCCGCC CCGAAAAGCT TTTCATCGTC CAGGTGGCCG ATGCTCCGGT CCTCAACATG
GACCTCCTGC AATGGAGCCG CCATTTCCGC TCCATGCCCG GCCAGGGCGA TTTCCCGCTG
GACGACTGGG CTGAAGCGAT CCGCCGCATC GGCTATGATG GCTACTGGAG CCTCGAGATA
TTCAACGACC GGTTCCGCGC CGGTTCGGCG CACGGCGTTG CGCTCGACGG ATATCGCTCG
CTCCGCCTGA TGCAGGGCGG TATCGCCCGG GCCGCCAGCG GTGCCTCGGC GCTGCCCGCG
AAGGCAAGGC CGCTCGGCGT CGAGTTCATC GAGTTTGCCG CCAGCCACGA AGAGGCCGAG
GCCCTTGGCG GCATGCTGCG CCCGCTAGGC TTCCGTCCCA CCGCGCGGCA CCGCACCAAG
GATGTCACTC GCTGGCAGCA AGGCGGAATC AACATCGTCG TCAACTGTGA ACCGGAAGGC
CTCGCCCACA GCTTCGACGT CGTTCACGGC GCGTCGGTCT GCGCGATCGG CCTGGCCGTC
GAGGACGTGC CCGCCGCACT GGCCCGCGCC GAATTCCTGC GCGTTCCCCG CTTCGAACAG
GCCGTCGCCC CCGGCGAGTG GCCAATTCCG AGCGTTCGCG GGGTGGGCGG AAGCCTGCTC
TACTTCGTCG ATGCCGCGAC CCGCGAAGCG ATGTGGGCGC ATGAATTCCC CCACGCGCTC
GAACCCTTGC CCGAAGCCCC CCTACTCACT TCGATCGACC ACATCGCACA GACGATGCAG
TACGAGGAGT TCCTGAGCTG GCTTCTGTTC TACGTCGCGC TGTTCGACCT GGAAAAGACT
CCGCAACTCG AGATCGCCGA TCCCATGGGC CTCGTGCAGA GCCAGGCGGT GGAGAGCGCC
GACCGTTCGG TGCGCTTCAC CCTCAACGGG TCGCTCGCGG CGCAGTCGCT CACTTCGCGG
TTCGTGCAGA ACTATTTCGG CGCGGGCGTG CAGCATGTCG CCTTTGCCAC CGGCGACGCC
TTTGCAGCTT CGGAAAGCGC GGCATCCAAA GGACTTGAGC GGCTCGCGAT CCCCCGCAAC
TACCACGACG ATCTCGAGGC GCGGTGGGGG CTGGAAGGCA ACCTGGCCGA CCGGATGGCC
GCCGACGATC TGCTATACGA CCGCGACGGC GAGGCCGAGT ACTTCCAGTT CTACAGCCGC
GCCTTTGCGC GGCGCGTGTT CTTCGAGGTG GTCGAACGGC GCGGATACGA GGGTTACGGC
GCGGCGAACG CCCCGATCCG TCTCGCTGCG CAGGCTCGGC ACAAGCCTGA CCTGAACGGC
CTCTAA
 
Protein sequence
MKTSIATVSI SGTLDAKLQA IADAGYDGAE IFENDLLSTH LSAREIGAMM RDLGLACTMF 
QPFRDLEGLP AEQRARAFDR LERKFDVMEE LGTDLLLACS SCSPIAEGNR ERIVADLADA
GERAARRGLR IGYEALAWGR HINDHRDAWS IVRDVDHPAI GLVLDSFHSL SRRIPSSSIG
DIRPEKLFIV QVADAPVLNM DLLQWSRHFR SMPGQGDFPL DDWAEAIRRI GYDGYWSLEI
FNDRFRAGSA HGVALDGYRS LRLMQGGIAR AASGASALPA KARPLGVEFI EFAASHEEAE
ALGGMLRPLG FRPTARHRTK DVTRWQQGGI NIVVNCEPEG LAHSFDVVHG ASVCAIGLAV
EDVPAALARA EFLRVPRFEQ AVAPGEWPIP SVRGVGGSLL YFVDAATREA MWAHEFPHAL
EPLPEAPLLT SIDHIAQTMQ YEEFLSWLLF YVALFDLEKT PQLEIADPMG LVQSQAVESA
DRSVRFTLNG SLAAQSLTSR FVQNYFGAGV QHVAFATGDA FAASESAASK GLERLAIPRN
YHDDLEARWG LEGNLADRMA ADDLLYDRDG EAEYFQFYSR AFARRVFFEV VERRGYEGYG
AANAPIRLAA QARHKPDLNG L