Gene Saro_3484 details


Gene Information

Locus tag: Saro_3484
Symbol:
ID: 5077633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 86640
End bp: 88049
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 64%
IMG OID: 640481208
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001165870
Protein GI: 146275710
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
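
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 86640
end_bp = 88049
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                      # True
print(remainder == 0)                                 # True
print(expected_protein_length == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1410 bp is 470 codons, of which 469 encode amino acids.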


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGACC GCGATCCCGG CGCATTCGAC AAGGCCCGCT GCCCCGGCAC TAGCTGGGAA 
GACATCCTGC GCGCCGACGA GGTGCAGCCG CCCGCCTTCA TGGCGGAGGA CCGTTCGCAG
TATCTCGGCT CAGAACCGAT CGATGCAGCG CGATACTACA GCCCCGAATT CTTCAAGACC
GAGTGCGAAA GGATGTGGCC CTTCGTCTGG CAGTTCGCCG CGCGCGAGGA AGATCTGCCC
GAGCCGGGCG ACTACGTAAC CTACGACAAC GCGGGTCGGT CCTACCTGAT CGTGCGGCAA
GAGGACGGCA GTCTCAAGGC GTTCCATAAT GTCTGCCTGC ACCGCGGCCG CAAGCTCAAG
ACGGACAGCG GCAGCGCAGA ACAGTTCCTC TGTCCGTTCC ACGGCTTCTC GTGGAATCCG
GACGGCTCGT TGCGCAACAT CCCCTGCCGC TGGGACTTTG CCCACCTCAG CGACCAGAAG
ATGCAGCTTC CCGAGGCGAG CCTTGCGCAG TGGGGCGGCT ATGTCTTCGT CCGCGATGCT
GCCGAGGGGC CGACCATCGA GGAATACCTC GATCCGCTTC CGGAGTTCTT CAAGCGCTGG
AAGCACGAGG AATGCGTGAC GGTTGCCTGG GTCGCCAAGG TGATCCCGGC AAACTGGAAG
ATTGCGATGG AGGCTTTCAT GGAAAGCTAC CACGCCTATG TCACGCACCC GCAGCTCATG
CCGTTCACCG GCGATGCCAA CGCGGCCTAC CACGTGCTCG GCCGCCACGT GAACGTGAAC
TACACGCCCT TCGGCGTCGT CAGCCCGCAC ATCGAGGCGC AAGCCGAGGC CGAGCACTGG
CCGCAGCAGC GCATCATTGA CGAGTTCCGC AAGTACAACG GTCGCAGCGC CGACAACTAC
GACGCGGACA AGGACAACTA CGCCATCGAG GTGCCCGAAG GCCGCAGCGC CCGCGCCGCA
CTTGGCGAGA AGATGCGCGA GGTTTCGGCA AGGCAGTTCG GTGGTGACTA TTCCGGCGTT
TCGGAAAGCG AACTGCTCGA CGCGCTGGTC TTCAACGTCT TCCCGAACTT CGCGCCGTGG
GGCGGTTTCA TGCCCAATAT CGTCTATCGC TGGCGGCCCT GGCCCGATCA GGACAAGTGC
CTGATGGAAG TGCGCGTGAT CGCCCGCGTC CCGGAAGGCC AGCCGCGCCC CGCCGGTGTG
CCGATGCACA TGCTGGGCGA CGACCAGATC TGGGCCGATG CGCCCGAGCT TGGCGTGCTT
GGCGCGGTGC TCGACCAGGA CAGCGAGAAC ATGGCGCTGT GCCACGAAGG CCTGAAGGTT
TCCAAGAACC AGGCGGTGGA ACTGGCGGAC TATCAGGAAG TGCGCATCCG CCACATCCAC
CAGACGCTCG ACAGCTATCT GAACGCGTGA
 
Protein sequence
MADRDPGAFD KARCPGTSWE DILRADEVQP PAFMAEDRSQ YLGSEPIDAA RYYSPEFFKT 
ECERMWPFVW QFAAREEDLP EPGDYVTYDN AGRSYLIVRQ EDGSLKAFHN VCLHRGRKLK
TDSGSAEQFL CPFHGFSWNP DGSLRNIPCR WDFAHLSDQK MQLPEASLAQ WGGYVFVRDA
AEGPTIEEYL DPLPEFFKRW KHEECVTVAW VAKVIPANWK IAMEAFMESY HAYVTHPQLM
PFTGDANAAY HVLGRHVNVN YTPFGVVSPH IEAQAEAEHW PQQRIIDEFR KYNGRSADNY
DADKDNYAIE VPEGRSARAA LGEKMREVSA RQFGGDYSGV SESELLDALV FNVFPNFAPW
GGFMPNIVYR WRPWPDQDKC LMEVRVIARV PEGQPRPAGV PMHMLGDDQI WADAPELGVL
GAVLDQDSEN MALCHEGLKV SKNQAVELAD YQEVRIRHIH QTLDSYLNA
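
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written only for this example. Only the first and last 30 bases of the gene sequence are used here.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a cleaned, in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last 30 bases of the gene sequence, copied from the record.
first_block = clean("ATGGCCGACC GCGATCCCGG CGCATTCGAC")
last_block = clean("CAGACGCTCG ACAGCTATCT GAACGCGTGA")

print(translate(first_block))  # MADRDPGAFD
print(translate(last_block))   # QTLDSYLNA
```

The two fragments translate to the first ten and last nine residues of the protein sequence; the final codon TGA is a stop and produces no residue. Passing the full cleaned gene sequence to `gc_fraction` would give the value that the record reports rounded as GC content.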