Gene Saro_3842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3842 
Symbol 
ID5077453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp9846 
End bp11213 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content61% 
IMG OID640480952 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001165614 
Protein GI146275453 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTCG AACGTATCGG TCGCGAACCG GATTATTCAC GCTACATGGA CCTCAAGGAA 
GGCTGGCTTG ACCGCCGGAT CTTTTCGGAT GCCGACATCT ACGAGGAGGA GCTGTACCGC
ATTTTCGCGC GGTCGTGGCT GTTCGTTGCC CACGAAAGCC AGATCCCCAG TTCCGGAGAC
TTCCTGACGA CCCACATGGG CGAAGATGCG GTGATCGTCG CCCGCCAGCC CGACGGATCG
ATCCGGGTCA TGCTCAATTC CTGCCCGCAC CGCGGCAATA AGGTGTGCTT CGCCGATGCC
GGGAACACCC GTCGGTTCGT CTGCAATTAC CACGGCTGGG CGTTTGACAC CGCCGGCGAC
CTCAAGGGCA TGCACGAGGA ATATTGCTAC GACGCGGGCG ATATCGACTT CAAGAACCAT
GGCCTCAAGA ACGTCGCCAA GGTCGGCAAC TACAAGGGCC TGGTGTTCGC CACCTTCAAC
AGCGATGCGC CGAGCCTGGA AGCCTGGCTA GGCGATTTCC GGTGGTATCT CGACATGATC
CTCGACAACG AGGAAGGCGG CACCGAATTC ATTGGCGGCT GCATCAAGTC GGTGATCAGC
GCGAACTGGA AGTTCGGGGT CGAGAACTTC ATCGGCGACG CTTACCACGC CGGCTGGACG
CATGATTCGG GCACTCGGTC GATGAACAAC GGCCAGCCGT TCCCGCCGAT CGACATGGAT
AATTCCTATC ACGCCAGCGT GAACGGCCAC GGCTGGGAAT TCGGCACCGA AGGCGTGGGC
GACCTCTTCC TGCTCGGGCG CCCCAAGGTG ATGGACTATT ACAACAAGAT CCGCCCGAAG
ATGGCGGAAC GCCTGGGCGA GATGCGCTCG AAGATCTTCG GTTCGGTCGC CTCGGCATCG
ATCTTCCCCA ACGTCTCGTT CCTGCCGGGC ATTTCCACCT TCCGCCAGTG GCAACCCAAG
GGGCCGATGC AGTTCGAATT GAAGACCTGG GTGATCGTCA ACAAGAACAT GCCCGACGAC
ATCAAGGAGG AAGTGACCAA GGGCGTGATG CAGACCTTCG GCCCCGGTGG CACCTTCGAG
ATGGATGACG GGGAAAACTG GGAGAACTGC ACCACCGTCA ACCGCGGCGT CGTCACCCGG
CACGAGCGCC TGCACTATCG CTGCGGGATC GGCCGCCAGA TCGAACATGA TACCCTGCCG
GGCATCGTCT ATCGCGGCCA GTACAACGAC GCCAACCAGC GCGGCTTCTA CCAGCGCTGG
CTCGACATGA TGACCCATGA CGAATTCGGC AAGATGCCGG CACGGCCCGA ACCGCAGCTG
GGCAATGTGG CCGAAACCCG CGACCTTCCC GGCCTGTTCG CGCTCTGA
 
Protein sequence
MRFERIGREP DYSRYMDLKE GWLDRRIFSD ADIYEEELYR IFARSWLFVA HESQIPSSGD 
FLTTHMGEDA VIVARQPDGS IRVMLNSCPH RGNKVCFADA GNTRRFVCNY HGWAFDTAGD
LKGMHEEYCY DAGDIDFKNH GLKNVAKVGN YKGLVFATFN SDAPSLEAWL GDFRWYLDMI
LDNEEGGTEF IGGCIKSVIS ANWKFGVENF IGDAYHAGWT HDSGTRSMNN GQPFPPIDMD
NSYHASVNGH GWEFGTEGVG DLFLLGRPKV MDYYNKIRPK MAERLGEMRS KIFGSVASAS
IFPNVSFLPG ISTFRQWQPK GPMQFELKTW VIVNKNMPDD IKEEVTKGVM QTFGPGGTFE
MDDGENWENC TTVNRGVVTR HERLHYRCGI GRQIEHDTLP GIVYRGQYND ANQRGFYQRW
LDMMTHDEFG KMPARPEPQL GNVAETRDLP GLFAL