Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3842 |
Symbol | |
ID | 5077453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 9846 |
End bp | 11213 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640480952 |
Product | ring hydroxylating dioxygenase, alpha subunit |
Protein accession | YP_001165614 |
Protein GI | 146275453 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTCG AACGTATCGG TCGCGAACCG GATTATTCAC GCTACATGGA CCTCAAGGAA GGCTGGCTTG ACCGCCGGAT CTTTTCGGAT GCCGACATCT ACGAGGAGGA GCTGTACCGC ATTTTCGCGC GGTCGTGGCT GTTCGTTGCC CACGAAAGCC AGATCCCCAG TTCCGGAGAC TTCCTGACGA CCCACATGGG CGAAGATGCG GTGATCGTCG CCCGCCAGCC CGACGGATCG ATCCGGGTCA TGCTCAATTC CTGCCCGCAC CGCGGCAATA AGGTGTGCTT CGCCGATGCC GGGAACACCC GTCGGTTCGT CTGCAATTAC CACGGCTGGG CGTTTGACAC CGCCGGCGAC CTCAAGGGCA TGCACGAGGA ATATTGCTAC GACGCGGGCG ATATCGACTT CAAGAACCAT GGCCTCAAGA ACGTCGCCAA GGTCGGCAAC TACAAGGGCC TGGTGTTCGC CACCTTCAAC AGCGATGCGC CGAGCCTGGA AGCCTGGCTA GGCGATTTCC GGTGGTATCT CGACATGATC CTCGACAACG AGGAAGGCGG CACCGAATTC ATTGGCGGCT GCATCAAGTC GGTGATCAGC GCGAACTGGA AGTTCGGGGT CGAGAACTTC ATCGGCGACG CTTACCACGC CGGCTGGACG CATGATTCGG GCACTCGGTC GATGAACAAC GGCCAGCCGT TCCCGCCGAT CGACATGGAT AATTCCTATC ACGCCAGCGT GAACGGCCAC GGCTGGGAAT TCGGCACCGA AGGCGTGGGC GACCTCTTCC TGCTCGGGCG CCCCAAGGTG ATGGACTATT ACAACAAGAT CCGCCCGAAG ATGGCGGAAC GCCTGGGCGA GATGCGCTCG AAGATCTTCG GTTCGGTCGC CTCGGCATCG ATCTTCCCCA ACGTCTCGTT CCTGCCGGGC ATTTCCACCT TCCGCCAGTG GCAACCCAAG GGGCCGATGC AGTTCGAATT GAAGACCTGG GTGATCGTCA ACAAGAACAT GCCCGACGAC ATCAAGGAGG AAGTGACCAA GGGCGTGATG CAGACCTTCG GCCCCGGTGG CACCTTCGAG ATGGATGACG GGGAAAACTG GGAGAACTGC ACCACCGTCA ACCGCGGCGT CGTCACCCGG CACGAGCGCC TGCACTATCG CTGCGGGATC GGCCGCCAGA TCGAACATGA TACCCTGCCG GGCATCGTCT ATCGCGGCCA GTACAACGAC GCCAACCAGC GCGGCTTCTA CCAGCGCTGG CTCGACATGA TGACCCATGA CGAATTCGGC AAGATGCCGG CACGGCCCGA ACCGCAGCTG GGCAATGTGG CCGAAACCCG CGACCTTCCC GGCCTGTTCG CGCTCTGA
|
Protein sequence | MRFERIGREP DYSRYMDLKE GWLDRRIFSD ADIYEEELYR IFARSWLFVA HESQIPSSGD FLTTHMGEDA VIVARQPDGS IRVMLNSCPH RGNKVCFADA GNTRRFVCNY HGWAFDTAGD LKGMHEEYCY DAGDIDFKNH GLKNVAKVGN YKGLVFATFN SDAPSLEAWL GDFRWYLDMI LDNEEGGTEF IGGCIKSVIS ANWKFGVENF IGDAYHAGWT HDSGTRSMNN GQPFPPIDMD NSYHASVNGH GWEFGTEGVG DLFLLGRPKV MDYYNKIRPK MAERLGEMRS KIFGSVASAS IFPNVSFLPG ISTFRQWQPK GPMQFELKTW VIVNKNMPDD IKEEVTKGVM QTFGPGGTFE MDDGENWENC TTVNRGVVTR HERLHYRCGI GRQIEHDTLP GIVYRGQYND ANQRGFYQRW LDMMTHDEFG KMPARPEPQL GNVAETRDLP GLFAL
|
| |