Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3601 |
Symbol | |
ID | 5077750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 224846 |
End bp | 225853 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481325 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001165987 |
Protein GI | 146275827 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.022726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGGTC AGGAACTGAT CGACTGGCGC ACCGCCCCTG TCGAAGGCCG CGCATCCGTG GAGGAACGCA ATCTCGACAT CGGCTTCCCT CTGGGCTGGT ACGCGATCGA CCTTTCGGCG AACCTCGCGG TCGGCGAAGT GCGCCCGCTG CGCTACTTCT CGAAGGACCT CGCCATGTGG CGCGGCGAGG ACGGCCAGGT TCGCGTGATC GACGCCTATT GCAAGCATCT GGGCGCGCAC ATGGGCCACG GCGGCAAGGT TCACGGCAAC CTGCTGGAAT GTCCGTTCCA CGCCTGGCGC TATGACGGCG AGGAGGGGAC CGTGAAGGAC ATCCCCTATT CGAAGTCGAT CCCGCCGCAG GTGAAGCGCA AGTGCACCCG TACCTGGCAC GTGACCGAGG CCAACCGCTG GATCTGGCTG TGGTATCATC CCGAGGACGT GGCCCCGCTA TTCGAGGTGG TGCACCTGCC CGAGGCGACC GATCCCGAGT GGACCGATTA CGACATCTAC GAATGGAACG TCTACGGTTC GATCCAGAAC ATGGCCGAGA ACGGCGTGGA CGTGGCGCAC TTCAAGTACA TCCACGGCAC CGCCAACGTG CCGCTGGGCG ATCTGCGCTG GGGCGACTGG GGCCGCGGCG CCGACGTGAA GGCCAAGATG GGCACGCCGT GGGGCGAGGT CGACGGCCAG ATCAGCTACG ACACGATGGG GCCGGGGCAG AGCTGGACGC GCTTTACCGG CATTTCCGAA ACGCTGCTCG TGGCCTGCAT CACGCCGGTC GAGCTTGACC ATGTCCATGT GCGCTTCTGC TTCACCCAGC CGCGTTCGCA GGCCGAGGGC GAGCGGGCGG GCGTCGCCAA GGCGATCATC CGCGACATCT GCAAGCAGTT CGACCAGGAC AAGATCATCT GGGACCGCCA GAAGTTCGAG CCCAACGCGC TGATCTGCGA AGGCGACGGT CCGATCGCGC AGTTCCGCAA GTACTACTCG CGCTACTACG TCAAGTAA
|
Protein sequence | MRGQELIDWR TAPVEGRASV EERNLDIGFP LGWYAIDLSA NLAVGEVRPL RYFSKDLAMW RGEDGQVRVI DAYCKHLGAH MGHGGKVHGN LLECPFHAWR YDGEEGTVKD IPYSKSIPPQ VKRKCTRTWH VTEANRWIWL WYHPEDVAPL FEVVHLPEAT DPEWTDYDIY EWNVYGSIQN MAENGVDVAH FKYIHGTANV PLGDLRWGDW GRGADVKAKM GTPWGEVDGQ ISYDTMGPGQ SWTRFTGISE TLLVACITPV ELDHVHVRFC FTQPRSQAEG ERAGVAKAII RDICKQFDQD KIIWDRQKFE PNALICEGDG PIAQFRKYYS RYYVK
|
| |