Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1872 |
Symbol | |
ID | 3917093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1973414 |
End bp | 1974484 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444616 |
Product | Rieske (2Fe-2S) protein |
Protein accession | YP_497146 |
Protein GI | 87199889 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.171768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCG GCGAACCGGT TCCGAAGCTG TCGGCGAAAC CCGCCGCCAC GTATCTTCGC AACACCTGGT ACGTGGCGGG CTGGGCCAGC GATCTTGCCG GCGAGCCGCA GCAGCGCACG TTCCTGGAAG AGCCGGTGGC GCTCTTCCGC GACGGACACG GTGAGGCGAA GGCCATCGGC GGGCGCTGCC CGCACCGGTT CGCGCCGCTC GGCCATGGCA GCGTCGTCGA CGGGGCGCTG ATGTGCCCCT ACCACGGCCT GCGTTTCGAT GGGGATGGAC GCTGCGTCCA CAACCCGCAT CCCGGCGGAC ATCTTCCCGA TGCGCGGCAG CGGGTCTATC CGCTTGTCGA GCGGCATGCC TTGCTGTGGA TATGGATGGG CGATGCAGCA AAGGCTGATC CGGCATCGAT CCCGGACTTT TCGTGGCTTT CGGACCCCAG ATGGGAGGCC GTGCGCGGGG CCACGGTCGC CGAGGGTCAC TTCGAGCTCT ACAGCGACAA CATTCTCGAC CTCAGCCACG CCAACTTCGT CCACCCGGCG CTGGTCGCCA GCGCATTCAC CGAAGGCGAG CGCAAGTTCT GGCAGGACGG AGACAATGTC TTTGCCGAAT ACGTGCGGCT GAACGACGAG CTTTCCGTCG GCATTTCGGC GGTGATGGGG ACCGAGGGGC GGCCGCAGGA TTTCTACGGC ATGGTCAAGT GGCATGCGCC GGCCGTACTC TACTTCGATT TCCGCGCGGG CGAGCCGGGC ACGCCGCGCG AGCAATGCAC GCTGCTGCCA TCGCTCCATG CCTTCACGCC GGAAACCCCT GACACGACGC ATTACTTCTG GGCGACCGCG CGCGACTACA GGCTGGGCGA CGCGGAGTTC ACCGCCGGAA TGCGCGCCGC GCTCGAATTC GCGTTCGAGC AGGAAGACAT GCCGATCATC CGCGACAGCC ACCGGCTCAT GCGCGGCGAG GACTTCTGGG CGCTTCGCCC GCTGATCCTC GGTGGCGATG GTGGCGGGGT GCGGGCCCGG AGAATGCTGC AACGGCTGAT CGAGCGCGAG AGACAGCAGG ACGCTGCCTG A
|
Protein sequence | MSGGEPVPKL SAKPAATYLR NTWYVAGWAS DLAGEPQQRT FLEEPVALFR DGHGEAKAIG GRCPHRFAPL GHGSVVDGAL MCPYHGLRFD GDGRCVHNPH PGGHLPDARQ RVYPLVERHA LLWIWMGDAA KADPASIPDF SWLSDPRWEA VRGATVAEGH FELYSDNILD LSHANFVHPA LVASAFTEGE RKFWQDGDNV FAEYVRLNDE LSVGISAVMG TEGRPQDFYG MVKWHAPAVL YFDFRAGEPG TPREQCTLLP SLHAFTPETP DTTHYFWATA RDYRLGDAEF TAGMRAALEF AFEQEDMPII RDSHRLMRGE DFWALRPLIL GGDGGGVRAR RMLQRLIERE RQQDAA
|
| |