Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1876 |
Symbol | |
ID | 3917097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1977248 |
End bp | 1978189 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640444620 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_497150 |
Protein GI | 87199893 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.103516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCATG AAATCCACGA TAGTGGCAAT GAAAACCATC ACGATCACGC CGACGAAGGC CTCCTCGCGG ACCTTCAACG CATGGAGCAA TTGCGGGTCG GGCGCCGCCG GGCATTGGCC CTGTTCGGTT CGGCAAGCGG CAGCGCACTC CTGCTGGGAT GCGGCGGCAG CGATGGCTCG TCTTCCGGCA CCACGACGAC GGTGACTTCG ACATCGACGT CCACCGCAAC GGCGACTGCA ACTCCAACTC CCACGCCGAC CTCGACCAGC ACGTCATCGA GTTGCACGGT CCCTTCGAGC GAAACCAATG GGCCATATCC GGCCGATGGC ACCAACACTT CCTCTGGCGT CACGTCGAAC GCGCTTACCG CCACCGGCGT CGTCCGCAGC GACATCCGGT CCAGCTTCGT CGGTTCATCG ACCGCGACGG CAACCGGCGT GACAATGACC TTCACCATCA CCGTCGTCGA CGTGAACAAC GGCTGCGCGC CGCTGTCGGG ATATGCCATC TACATCTGGC ACTGCGACAA GGATGGGAAC TACTCGCTCT ACAACCTGCC CAGCGAAAGC TATCTGCGCG GTGTCCAGGT GACGGACTCG AACGGCCAGG TCACGTTCAC CACAATCGTC CCCGGCTGCT ACAACGGCCG CTATCCACAC ATCCATTTCG AGGTCTTCTC GAGCCTCGCC AATGCCACCA GCGGCAATTA CGCGCGGCTC ATCTCGCAGT TTGCCATTCC CGCCACGGTA TGCGCCGCGG TCTATGCCAC GTCGAACTAT GCGACGAGCA GCACCAACTA CAACAACGGC AACAACTCGA CATCGACGGA CAACATCTTC AGCGATGCGA CAAGCGCGCA GCTCGCGGTA ATGACGCCGA CGATGACCGG GTCGGTATCC GGCGGCTACA CCGCAACGAC CACCATCGGC ATTTCAACCT GA
|
Protein sequence | MPHEIHDSGN ENHHDHADEG LLADLQRMEQ LRVGRRRALA LFGSASGSAL LLGCGGSDGS SSGTTTTVTS TSTSTATATA TPTPTPTSTS TSSSCTVPSS ETNGPYPADG TNTSSGVTSN ALTATGVVRS DIRSSFVGSS TATATGVTMT FTITVVDVNN GCAPLSGYAI YIWHCDKDGN YSLYNLPSES YLRGVQVTDS NGQVTFTTIV PGCYNGRYPH IHFEVFSSLA NATSGNYARL ISQFAIPATV CAAVYATSNY ATSSTNYNNG NNSTSTDNIF SDATSAQLAV MTPTMTGSVS GGYTATTTIG IST
|
| |