Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3830 |
Symbol | |
ID | 5077978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 484047 |
End bp | 484943 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481553 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_001166215 |
Protein GI | 146276055 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR02439] catechol 1,2-dioxygenase, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.154107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGCCA CCTTCGCCAG TTCCGATTCC GTGCAGAAGC TCTTCGATCG CGCCTGCGGT CTTGATTGCG CAGGCGGCAA TCCCCGCCTC AAGGCGATCA TGCGCGACCT TCTCCAGGCA ACGGCCGACA TCATCGTCAA GCATGACGTG TCCGAAAGCG AGTTCTGGCA GGCGACCCGC TATCTTGCCG ATGGCGCCGG CGAGATCGGC CTGATCGTCC CCGGCATCGG CCTCGAACAC TTCCTCGATC TCTACATGGA CGCCAAGGAC GCCGAAGCCG GCCTCACCGG CGGAACCCCG CGCACGATCG AAGGCCCGCT CTACGTCGCT GGTGCACCGC TGGTGGATGG CAGTGACGAA GTGGACCTGA CTTCCGACCC CGACGATACC GACACGCTGC ACATGACCGG CACGATCACC GGCCCCGATG GCGAGCCGGT CAAGGACGCG ATCCTCCACG TCTGGCACGC GAACAGCAAG GGCTGGTATT CGCACTTCGA TCCCACGAGC GAGCAGACCC CGTTCAACAA CCGCCGCCGC ATCCGCGTCC CCGCCGACGG TCGCTACGCC TTCCGCTCCA AGATGCCGCA TGGCTATTCC GTGCCGCCGG GTGGCGCCAC CGACGTGCTG ATGCAGGCGC TCGGCCGCCA CGGCAATCGC CCAGCGCACG TCCACTTCTT CGTCGAGGCG CCGGGCTACC GCACGCTGAC CACGCAGATC AACTTCGGCG ACGACCCCTT CGCGGCCGAC GATTTCGCCT TCGGCACGCG AGAGGGCTTG CTGCCGGTGC CGAGCCGCCA GGGCGATACC GCCCACATCG CGTTCGACTT CCAGCTCCAG CGCGCCCGCT CGGAGGACGA GCAGCGGTTC TCGGAACGCA CCCGCGCCCA GGCCTGA
|
Protein sequence | MPATFASSDS VQKLFDRACG LDCAGGNPRL KAIMRDLLQA TADIIVKHDV SESEFWQATR YLADGAGEIG LIVPGIGLEH FLDLYMDAKD AEAGLTGGTP RTIEGPLYVA GAPLVDGSDE VDLTSDPDDT DTLHMTGTIT GPDGEPVKDA ILHVWHANSK GWYSHFDPTS EQTPFNNRRR IRVPADGRYA FRSKMPHGYS VPPGGATDVL MQALGRHGNR PAHVHFFVEA PGYRTLTTQI NFGDDPFAAD DFAFGTREGL LPVPSRQGDT AHIAFDFQLQ RARSEDEQRF SERTRAQA
|
| |