Gene Information |
Locus tag | Saro_3479 |
Symbol | |
ID | 5077628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 80834 |
End bp | 82414 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640481203 |
Product | 4-cresol dehydrogenase (hydroxylating) |
Protein accession | YP_001165865 |
Protein GI | 146275705 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
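The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, although the listed values agree with them. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Saro_3479.
length_bp = gene_length(80834, 82414)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```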
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAGCA AGACCAAGAC TGCCGCCGCG AGCGCTCCCG AAGCACCTGC CGTGCTGCCC GCCGGCGTCA GCGCAACCGA CATGGCATCC GCCATTGCAG AATTCGTCGC GATCCTCGGG CCGCAGAACG TGCTGACCGA TGCCGATCAC ATCGCCCCCT ATACCAAGGT GATGATCGCG GAGAGCGAGG ACCTGCACCG CCCCTCGGCC GTGCTCTATG CTCGCGAGGT CGGCGAGATC CAGAAGATCC TGAAGGTCTG CAACGACTAC AAGGTGCCGA TCTGGACGAT CTCGACCGGG CGCAATTTCG GCTACGGATC TGCCGCGCCG CAAAGCGCCG GACAGGTCGT CCTCGATCTC AAGCACATGA ACCGCATTCT CGAGGTCGAC CCGGTGCTGT GCACCGCGCT GGTCGAACCG GGCGTGACCT ACCAGCAGCT CAAGGACTAT CTGGAAGAGC ATGACATCCC GCTGTGGCTG TCGTGCCCGG CGCCGTCGGC TATCGCCGGA CCGCTTGGCA ACACGGTGGA TCGCGGCGTC GGCTACACGC CCTATGGCGA ACACTTCATG ATGCAGTGCG GGATGGAAGT CGTCCTAGCG AACGGCGAAG TCCTGCGCAC CGGCATGGGC GGGGTCGAAG GGACCAGCGC CTGGCAGGTG TTCAAGTGGG GTTACGGACC CTATCTCGAC GGCATCTTCA CGCAGTCGAA CTATGGCATC GTGACCAAGA TGGGCATGTG GCTGATGCCA AAGCCGCCCG TCTACAAGCC GTTCTGCATC CGCTACGACA ACGACGAGGA CATCCACGAC ATCGTGGAGA CGCTGCGCCC GCTGCGCATC GCGAACGTCA TTCCCAACGC GATGGTGTTC GCCAACGTGA TGTGGGAAGC CGCCGCGCTG ATGCCGCGCA GCAAGTACTA TGACGGCACC GGCACCACCC CCGACAGCGT GCTTGAAGAG ATCAAGGCCA AGGAAGGCCT GGGCGCTTGG AACGTCTATG CCGCGCTTTA CGGCACCAAG GAGCAGGTCG ACGTCAACTG GCAGATCATC ACCGGCGCGA TCAAGGCCAG CGGCAAGGGC AAGATCATCA CCGAGGAAGA GGCCGGCGAC ACCCAGCCTT TCAACTATCG CGCCAAGCTG ATGCGCGGCG ACATGACGAT GCAGGAATTC GGCCTCTATC GCTGGCGCGG TGGCGGCGGA TCGATGTGGT TCGCGCCGGT TACCGCAGCC AAGGGCAGCG AAACCGTGGA GCAGACGCGT CTCGCCAAGG AAATCCTGGG CGAATACGGG CTCGATTATG TGGCCGAATA CATCGTCGGC ATGCGCGACA TGCACCACAT CATCGACGTG CTCTACGACC GCTCCGATCC GGAGGAGATG AAGCGCGCGC ACGAATGCTT CGGCAAGCTG CTGAGCGAGT TCGGCAAGCG TGGATATGCG GTCTATCGCG TCAACACCGC GTTCATGGAC CAGACGGCGG ACCTCTATGG CCCGGTCAAG CGCAAGGTCG ACCAGACGCT GAAGCGCGCG CTCGACCCGA ACGGCATCCT CGCGCCGGGC AAGTCCGGCA TCCGTATCTG A
Protein sequence | MPSKTKTAAA SAPEAPAVLP AGVSATDMAS AIAEFVAILG PQNVLTDADH IAPYTKVMIA ESEDLHRPSA VLYAREVGEI QKILKVCNDY KVPIWTISTG RNFGYGSAAP QSAGQVVLDL KHMNRILEVD PVLCTALVEP GVTYQQLKDY LEEHDIPLWL SCPAPSAIAG PLGNTVDRGV GYTPYGEHFM MQCGMEVVLA NGEVLRTGMG GVEGTSAWQV FKWGYGPYLD GIFTQSNYGI VTKMGMWLMP KPPVYKPFCI RYDNDEDIHD IVETLRPLRI ANVIPNAMVF ANVMWEAAAL MPRSKYYDGT GTTPDSVLEE IKAKEGLGAW NVYAALYGTK EQVDVNWQII TGAIKASGKG KIITEEEAGD TQPFNYRAKL MRGDMTMQEF GLYRWRGGGG SMWFAPVTAA KGSETVEQTR LAKEILGEYG LDYVAEYIVG MRDMHHIIDV LYDRSDPEEM KRAHECFGKL LSEFGKRGYA VYRVNTAFMD QTADLYGPVK RKVDQTLKRA LDPNGILAPG KSGIRI
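The relationship between the gene sequence, the translation table (11) and the protein sequence can be sketched in plain Python. This is an illustrative sketch, not the database's own pipeline: the helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for internal codons. Table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. Only the first and last 21 bases of the gene sequence are used below; running the full sequence through the same functions is left to the reader.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 21 bases of the gene sequence, copied from the record.
head = "ATGCCAAGCA AGACCAAGAC T"
tail = "AAGTCCGGCA TCCGTATCTG A"
print(translate(head))  # MPSKTKT
print(translate(tail))  # KSGIRI
```

The two outputs match the first seven and last six residues of the listed protein sequence.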