Gene Saro_3479 details

Gene Information

Locus tag: Saro_3479
Symbol:
ID: 5077628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 80834
End bp: 82414
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 63%
IMG OID: 640481203
Product: 4-cresol dehydrogenase (hydroxylating)
Protein accession: YP_001165865
Protein GI: 146275705
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
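
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 80834
end_bp = 82414
gene_length_bp = 1581
protein_length_aa = 526

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

coordinates_consistent = span == gene_length_bp
length_divisible_by_three = gene_length_bp % 3 == 0
protein_consistent = expected_protein_length == protein_length_aa

print(coordinates_consistent, length_divisible_by_three, protein_consistent)
```

Under these assumptions all three checks hold for the values listed here.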


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAGCA AGACCAAGAC TGCCGCCGCG AGCGCTCCCG AAGCACCTGC CGTGCTGCCC 
GCCGGCGTCA GCGCAACCGA CATGGCATCC GCCATTGCAG AATTCGTCGC GATCCTCGGG
CCGCAGAACG TGCTGACCGA TGCCGATCAC ATCGCCCCCT ATACCAAGGT GATGATCGCG
GAGAGCGAGG ACCTGCACCG CCCCTCGGCC GTGCTCTATG CTCGCGAGGT CGGCGAGATC
CAGAAGATCC TGAAGGTCTG CAACGACTAC AAGGTGCCGA TCTGGACGAT CTCGACCGGG
CGCAATTTCG GCTACGGATC TGCCGCGCCG CAAAGCGCCG GACAGGTCGT CCTCGATCTC
AAGCACATGA ACCGCATTCT CGAGGTCGAC CCGGTGCTGT GCACCGCGCT GGTCGAACCG
GGCGTGACCT ACCAGCAGCT CAAGGACTAT CTGGAAGAGC ATGACATCCC GCTGTGGCTG
TCGTGCCCGG CGCCGTCGGC TATCGCCGGA CCGCTTGGCA ACACGGTGGA TCGCGGCGTC
GGCTACACGC CCTATGGCGA ACACTTCATG ATGCAGTGCG GGATGGAAGT CGTCCTAGCG
AACGGCGAAG TCCTGCGCAC CGGCATGGGC GGGGTCGAAG GGACCAGCGC CTGGCAGGTG
TTCAAGTGGG GTTACGGACC CTATCTCGAC GGCATCTTCA CGCAGTCGAA CTATGGCATC
GTGACCAAGA TGGGCATGTG GCTGATGCCA AAGCCGCCCG TCTACAAGCC GTTCTGCATC
CGCTACGACA ACGACGAGGA CATCCACGAC ATCGTGGAGA CGCTGCGCCC GCTGCGCATC
GCGAACGTCA TTCCCAACGC GATGGTGTTC GCCAACGTGA TGTGGGAAGC CGCCGCGCTG
ATGCCGCGCA GCAAGTACTA TGACGGCACC GGCACCACCC CCGACAGCGT GCTTGAAGAG
ATCAAGGCCA AGGAAGGCCT GGGCGCTTGG AACGTCTATG CCGCGCTTTA CGGCACCAAG
GAGCAGGTCG ACGTCAACTG GCAGATCATC ACCGGCGCGA TCAAGGCCAG CGGCAAGGGC
AAGATCATCA CCGAGGAAGA GGCCGGCGAC ACCCAGCCTT TCAACTATCG CGCCAAGCTG
ATGCGCGGCG ACATGACGAT GCAGGAATTC GGCCTCTATC GCTGGCGCGG TGGCGGCGGA
TCGATGTGGT TCGCGCCGGT TACCGCAGCC AAGGGCAGCG AAACCGTGGA GCAGACGCGT
CTCGCCAAGG AAATCCTGGG CGAATACGGG CTCGATTATG TGGCCGAATA CATCGTCGGC
ATGCGCGACA TGCACCACAT CATCGACGTG CTCTACGACC GCTCCGATCC GGAGGAGATG
AAGCGCGCGC ACGAATGCTT CGGCAAGCTG CTGAGCGAGT TCGGCAAGCG TGGATATGCG
GTCTATCGCG TCAACACCGC GTTCATGGAC CAGACGGCGG ACCTCTATGG CCCGGTCAAG
CGCAAGGTCG ACCAGACGCT GAAGCGCGCG CTCGACCCGA ACGGCATCCT CGCGCCGGGC
AAGTCCGGCA TCCGTATCTG A
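
The gene sequence is listed in blocks of 10 bases separated by spaces, so it needs to be joined before any analysis. The following is a minimal sketch of how that might be done, with a simple GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation here (G plus C over total length) is an assumption about how the record's GC content figure was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last line of the gene sequence, as listed in the record.
last_line = "AAGTCCGGCA TCCGTATCTG A"
tail = clean_sequence(last_line)

# The listing ends with TGA, a stop codon in translation table 11.
ends_with_stop = tail.endswith("TGA")
print(len(tail), ends_with_stop)
```

Passing the full listing through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field.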
 
Protein sequence
MPSKTKTAAA SAPEAPAVLP AGVSATDMAS AIAEFVAILG PQNVLTDADH IAPYTKVMIA 
ESEDLHRPSA VLYAREVGEI QKILKVCNDY KVPIWTISTG RNFGYGSAAP QSAGQVVLDL
KHMNRILEVD PVLCTALVEP GVTYQQLKDY LEEHDIPLWL SCPAPSAIAG PLGNTVDRGV
GYTPYGEHFM MQCGMEVVLA NGEVLRTGMG GVEGTSAWQV FKWGYGPYLD GIFTQSNYGI
VTKMGMWLMP KPPVYKPFCI RYDNDEDIHD IVETLRPLRI ANVIPNAMVF ANVMWEAAAL
MPRSKYYDGT GTTPDSVLEE IKAKEGLGAW NVYAALYGTK EQVDVNWQII TGAIKASGKG
KIITEEEAGD TQPFNYRAKL MRGDMTMQEF GLYRWRGGGG SMWFAPVTAA KGSETVEQTR
LAKEILGEYG LDYVAEYIVG MRDMHHIIDV LYDRSDPEEM KRAHECFGKL LSEFGKRGYA
VYRVNTAFMD QTADLYGPVK RKVDQTLKRA LDPNGILAPG KSGIRI