Gene Saro_3830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3830 
Symbol 
ID5077978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp484047 
End bp484943 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content67% 
IMG OID640481553 
Productintradiol ring-cleavage dioxygenase 
Protein accessionYP_001166215 
Protein GI146276055 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID[TIGR02439] catechol 1,2-dioxygenase, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.154107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGCCA CCTTCGCCAG TTCCGATTCC GTGCAGAAGC TCTTCGATCG CGCCTGCGGT 
CTTGATTGCG CAGGCGGCAA TCCCCGCCTC AAGGCGATCA TGCGCGACCT TCTCCAGGCA
ACGGCCGACA TCATCGTCAA GCATGACGTG TCCGAAAGCG AGTTCTGGCA GGCGACCCGC
TATCTTGCCG ATGGCGCCGG CGAGATCGGC CTGATCGTCC CCGGCATCGG CCTCGAACAC
TTCCTCGATC TCTACATGGA CGCCAAGGAC GCCGAAGCCG GCCTCACCGG CGGAACCCCG
CGCACGATCG AAGGCCCGCT CTACGTCGCT GGTGCACCGC TGGTGGATGG CAGTGACGAA
GTGGACCTGA CTTCCGACCC CGACGATACC GACACGCTGC ACATGACCGG CACGATCACC
GGCCCCGATG GCGAGCCGGT CAAGGACGCG ATCCTCCACG TCTGGCACGC GAACAGCAAG
GGCTGGTATT CGCACTTCGA TCCCACGAGC GAGCAGACCC CGTTCAACAA CCGCCGCCGC
ATCCGCGTCC CCGCCGACGG TCGCTACGCC TTCCGCTCCA AGATGCCGCA TGGCTATTCC
GTGCCGCCGG GTGGCGCCAC CGACGTGCTG ATGCAGGCGC TCGGCCGCCA CGGCAATCGC
CCAGCGCACG TCCACTTCTT CGTCGAGGCG CCGGGCTACC GCACGCTGAC CACGCAGATC
AACTTCGGCG ACGACCCCTT CGCGGCCGAC GATTTCGCCT TCGGCACGCG AGAGGGCTTG
CTGCCGGTGC CGAGCCGCCA GGGCGATACC GCCCACATCG CGTTCGACTT CCAGCTCCAG
CGCGCCCGCT CGGAGGACGA GCAGCGGTTC TCGGAACGCA CCCGCGCCCA GGCCTGA
 
Protein sequence
MPATFASSDS VQKLFDRACG LDCAGGNPRL KAIMRDLLQA TADIIVKHDV SESEFWQATR 
YLADGAGEIG LIVPGIGLEH FLDLYMDAKD AEAGLTGGTP RTIEGPLYVA GAPLVDGSDE
VDLTSDPDDT DTLHMTGTIT GPDGEPVKDA ILHVWHANSK GWYSHFDPTS EQTPFNNRRR
IRVPADGRYA FRSKMPHGYS VPPGGATDVL MQALGRHGNR PAHVHFFVEA PGYRTLTTQI
NFGDDPFAAD DFAFGTREGL LPVPSRQGDT AHIAFDFQLQ RARSEDEQRF SERTRAQA