Gene Saro_1876 details


Gene Information

Locus tag: Saro_1876
Symbol:
ID: 3917097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1977248
End bp: 1978189
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 62%
IMG OID: 640444620
Product: intradiol ring-cleavage dioxygenase
Protein accession: YP_497150
Protein GI: 87199893
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3485] Protocatechuate 3,4-dioxygenase beta subunit
TIGRFAM ID:
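
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1977248, 1978189)   # 942
length_aa = protein_length(length_bp)       # 313
```

Both results agree with the Gene Length (942 bp) and Protein Length (313 aa) fields.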


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.103516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCATG AAATCCACGA TAGTGGCAAT GAAAACCATC ACGATCACGC CGACGAAGGC 
CTCCTCGCGG ACCTTCAACG CATGGAGCAA TTGCGGGTCG GGCGCCGCCG GGCATTGGCC
CTGTTCGGTT CGGCAAGCGG CAGCGCACTC CTGCTGGGAT GCGGCGGCAG CGATGGCTCG
TCTTCCGGCA CCACGACGAC GGTGACTTCG ACATCGACGT CCACCGCAAC GGCGACTGCA
ACTCCAACTC CCACGCCGAC CTCGACCAGC ACGTCATCGA GTTGCACGGT CCCTTCGAGC
GAAACCAATG GGCCATATCC GGCCGATGGC ACCAACACTT CCTCTGGCGT CACGTCGAAC
GCGCTTACCG CCACCGGCGT CGTCCGCAGC GACATCCGGT CCAGCTTCGT CGGTTCATCG
ACCGCGACGG CAACCGGCGT GACAATGACC TTCACCATCA CCGTCGTCGA CGTGAACAAC
GGCTGCGCGC CGCTGTCGGG ATATGCCATC TACATCTGGC ACTGCGACAA GGATGGGAAC
TACTCGCTCT ACAACCTGCC CAGCGAAAGC TATCTGCGCG GTGTCCAGGT GACGGACTCG
AACGGCCAGG TCACGTTCAC CACAATCGTC CCCGGCTGCT ACAACGGCCG CTATCCACAC
ATCCATTTCG AGGTCTTCTC GAGCCTCGCC AATGCCACCA GCGGCAATTA CGCGCGGCTC
ATCTCGCAGT TTGCCATTCC CGCCACGGTA TGCGCCGCGG TCTATGCCAC GTCGAACTAT
GCGACGAGCA GCACCAACTA CAACAACGGC AACAACTCGA CATCGACGGA CAACATCTTC
AGCGATGCGA CAAGCGCGCA GCTCGCGGTA ATGACGCCGA CGATGACCGG GTCGGTATCC
GGCGGCTACA CCGCAACGAC CACCATCGGC ATTTCAACCT GA
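
The gene sequence is printed in blocks of 10 bases, so it has to be stripped of whitespace before it can be analysed. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the sequence text is assumed to be pasted in as a plain string.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the first two blocks of the sequence above.
first_blocks = clean_sequence("ATGCCGCATG AAATCCACGA")
```

Applied to the full 942 bp sequence, `gc_content` should give a value that rounds to the figure reported in the record, assuming the record uses the same definition.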
 
Protein sequence
MPHEIHDSGN ENHHDHADEG LLADLQRMEQ LRVGRRRALA LFGSASGSAL LLGCGGSDGS 
SSGTTTTVTS TSTSTATATA TPTPTPTSTS TSSSCTVPSS ETNGPYPADG TNTSSGVTSN
ALTATGVVRS DIRSSFVGSS TATATGVTMT FTITVVDVNN GCAPLSGYAI YIWHCDKDGN
YSLYNLPSES YLRGVQVTDS NGQVTFTTIV PGCYNGRYPH IHFEVFSSLA NATSGNYARL
ISQFAIPATV CAAVYATSNY ATSSTNYNNG NNSTSTDNIF SDATSAQLAV MTPTMTGSVS
GGYTATTTIG IST
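
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the NCBI amino-acid string for the standard code, whose codon assignments translation table 11 shares (the two tables differ in permitted start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# Amino acids for codons enumerated in T, C, A, G order at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = translate("ATGCCGCATG AAATCCACGA TAGTGGCAAT")
```

Here `prefix` is `MPHEIHDSGN`, which matches the first block of the protein sequence above.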