Gene Saro_0534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0534 
Symbol 
ID3918664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp580304 
End bp582577 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content66% 
IMG OID640443264 
Productsulfatase 
Protein accessionYP_495815 
Protein GI87198558 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.966586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGTG CATTGGGTGC CAGCGCACTG GCGCTCATGG CATCGGCGGC CTTCGGGCAG 
GCGGTGGTGC CCGCCCCCGC AACTGTTCCC GCCTACCAGA TCAAAGCACC TGCCGGTGCG
CCCAATGTCG TCGTCATCCT GCTCGACGAC GTAGGTTTCG GGGCCGCTTC AACCTTCGGA
GGGCCGATCG AGACGCCGGC GCTTGGCCGG CTTGCCGCGG ATGGACTGCG CTACAACCGC
TTTCACACCA CCGGAATCTG CTCGCCCACC CGGGCATCGC TGCTGACCGG GCGCAATCCG
CACAGCACCG GCATCGGCGC GGTCGAGAAC TCGTCCGACG AACGCCCCGG CTACAGCGGC
TTCCACTCCA AGGACACGGC ATCCATTGCC ACTGTCCTGC GCCAAAACGG CTACAACACC
GCGGCATTCG GCAAGTGGCA CCAGGTGCCG GACTGGGAGG CGTCGCCGTC CGGGCCTTTC
GATCGCTGGC CGACCGGCGA AGGCTTCGAG CGGTTCTATG GCTTCATTGG CGGGGAGACC
GATCAGTTCG ATCCGTCGCT GTTCGAAGGC ACGACCCCCG TGATGCGGCC CGACGTGCCG
AATTATCACC TGACCGAGGA CCTCGCCGAC AAGTCGATCG CATGGCTACG CACGCAGCAT
TCGGTCACGC CCGACAAACC GTTCTTCCTC TACTTCGCGC CCGGGGCGAC GCATGCGCCG
CTCCAGGTGC CAAGGGGCTG GAGCGAGCGA TACAAGGGCA AGTTCGACCA GGGTTGGGAC
AAGGTCCGCG AGGAGACTTT CGTCCGCCAG AAAAGGCTCG GCGTCATTCC CGCTAACGCC
CGGCTCACTC CGCGCCCCGA TGGCCTGCCG GCCTGGGATA GCCTCACGCC GGACCAGAAG
CGCTTCGCGG CGCGCACGAT GGAAGTCTAC GCCGGGTTTC TTGCCCACAC CGACGCCCAG
GTCGGCAAGC TGCTCGACAG TCTCGCGGCC AATGGCGAGC GCCAGAACAC GATGGTCTTC
TACGTCTTCG GCGACAATGG AGCGAGCGGC GAGGGCGGTC TGTCGGGGAG CGCGAACTAT
TTCGCCAACA TCCAGGGGCT GCCCGAGACC GACCAGATTC GTGCCGCGCA TCTCGATGCG
CTTGGTGGCC CCGATGCCTA TGCCCACTAT CCCGCGGGAT GGGCCTGGGC GATGAACGCG
CCGCTGCCCT GGATGAAGAC CGTGGCGTCG CATCTGGGGG GGACGCGCAA CGCGATGGTC
TTCGACTGGC CGGGGCATGT GGCTGACAAG GGCGGCATCC GGACGCAGTT CAGCCACGTC
AACGACATCG TCCCGACGAT TCTCGAGGCT GCCGGAATCA CTGCTCCGTC GACTGTGGAC
GGCATCGCGC AGAAGCCGAT GGACGGCGTC AGCCTGCTCT ACAGCCTGAA GGACCCGAAA
GCGCCCGAAC GACACCTGAC GCAGTACTTC GAGGTCTTTG GCCATCGCGC GATCTACCAT
GACGGGTGGA TGGCCTCGGC GTTCCACAGC CGGTTGCCGT GGTCGGTCAT GGGTTTTGGC
GACAAGAAGT TCGAGGACGA TCGCTGGGCA CTCTACGATC TCGGAAAGGA CTTCTCGCAG
GCGCGCGACG TTGCTGATCG CAACCCCGCG AAGCTGGCCG ACCTGAAGGC GCTTTTCGAT
GCGGAAGCAG CGCGAAACCA GGTCCTGCCG CTGCGCAACA CCACGCTCGG GAACAACAAG
GTTCCAAGCA TCGCGGCCGG CCGCACCACG ATGACCTTCC ACGAAGGCGC GGTTGGCGTT
CCGGAAACGG CCCTGCCGCG CGCCATGAAC CGATCGTGGA GCGTCGATGC AGCTATCGAC
ATCGCTGATG GAGCCGAAGG CGTCGTCGCC ACGCTTGGCG GCCGTAGCGC CGGTTGGTCA
CTGTATCTGG ACAGGGGCGG CAAGCCGACG TTCTCCTACC GCGTCTTCGA CATAGAGGCC
GTGACGCTGC GCGCCGCGCA ATCGCTCGCA CCGGGCAAGC ACGCGCTGCG CTTCGACTTC
GACTATGCGG GGCCGGGCTA TGGCAAGGGG GCGCGCCTGC GCCTCATGGT CGATGGCGCG
GTGGTCGATA CGGGCGAGGT GAAGTCCAGT CCCACCGCAT TCTATACGAT CGACGAAAGC
TTCGATGTCG GCTTGGACCA CGGCTCGCCC GCCGGCTCCT ACCCGGCGGG GACGGCTCCG
GGCTTCGCGT TTCAAAAGGG CCGGATCGAG CAAGTGACCT TCAGCGCGCG CTGA
 
Protein sequence
MISALGASAL ALMASAAFGQ AVVPAPATVP AYQIKAPAGA PNVVVILLDD VGFGAASTFG 
GPIETPALGR LAADGLRYNR FHTTGICSPT RASLLTGRNP HSTGIGAVEN SSDERPGYSG
FHSKDTASIA TVLRQNGYNT AAFGKWHQVP DWEASPSGPF DRWPTGEGFE RFYGFIGGET
DQFDPSLFEG TTPVMRPDVP NYHLTEDLAD KSIAWLRTQH SVTPDKPFFL YFAPGATHAP
LQVPRGWSER YKGKFDQGWD KVREETFVRQ KRLGVIPANA RLTPRPDGLP AWDSLTPDQK
RFAARTMEVY AGFLAHTDAQ VGKLLDSLAA NGERQNTMVF YVFGDNGASG EGGLSGSANY
FANIQGLPET DQIRAAHLDA LGGPDAYAHY PAGWAWAMNA PLPWMKTVAS HLGGTRNAMV
FDWPGHVADK GGIRTQFSHV NDIVPTILEA AGITAPSTVD GIAQKPMDGV SLLYSLKDPK
APERHLTQYF EVFGHRAIYH DGWMASAFHS RLPWSVMGFG DKKFEDDRWA LYDLGKDFSQ
ARDVADRNPA KLADLKALFD AEAARNQVLP LRNTTLGNNK VPSIAAGRTT MTFHEGAVGV
PETALPRAMN RSWSVDAAID IADGAEGVVA TLGGRSAGWS LYLDRGGKPT FSYRVFDIEA
VTLRAAQSLA PGKHALRFDF DYAGPGYGKG ARLRLMVDGA VVDTGEVKSS PTAFYTIDES
FDVGLDHGSP AGSYPAGTAP GFAFQKGRIE QVTFSAR