Gene Information |
Locus tag | Saro_0534 |
Symbol | |
ID | 3918664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 580304 |
End bp | 582577 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443264 |
Product | sulfatase |
Protein accession | YP_495815 |
Protein GI | 87198558 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
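The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions agree with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 580304
END_BP = 582577
GENE_LENGTH_BP = 2274
PROTEIN_LENGTH_AA = 757


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from 580304 to 582577 covers 2274 bp, and 2274 bp corresponds to 758 codons, that is 757 residues plus one stop codon.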
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.966586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGTG CATTGGGTGC CAGCGCACTG GCGCTCATGG CATCGGCGGC CTTCGGGCAG GCGGTGGTGC CCGCCCCCGC AACTGTTCCC GCCTACCAGA TCAAAGCACC TGCCGGTGCG CCCAATGTCG TCGTCATCCT GCTCGACGAC GTAGGTTTCG GGGCCGCTTC AACCTTCGGA GGGCCGATCG AGACGCCGGC GCTTGGCCGG CTTGCCGCGG ATGGACTGCG CTACAACCGC TTTCACACCA CCGGAATCTG CTCGCCCACC CGGGCATCGC TGCTGACCGG GCGCAATCCG CACAGCACCG GCATCGGCGC GGTCGAGAAC TCGTCCGACG AACGCCCCGG CTACAGCGGC TTCCACTCCA AGGACACGGC ATCCATTGCC ACTGTCCTGC GCCAAAACGG CTACAACACC GCGGCATTCG GCAAGTGGCA CCAGGTGCCG GACTGGGAGG CGTCGCCGTC CGGGCCTTTC GATCGCTGGC CGACCGGCGA AGGCTTCGAG CGGTTCTATG GCTTCATTGG CGGGGAGACC GATCAGTTCG ATCCGTCGCT GTTCGAAGGC ACGACCCCCG TGATGCGGCC CGACGTGCCG AATTATCACC TGACCGAGGA CCTCGCCGAC AAGTCGATCG CATGGCTACG CACGCAGCAT TCGGTCACGC CCGACAAACC GTTCTTCCTC TACTTCGCGC CCGGGGCGAC GCATGCGCCG CTCCAGGTGC CAAGGGGCTG GAGCGAGCGA TACAAGGGCA AGTTCGACCA GGGTTGGGAC AAGGTCCGCG AGGAGACTTT CGTCCGCCAG AAAAGGCTCG GCGTCATTCC CGCTAACGCC CGGCTCACTC CGCGCCCCGA TGGCCTGCCG GCCTGGGATA GCCTCACGCC GGACCAGAAG CGCTTCGCGG CGCGCACGAT GGAAGTCTAC GCCGGGTTTC TTGCCCACAC CGACGCCCAG GTCGGCAAGC TGCTCGACAG TCTCGCGGCC AATGGCGAGC GCCAGAACAC GATGGTCTTC TACGTCTTCG GCGACAATGG AGCGAGCGGC GAGGGCGGTC TGTCGGGGAG CGCGAACTAT TTCGCCAACA TCCAGGGGCT GCCCGAGACC GACCAGATTC GTGCCGCGCA TCTCGATGCG CTTGGTGGCC CCGATGCCTA TGCCCACTAT CCCGCGGGAT GGGCCTGGGC GATGAACGCG CCGCTGCCCT GGATGAAGAC CGTGGCGTCG CATCTGGGGG GGACGCGCAA CGCGATGGTC TTCGACTGGC CGGGGCATGT GGCTGACAAG GGCGGCATCC GGACGCAGTT CAGCCACGTC AACGACATCG TCCCGACGAT TCTCGAGGCT GCCGGAATCA CTGCTCCGTC GACTGTGGAC GGCATCGCGC AGAAGCCGAT GGACGGCGTC AGCCTGCTCT ACAGCCTGAA GGACCCGAAA GCGCCCGAAC GACACCTGAC GCAGTACTTC GAGGTCTTTG GCCATCGCGC GATCTACCAT GACGGGTGGA TGGCCTCGGC GTTCCACAGC CGGTTGCCGT GGTCGGTCAT GGGTTTTGGC GACAAGAAGT TCGAGGACGA TCGCTGGGCA CTCTACGATC TCGGAAAGGA CTTCTCGCAG GCGCGCGACG TTGCTGATCG CAACCCCGCG AAGCTGGCCG ACCTGAAGGC GCTTTTCGAT GCGGAAGCAG CGCGAAACCA GGTCCTGCCG CTGCGCAACA CCACGCTCGG GAACAACAAG GTTCCAAGCA TCGCGGCCGG CCGCACCACG ATGACCTTCC ACGAAGGCGC GGTTGGCGTT 
CCGGAAACGG CCCTGCCGCG CGCCATGAAC CGATCGTGGA GCGTCGATGC AGCTATCGAC ATCGCTGATG GAGCCGAAGG CGTCGTCGCC ACGCTTGGCG GCCGTAGCGC CGGTTGGTCA CTGTATCTGG ACAGGGGCGG CAAGCCGACG TTCTCCTACC GCGTCTTCGA CATAGAGGCC GTGACGCTGC GCGCCGCGCA ATCGCTCGCA CCGGGCAAGC ACGCGCTGCG CTTCGACTTC GACTATGCGG GGCCGGGCTA TGGCAAGGGG GCGCGCCTGC GCCTCATGGT CGATGGCGCG GTGGTCGATA CGGGCGAGGT GAAGTCCAGT CCCACCGCAT TCTATACGAT CGACGAAAGC TTCGATGTCG GCTTGGACCA CGGCTCGCCC GCCGGCTCCT ACCCGGCGGG GACGGCTCCG GGCTTCGCGT TTCAAAAGGG CCGGATCGAG CAAGTGACCT TCAGCGCGCG CTGA
|
Protein sequence | MISALGASAL ALMASAAFGQ AVVPAPATVP AYQIKAPAGA PNVVVILLDD VGFGAASTFG GPIETPALGR LAADGLRYNR FHTTGICSPT RASLLTGRNP HSTGIGAVEN SSDERPGYSG FHSKDTASIA TVLRQNGYNT AAFGKWHQVP DWEASPSGPF DRWPTGEGFE RFYGFIGGET DQFDPSLFEG TTPVMRPDVP NYHLTEDLAD KSIAWLRTQH SVTPDKPFFL YFAPGATHAP LQVPRGWSER YKGKFDQGWD KVREETFVRQ KRLGVIPANA RLTPRPDGLP AWDSLTPDQK RFAARTMEVY AGFLAHTDAQ VGKLLDSLAA NGERQNTMVF YVFGDNGASG EGGLSGSANY FANIQGLPET DQIRAAHLDA LGGPDAYAHY PAGWAWAMNA PLPWMKTVAS HLGGTRNAMV FDWPGHVADK GGIRTQFSHV NDIVPTILEA AGITAPSTVD GIAQKPMDGV SLLYSLKDPK APERHLTQYF EVFGHRAIYH DGWMASAFHS RLPWSVMGFG DKKFEDDRWA LYDLGKDFSQ ARDVADRNPA KLADLKALFD AEAARNQVLP LRNTTLGNNK VPSIAAGRTT MTFHEGAVGV PETALPRAMN RSWSVDAAID IADGAEGVVA TLGGRSAGWS LYLDRGGKPT FSYRVFDIEA VTLRAAQSLA PGKHALRFDF DYAGPGYGKG ARLRLMVDGA VVDTGEVKSS PTAFYTIDES FDVGLDHGSP AGSYPAGTAP GFAFQKGRIE QVTFSAR
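The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes the sequence is read in frame from its first base, strips the spaces used to group bases in blocks of ten, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as initiators; because this gene starts with ATG, the sketch does not handle alternative start codons.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown in this record.
    print(translate("ATGATCAGTG CATTGGGTGC CAGCGCACTG"))
```

Applied to the first thirty bases, this gives the first ten residues of the listed protein, MISALGASAL. Applied to the final 24 bases (CAAGTGACCT TCAGCGCGCG CTGA), it gives QVTFSAR and stops at the TGA codon.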