Gene Swit_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_1858 
Symbol 
ID5199037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp2077948 
End bp2080299 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content68% 
IMG OID640581403 
Productsulfatase 
Protein accessionYP_001262356 
Protein GI148554774 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.190386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCT GCATCGGCAA GCTGTTCCAT GGCACGGCCC TGGGCCTTGC CCTGTCCATC 
ACCGCGCAGG GCGCCGCCCA GGTCCGGACC GCGCCCGCCG TCTCCCAACC CGCGACGCTG
GAGAATTGGG CGAGGACCGT CAGGCCGGCC GAGCGCGGCG CCCCCAACGT GATCGTGATC
CTCACCGACG ATGTCGGCTT CGGCGCCGCC TCGACCTTCG GCGGCCCCGT CCCCACGCCG
ACGCTCCAGG CGCTGGCCAA CAGGGGCCTG CGCTACAACC GCTTCCATGT GACGGCGGTG
TGCGGGGCGA CCCGCGCAGC GCTGCTCACC GGGCGCAACC AGAACAACGT CAACATGGCC
GTGGTGCCGG AACTGCCCGC GGCGCCCGAC GGCTATAATA CGATCATCCC GAAATCGGCC
GGCACGATCG CGCAGCTCCT GCGCCACAAC GGCTATTCGA CGGCGCTGAT CGGCAAGGCC
AATGTCACGC CGATGTGGGA GACCGGCCCC GCCGGCCCGT TCGATCGCTG GCCCACCGGG
CTCGGCTTCG ATTATTATTA TGGCTTCATG ACCGCCGAGA CGGACGAATA TTCGCCGCCG
CTCTATGAGA ATACCCGGCC GGTCGACCCG CCTGCCCAGC CGGACTACAT CCTCGATCGC
GACCTCGCCG ACCATGCCAT AGGCTGGCTG CGCGAGCAGC ACGAACTGGG CTCGCAGCGC
CCCTTCTTCC TCTATTACGC GACCGGATCG ACCCATGCGC CATTGCAGGC GCCGGCCGAC
TGGATCGCGA AGTTCCGCGG CCAGTTCGAC CAGGGCTGGG ACAAGCTGCG CGAGGAGACG
CTCAGGCGCC AGATCGCCAC GGGGATCGTG CCGCGCGGGA CCAGGCTGGC GCCCCGCGCG
CCCGGCATCC CCGCCTGGGA CAGCATCGCC CCCGAGGATC GCAAGCTCTA TGCGCGCCAG
ATGGAGGTCT ATGCCGGCAT GCTCGCCTTC TCCGACCACC AGATCGGACG GGTGCTCGAC
GCCGTCCGCG CGATGGGCCA GGAGGACAAC ACGCTGGTCG TCTTCATCGA GGGCGACAAT
GGCGCGAGCG GCGAGGGTTC GCTGACCGGC GTCATCAACG CCTGGAACCC GGTGAACGGG
ATTGCCGAGG ACGCGGCGGA GAATCGCCGG CGGATCGACG AATGGGGCGG GCCCGACACC
CAGCCCCATT ATGCCGCCGG CTGGGCGGTC GCGATGAACA CGCCGTTTCC CTGGATGAAG
CAATATGCCT CGCACCTCGG CGCGACCCGC GCCGGCATGG TCCTGTCCTG GCCGAAGCGC
ATCGCGGCGA AGGGCGAGAT CCGCACCCAA TATACCCATG TGATCGACGT GCTGCCGACG
ATCCTCGCCG CCGCGAACAT CCAGCCGCCG GAAAGCCTGG ACGGCGAGCG CCAGCAGAAG
CTGGACGGAC TGGACATGTC CTATTCGTTC GACGCTCCGA AGGCGCCGTC GGAGCGGCGC
GTCCAATATT ACAACCTGCT CGACAATGCC GGCATCTACA AGGACGGCTG GCTCGCCTCC
ACGACGCCGA ACAGCGTCCC GTGGAATTTC ATGATGCAGA AGAGCGTTCC CTTCGCATCG
CGGAACTGGG AGCTCTACGA TCTCGACCGC GACTTCAGCC AGTCGACCGA CCTCGCCCGC
CGCTATCCGG CGAAGCTGGA GGAGATGAAG GCGCTGTACC GCGAGGAGGC GGCCAGGAAC
CACGTCCTGC CGGGCTTCAA TCCGACGACG ACCTTCCTGT GGGAGAAGAA CCATGCCGCG
CCGTCGAGCA CGACCTTCGG CGGGCCGGTG TCGCGCCTGC CCTGGGGGAT GGCGCCGGAC
GTCCTGAACC GGTCCTTCAC GATCCAGGCG CATGTCTCGG TTTCGGCGCC GGAGCCCGAT
GGCGCCCTGG TCACGCAGGG CGGCCGCTTC GGCGGCTATT GCCTGTGCGT CGAGCGCGGC
CTGCCGACCT TCGTCTACAA TGCCGGCGGC GCAGGGCTCT ACCAGATCAG CTCGACCGCG
CCGCTGTCGC CGGGCGCGCA CCAGATCGAC GCGGCCTTCG ACTATGACGG CGGCGGTCGC
GGCAAGGGCG GCACGGTCAC GCTCGAGATC GACGGCCGGA TCGTGGCGAC CGGCCGTGTC
GAACACACGA TCCCGATCAT GTTCGCGCTC GACGAGGACT TCAACATCGG CCGCGACAGC
GGCACGACGG TCAAGGCGGG CTATGCCCTG CCCTTCACCT TCAAGGGCGA CATCGCCTCG
GTGACCATCC AACTCGGCCA GCAGTTCGAG TCCGAACGGA CCAAGGCCGA ACGGGCGATG
TCCGGGGACT GA
 
Protein sequence
MTSCIGKLFH GTALGLALSI TAQGAAQVRT APAVSQPATL ENWARTVRPA ERGAPNVIVI 
LTDDVGFGAA STFGGPVPTP TLQALANRGL RYNRFHVTAV CGATRAALLT GRNQNNVNMA
VVPELPAAPD GYNTIIPKSA GTIAQLLRHN GYSTALIGKA NVTPMWETGP AGPFDRWPTG
LGFDYYYGFM TAETDEYSPP LYENTRPVDP PAQPDYILDR DLADHAIGWL REQHELGSQR
PFFLYYATGS THAPLQAPAD WIAKFRGQFD QGWDKLREET LRRQIATGIV PRGTRLAPRA
PGIPAWDSIA PEDRKLYARQ MEVYAGMLAF SDHQIGRVLD AVRAMGQEDN TLVVFIEGDN
GASGEGSLTG VINAWNPVNG IAEDAAENRR RIDEWGGPDT QPHYAAGWAV AMNTPFPWMK
QYASHLGATR AGMVLSWPKR IAAKGEIRTQ YTHVIDVLPT ILAAANIQPP ESLDGERQQK
LDGLDMSYSF DAPKAPSERR VQYYNLLDNA GIYKDGWLAS TTPNSVPWNF MMQKSVPFAS
RNWELYDLDR DFSQSTDLAR RYPAKLEEMK ALYREEAARN HVLPGFNPTT TFLWEKNHAA
PSSTTFGGPV SRLPWGMAPD VLNRSFTIQA HVSVSAPEPD GALVTQGGRF GGYCLCVERG
LPTFVYNAGG AGLYQISSTA PLSPGAHQID AAFDYDGGGR GKGGTVTLEI DGRIVATGRV
EHTIPIMFAL DEDFNIGRDS GTTVKAGYAL PFTFKGDIAS VTIQLGQQFE SERTKAERAM
SGD