Gene Swit_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_0381 
Symbol 
ID5198167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp404987 
End bp407323 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content71% 
IMG OID640579920 
Productsulfatase 
Protein accessionYP_001260889 
Protein GI148553307 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.586411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCG ATCCCAATAA GCTGTCGCGG CGCGAGGCGA TCGCCGCGGG CATCACCGGC 
TCGGCGGCGG TCGCGGCCGG CCTGACCGCC GCGCCGGCAC AGGCGAGGGA CGCCGCCGGG
TTCAAGGGCC GCATCGCGCC GGTGATGAGC GCCTCGACCC CGGATATCCG ACAAGAGGTC
GTGCCGGCGG CGGGCAGCCC CAATGTCCTG CTGATCATCC TCGACGATGT CGGCTTCGCC
GATCTGGGCT GCTATGGCGG CGAGATCCCG ACGCCGAACA TCGATCGCCT GGCTGCATCG
GGGATTCGCT ACACCAATTT CCGGACCACC GGCGTCTGTT CGGCGACCCG CGCCAGCGTG
ATGACCGGGC TCAACCCGCA CAGTGCCGGG ATCGGTTGGC TGACCTTCTC CGATGCGGGC
TATCCCGGCT ATCGCGGCGA CCTGGCCGAG GATGCCGAGA CGATGGCCGA GCGGTTCAGC
GACGCAGGCT ATTGCGTCTA TCATGTCGGC AAATGGCATG TGAACCTGGC GGACACCACC
AACGCCGCCG GCCCGACCCG CAACTGGCCC AGCCAGCGCG GCTATGCGCG CAGCTACTGG
TTCCAGGGCC ATTCGACCGA CTATTTCGCG CCCGCCCAGC TCTATGCCGG CAACGAGCGG
ATCACCCCGC CGGTCGACGG CTATTATGCG ACCGACGACT TCACCGACAA GGCGCTGGCC
TTCCTGCGTG ATCATCGCGC GCAGCGCGGC GACCGGCCCT TCCTGATGAC GCTGGCGCAT
CCCGGCGCCC ATTCGCCGCT CCAGGCGCGG CGCGAGGACA TCGCGCGCTT CAAGGGCGCC
TATGACGCGG GCTGGGACGT GCTGCGCGCC GCGCGGCTCG AACGGCAGAA GGCGATGGGG
CTGATCCCGG CCGACGCCGT CCTGCCGCCC GCCAACCCCG GCGTGCCGCG CTGGGATACG
CTCGATCCGG CCGCGCGCCG GGTGCAGGCG CGCTACATGG AGGTCTATGC CGCGATGATC
GCGCGGATCG ACGATGGGGT GGGGCGCATC CTCGACGCGC TCGACGCCTC GGGCGACCAT
GACAATACGA TCGTCGCGCT GATCTCCGAC AATGGCGGCG CGCCCGACGG GCGGGGCGGC
ACGCCCAACC TGCTGGCGAT GGTCAATGGC GGCGTCACTC CGGCCCAAGT GGCCGAGCGG
TTCGACGAGA TCGGCGGGCC GGACAGCTAT CCGATGTATT CGCTGGGCTG GGCGTCGGTG
TCGAACACGC CGTTCCGGCT CTACAAGCAC GACACCCATC TCGGCGGGGT CGCCGACCCG
CTGATCCTGA GCTGGCCCAA GGCGATCCCG GCGCGGGGCG AACTGCGCGG CCAGTTCCTG
CATGCGATCG ACCTGCTGCC GACCCTGATC GACGCGGCGG GCCTTTCCGC CACGTCCGGC
CGCGCGGGCG CGAAGCCGAT CGAGGGCCGC TCGGCCCGGC CGAGCTTCAC CGACCGCGCC
GCGCCCGATC CGCGCGACCG CCAATATTTC GAGATGGGCG GGCTCCGCGC GATGCGGCTC
GGCCGGTGGC GGATCGTGTC CAAGGGGCGG TTCGGCCTGC CCGGCGACGC GTGGGAGCTG
TACGACACCG CGACCGATCC CAACGAGACC CGCGACCTCG CCGCCGAGCG GCCCGACATC
GTCGAGCGGC TCGACCGCGC CTGGATGAAG GAGGCGCAGG CGCATCAGGT CTTCCCGATC
GACGATCGCT CGCTGCTCGA ACGCTCCTTC GCCGAGCTGT TCCGGGGCGG CGGCAAGGAT
CGTTGGTCGA TCATCCCGCC GATCGACCTG ATCCCCGAGG AATCCTCGCC CAAGCTGCTC
GGCCGCGATT TCGAGGTCGA GTTGACGCTG GCCGATGCCG GGCGGCAGGG CGTGCTGTTC
GCCCATGGCA ACCAGTTCCT CGGCGCCGTC GCCTTCGTCC GCGACGGGCG GGTGTGGTTC
GAGTTCCGCT GCGATCCCCA TCTGATCGCG CTCGACGCGC CCTGGCCCGC GAAGGCGCGC
AGCGTGCGCT TCGTCCAGCG CCTGTCGGCG CGTCCGCGCG TCGGGCGGCT GTCGATCGTC
ATCGACGGGC GCGAGGTGGC GGCGCTCGAC AGCGACCGGC TGCTGCTCGG CACGCCGATG
CAGGGGCTTC AGATCGGGCG CAACGGCGCG GTGCGCGCCA GCCCGCGCTA TGCCGCGCCC
TTCGCCTTCG ACGGGCGGAT CGAGCGCGTC GAGATCCGCA CCGACAACAA GCCCTATGAC
GCCGGCGAGA TCGCCGCCGC GAGCCGCGCC TATGCCGCGC CAGCCAGGAA GGACTAG
 
Protein sequence
MERDPNKLSR REAIAAGITG SAAVAAGLTA APAQARDAAG FKGRIAPVMS ASTPDIRQEV 
VPAAGSPNVL LIILDDVGFA DLGCYGGEIP TPNIDRLAAS GIRYTNFRTT GVCSATRASV
MTGLNPHSAG IGWLTFSDAG YPGYRGDLAE DAETMAERFS DAGYCVYHVG KWHVNLADTT
NAAGPTRNWP SQRGYARSYW FQGHSTDYFA PAQLYAGNER ITPPVDGYYA TDDFTDKALA
FLRDHRAQRG DRPFLMTLAH PGAHSPLQAR REDIARFKGA YDAGWDVLRA ARLERQKAMG
LIPADAVLPP ANPGVPRWDT LDPAARRVQA RYMEVYAAMI ARIDDGVGRI LDALDASGDH
DNTIVALISD NGGAPDGRGG TPNLLAMVNG GVTPAQVAER FDEIGGPDSY PMYSLGWASV
SNTPFRLYKH DTHLGGVADP LILSWPKAIP ARGELRGQFL HAIDLLPTLI DAAGLSATSG
RAGAKPIEGR SARPSFTDRA APDPRDRQYF EMGGLRAMRL GRWRIVSKGR FGLPGDAWEL
YDTATDPNET RDLAAERPDI VERLDRAWMK EAQAHQVFPI DDRSLLERSF AELFRGGGKD
RWSIIPPIDL IPEESSPKLL GRDFEVELTL ADAGRQGVLF AHGNQFLGAV AFVRDGRVWF
EFRCDPHLIA LDAPWPAKAR SVRFVQRLSA RPRVGRLSIV IDGREVAALD SDRLLLGTPM
QGLQIGRNGA VRASPRYAAP FAFDGRIERV EIRTDNKPYD AGEIAAASRA YAAPARKD