Gene Swit_3043 details


Gene Information

Locus tag: Swit_3043
Symbol:
ID: 5198671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 3337971
End bp: 3339653
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 69%
IMG OID: 640582592
Product: sulfatase
Protein accession: YP_001263531
Protein GI: 148555949
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
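
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 3337971
end_bp = 3339653

# Inclusive interval length: both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1683
print(protein_length)  # 560
```

Both values agree with the Gene Length and Protein Length fields of the record.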


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0150299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0001034
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTGAAGC GCCGGCTCCT CGGCGCGGCG CTCGCGCTGC TGGCGGGGAC CGGCCTGACC 
GCCGTCGCCG GCGCGCGATC GCCGGCGGCG CCGTCGCCCC AGCGCCCCAA CATCCTGCTG
ATCGTCGCCG ACGATCTCGG CTATTCGGAC ATCGGCGCGT TCGGGGGCGA GATCGCGACA
CCCAACCTCG ACCGGCTGGC GCAGGCCGGC GTCCGCTTCG CCGACTTCCA CGCGGCGCCG
GCCTGTTCGC CGACGCGGGC GATGCTGCTG ACCGGCCGCG ACCATCATGC GGCCGGCCTC
GGCACGATGG CCGAATTCAC CGACCCGGCG CAACGCGGGC GCCCCGGCTA TGAAGGCTAT
CTGAAGCTGA AGATGCCGAC CATCGCCTCG GCCCTGTCCG CCAGGGGCTA TTATACCGCG
ATGGCCGGGA AATGGCATCT CGGCTATGCC GAGAAGCAAT CGCCCAAGGC GCACGGCTTC
GCGCGCTCCT TCGCCCTGCT CGACGGCGCC GGCAATCATT ATGGGATCGA CCAGACGGCC
CAATGGCGAT CGGTCGGGAT CGGCGTCGGG ACGCAATATC GCGAGGATGG CAGGCTCACC
ACCTTTCCCG AAGGGGCTTA TTCGAGCGAC CTGTTCACCG AAAAGCTGAT CGGCTACCTG
ACCAGTCCGG CCCGGGCGCA TCGGCCCTTC TTCGCCTATC TGGCCTTCAC CGCACCACAT
TGGCCACTCC AGGCCCCGGC GGACGTCGTC GCCAAATATT CCGGCAAATA TGATGACGGA
CCGATGGCGT TGCGCGAACG GCGACTGAAG CGGATGAAGG AGCTTGGCAT CGTCCCGCCG
GACGTGCGGC CCTTCCAGCC CCTGGCCGTC GAGGACTGGA CAACGCTGTC CGCCGAACGG
CGACGCGTCG AAGCACGCAA GATGGAAATC TACGCCGCGA TGGTCGACCG GCTCGACCAG
AATGTCGGGC GGCTGCTGGC CAGCCTGTCG CGATCGGGCG ACCTCGGGAA CACGATCGTC
GTCTTCCTGT CGGACAACGG TCCCGACGGC GGCGGCGGAG CCCCCGCCCT GCACGACCCG
CGGACGCAGG CGTCGCTGGG GATCGACAAC AGCCTGGAGA ACATGGGCCG CGCCCATTCC
TTCCTGACCT ATGGCGCCGG CTGGGCGCAG GCGGGGTCCG CCCCCTTCAA CCGCTTCAAG
GGCTATACGA CCGAGGGTGG CACCCGTGTG CCCGCCTTCA TCTCGGGAGC GGGGGTGACC
TGGCATGGCA TCAGCCACGC GCTGACCCAC GTCACCGACA TGATGCCGAC CGCGCTCGCC
CTGGCGGCGG GCCCGGCCAG CAGCGCAAGG AAACCGGCGA CCGAAGGCCG TTCGCTGGTG
CCGCTACTGC GCGACGCGCG AATCGCGCAG GTTCGCCAGC CGGACGAGGC GATCGGCGAG
GAACTGTTCT TCGGGCGTTC GCTGAGGGCG GGCCAGTGGA AGGCGGTCTA CCCCGCCCCG
ACTCGTCCGC CGACCATGCT CAGCGACACC GACGGACGCT GGCACTTATA CGACCTGTCG
GTGGATCCCG GCGAAACCCG CGACCTCGCC GCCGAGCACA CCGACATATT GGCCGGGCTG
GTTCGGCACT GGCACGACTA TGCGCGCCGC AACGATGTCG TCCTGCATCC CGCCGCCGAC
TGA
 
Protein sequence
MVKRRLLGAA LALLAGTGLT AVAGARSPAA PSPQRPNILL IVADDLGYSD IGAFGGEIAT 
PNLDRLAQAG VRFADFHAAP ACSPTRAMLL TGRDHHAAGL GTMAEFTDPA QRGRPGYEGY
LKLKMPTIAS ALSARGYYTA MAGKWHLGYA EKQSPKAHGF ARSFALLDGA GNHYGIDQTA
QWRSVGIGVG TQYREDGRLT TFPEGAYSSD LFTEKLIGYL TSPARAHRPF FAYLAFTAPH
WPLQAPADVV AKYSGKYDDG PMALRERRLK RMKELGIVPP DVRPFQPLAV EDWTTLSAER
RRVEARKMEI YAAMVDRLDQ NVGRLLASLS RSGDLGNTIV VFLSDNGPDG GGGAPALHDP
RTQASLGIDN SLENMGRAHS FLTYGAGWAQ AGSAPFNRFK GYTTEGGTRV PAFISGAGVT
WHGISHALTH VTDMMPTALA LAAGPASSAR KPATEGRSLV PLLRDARIAQ VRQPDEAIGE
ELFFGRSLRA GQWKAVYPAP TRPPTMLSDT DGRWHLYDLS VDPGETRDLA AEHTDILAGL
VRHWHDYARR NDVVLHPAAD
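
The gene and protein sequences above are related by translation table 11, and a minimal Python sketch can illustrate the relationship. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no start-codon handling is needed here. The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order. Translation table 11
# shares these assignments; only its set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_60 = "ATGGTGAAGC GCCGGCTCCT CGGCGCGGCG CTCGCGCTGC TGGCGGGGAC CGGCCTGACC"
print(translate(first_60))  # MVKRRLLGAALALLAGTGLT
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.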