Gene Information |
Locus tag | Swit_3043 |
Symbol | |
ID | 5198671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 3337971 |
End bp | 3339653 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640582592 |
Product | sulfatase |
Protein accession | YP_001263531 |
Protein GI | 148555949 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
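The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention explicitly, but the listed numbers fit both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3337971
end_bp = 3339653

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1683 560
```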
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0150299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0001034 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGAAGC GCCGGCTCCT CGGCGCGGCG CTCGCGCTGC TGGCGGGGAC CGGCCTGACC GCCGTCGCCG GCGCGCGATC GCCGGCGGCG CCGTCGCCCC AGCGCCCCAA CATCCTGCTG ATCGTCGCCG ACGATCTCGG CTATTCGGAC ATCGGCGCGT TCGGGGGCGA GATCGCGACA CCCAACCTCG ACCGGCTGGC GCAGGCCGGC GTCCGCTTCG CCGACTTCCA CGCGGCGCCG GCCTGTTCGC CGACGCGGGC GATGCTGCTG ACCGGCCGCG ACCATCATGC GGCCGGCCTC GGCACGATGG CCGAATTCAC CGACCCGGCG CAACGCGGGC GCCCCGGCTA TGAAGGCTAT CTGAAGCTGA AGATGCCGAC CATCGCCTCG GCCCTGTCCG CCAGGGGCTA TTATACCGCG ATGGCCGGGA AATGGCATCT CGGCTATGCC GAGAAGCAAT CGCCCAAGGC GCACGGCTTC GCGCGCTCCT TCGCCCTGCT CGACGGCGCC GGCAATCATT ATGGGATCGA CCAGACGGCC CAATGGCGAT CGGTCGGGAT CGGCGTCGGG ACGCAATATC GCGAGGATGG CAGGCTCACC ACCTTTCCCG AAGGGGCTTA TTCGAGCGAC CTGTTCACCG AAAAGCTGAT CGGCTACCTG ACCAGTCCGG CCCGGGCGCA TCGGCCCTTC TTCGCCTATC TGGCCTTCAC CGCACCACAT TGGCCACTCC AGGCCCCGGC GGACGTCGTC GCCAAATATT CCGGCAAATA TGATGACGGA CCGATGGCGT TGCGCGAACG GCGACTGAAG CGGATGAAGG AGCTTGGCAT CGTCCCGCCG GACGTGCGGC CCTTCCAGCC CCTGGCCGTC GAGGACTGGA CAACGCTGTC CGCCGAACGG CGACGCGTCG AAGCACGCAA GATGGAAATC TACGCCGCGA TGGTCGACCG GCTCGACCAG AATGTCGGGC GGCTGCTGGC CAGCCTGTCG CGATCGGGCG ACCTCGGGAA CACGATCGTC GTCTTCCTGT CGGACAACGG TCCCGACGGC GGCGGCGGAG CCCCCGCCCT GCACGACCCG CGGACGCAGG CGTCGCTGGG GATCGACAAC AGCCTGGAGA ACATGGGCCG CGCCCATTCC TTCCTGACCT ATGGCGCCGG CTGGGCGCAG GCGGGGTCCG CCCCCTTCAA CCGCTTCAAG GGCTATACGA CCGAGGGTGG CACCCGTGTG CCCGCCTTCA TCTCGGGAGC GGGGGTGACC TGGCATGGCA TCAGCCACGC GCTGACCCAC GTCACCGACA TGATGCCGAC CGCGCTCGCC CTGGCGGCGG GCCCGGCCAG CAGCGCAAGG AAACCGGCGA CCGAAGGCCG TTCGCTGGTG CCGCTACTGC GCGACGCGCG AATCGCGCAG GTTCGCCAGC CGGACGAGGC GATCGGCGAG GAACTGTTCT TCGGGCGTTC GCTGAGGGCG GGCCAGTGGA AGGCGGTCTA CCCCGCCCCG ACTCGTCCGC CGACCATGCT CAGCGACACC GACGGACGCT GGCACTTATA CGACCTGTCG GTGGATCCCG GCGAAACCCG CGACCTCGCC GCCGAGCACA CCGACATATT GGCCGGGCTG GTTCGGCACT GGCACGACTA TGCGCGCCGC AACGATGTCG TCCTGCATCC CGCCGCCGAC TGA
|
Protein sequence | MVKRRLLGAA LALLAGTGLT AVAGARSPAA PSPQRPNILL IVADDLGYSD IGAFGGEIAT PNLDRLAQAG VRFADFHAAP ACSPTRAMLL TGRDHHAAGL GTMAEFTDPA QRGRPGYEGY LKLKMPTIAS ALSARGYYTA MAGKWHLGYA EKQSPKAHGF ARSFALLDGA GNHYGIDQTA QWRSVGIGVG TQYREDGRLT TFPEGAYSSD LFTEKLIGYL TSPARAHRPF FAYLAFTAPH WPLQAPADVV AKYSGKYDDG PMALRERRLK RMKELGIVPP DVRPFQPLAV EDWTTLSAER RRVEARKMEI YAAMVDRLDQ NVGRLLASLS RSGDLGNTIV VFLSDNGPDG GGGAPALHDP RTQASLGIDN SLENMGRAHS FLTYGAGWAQ AGSAPFNRFK GYTTEGGTRV PAFISGAGVT WHGISHALTH VTDMMPTALA LAAGPASSAR KPATEGRSLV PLLRDARIAQ VRQPDEAIGE ELFFGRSLRA GQWKAVYPAP TRPPTMLSDT DGRWHLYDLS VDPGETRDLA AEHTDILAGL VRHWHDYARR NDVVLHPAAD
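Although the gene lies on the minus strand, the gene sequence above begins with ATG and ends with TGA, so it appears to be given in coding orientation already. The following minimal sketch shows how the listed protein could be re-derived from it. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid mapping in the usual TCAG ordering.
# Assumption: table 11 uses these same assignments for non-start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
head = "ATGGTGAAGC GCCGGCTCCT CGGCGCGGCG"
print(translate(head))  # MVKRRLLGAA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.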