Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_2518 |
Symbol | |
ID | 5197038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 2797860 |
End bp | 2800157 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640582066 |
Product | sulfatase |
Protein accession | YP_001263015 |
Protein GI | 148555433 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.606572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCTGC CCTTCCTCTC CCGCCTCGCC ATGGGCGCCG CCGCCGCCGC CCTGATCGCC GCCGTACCGG TGCCCGCCGT CGCATCCGCG GGGCAGGGCG CCTGGCCGGC GCGGGTCACG GCGCCGCGCG GCGCGCCCAA CGTCCTGCTG ATCATGACCG ACGATGTCGG CTTCGGCGCG TCGAGCGCCT ATGGCGGGCC GATCGCGACG CCGGTGTTCG ACGCGCTGGC GAGGGGCGGC GCGCGCTACA ACAATTTCAA CACGACCGCG CTGTGCTCGC CGACCCGCGC CTCGTTGCTC ACCGGCCGCG ATCCACACAA CGTCAACATG GGCAACGTCA CCAACATGCC CACCGGCTTC GAAGGCTATA CCACCGCGAT CCCGCGATCG GCGGCGACGG TCGCCCGCAT CCTGCGCGAC GGCGGCTACA GCACCGCCAT GTTCGGCAAG AGCCATCTCA CCCCCGACTG GGAGCTGAGC GCCGACGGCC CGTTCGACCG CTGGCCGACC GGACTCGGCT TCGACTATTT CTACGGCTTC CTGTCGGCCG ACACCAACCA GACCGCGCCC AACCTCTACG AGAACATCCG GCCGATCGCC CCGCCCGACG AGCCCGGCTA TCTGCTCGAC ACCGACCTCG CCGACCGCGC GATCGGCTGG ATCTCGGCGC AGCATGCGGC AGCGCCCGGC AAGCCCTTCT TCGTCTATTA TGCGCCGGGC ACCGCCCATT CGCCGCACAC CGCGCCGGCC GAATGGCTGC GGCGCTATCG CGGCGCCTTC GACCAGGGGT GGGACAAGGT GCGCGAGGAG AGCTTCGCGC GGCAGAAGGC GCTCGGCATC ATCCCCGCCG ACGCCGCGCT CGCCGCCCGG CCGCCCGGCA TCCCCGCCTG GGACAGCCTC ACCCCCGACC AGAAGCGGCT GTCGGCGCGG ATGATGGAGG CCTATGCCGC AAGCCTCGCC TATTGCGACG CGCAGATCGG CCGGGTGATC GAGCATCTCC GCCGCACCGG CCAGCTCGCC GACACGATGA TCGTCTTCCT CCAGGGCGAC AATGGCGGCA GCGCCGAGGG CGGGCCGATG GGAATGATCT TCGAGCAGAC CGCCGTGCTG GGCAGCGAGG AGGACCCCGC CGACCAGATG CGCCGCATCG ACGACATCGG CACCGGCCGC GCCTACACCA TGTATCCGGC CGGCTGGGGC TGGGCGATGA ACGCGCCCTT CCCCTGGTAC AAGCAGGTCG CCAGCCATGC CGGCGGCACG CGCAACGGCC TGGTCATCGC CTGGCCCGGC CATGTCGCCG CGCCCGAGAC GGTGCGCGGC CAATATGCCC ATGTCTCCGA CATAGTGCCG ACGATCCTGT CCGCGACCGG CGTGGCCGCG CCGAAGACGG TCGACGGCGT CGCGCAGCAG CCGTTCGACG GGATCAGCCT CGCCTATACG CTCGACCGGC CGACCGCGCC GGCCCGCCGC CGCACGCAGG TCTACGAGAT GATGGAGAAT TTCGGCATCT ATCGCGACGG CTGGTTCGCC GGATCGACGC CCAAGCGCTA CGCTTGGCAG TTCGCGACCA AAGAGAGCCT GGCCGATCCG CAGAACCGCG ACCGCCATTG GGAGCTATAC GACCTGCGCA CCGACTTCTC CCAGACGCGC GACCTCGCCG CCGCCCAGCC CGCGAAGCTG AAGGCGTTGC AGGCGCTGTT CTGGAAGCAG GCGGCGCGCA ACCACATCCT GCCGATCCAC GACTATAAGC TCGGCGCCGA GGGCCGGCCG ACGCTCAACG CCGGGCGCGA CGTCTTCACC TACTCCGCGC CGGTCAGCCG CGTCCCCTAT GCCGCCGCGC CGCCGACGAC CGGCCGGAGC TTCACGATCG AGGCCGACGT GACGCTCCCT GCCGATGGGG GGCGGGGCGT GCTGCTGGCG GCCGGGGGAC GCTTCGGCGG CCATGCCTTC TTCCTCGACG ACGGCCGCCC GGCCTTCCAC GTCAACGCGG TCGGCCCCTA TCAATATGAG CTTATCGCCC GCGACAGGCC GCGTCCCGGC CGCCACCGCC TCGCCGCGCG CTTCGTCGCC GACGACGCGA AGCCGGGCAG CGGCGGCGCG GTGCTGCTGC TGGTCGACGG GGTCGAGGTC GCGCGCGGCC GCGCCGACCG CGTCTTCCAC GGCTGGCTGC AGAACACCGA GGGGTTCGAC ATCGGCGAGG ACACGCTGAC CCCGGTCAGC CCGCGCTACG ACGTCGCCGG CAGCCGCTTC ACCGGAACGA TCGAACAGGT CGTGGTCCGG CTGGACTCCG CGCCCTGA
|
Protein sequence | MSLPFLSRLA MGAAAAALIA AVPVPAVASA GQGAWPARVT APRGAPNVLL IMTDDVGFGA SSAYGGPIAT PVFDALARGG ARYNNFNTTA LCSPTRASLL TGRDPHNVNM GNVTNMPTGF EGYTTAIPRS AATVARILRD GGYSTAMFGK SHLTPDWELS ADGPFDRWPT GLGFDYFYGF LSADTNQTAP NLYENIRPIA PPDEPGYLLD TDLADRAIGW ISAQHAAAPG KPFFVYYAPG TAHSPHTAPA EWLRRYRGAF DQGWDKVREE SFARQKALGI IPADAALAAR PPGIPAWDSL TPDQKRLSAR MMEAYAASLA YCDAQIGRVI EHLRRTGQLA DTMIVFLQGD NGGSAEGGPM GMIFEQTAVL GSEEDPADQM RRIDDIGTGR AYTMYPAGWG WAMNAPFPWY KQVASHAGGT RNGLVIAWPG HVAAPETVRG QYAHVSDIVP TILSATGVAA PKTVDGVAQQ PFDGISLAYT LDRPTAPARR RTQVYEMMEN FGIYRDGWFA GSTPKRYAWQ FATKESLADP QNRDRHWELY DLRTDFSQTR DLAAAQPAKL KALQALFWKQ AARNHILPIH DYKLGAEGRP TLNAGRDVFT YSAPVSRVPY AAAPPTTGRS FTIEADVTLP ADGGRGVLLA AGGRFGGHAF FLDDGRPAFH VNAVGPYQYE LIARDRPRPG RHRLAARFVA DDAKPGSGGA VLLLVDGVEV ARGRADRVFH GWLQNTEGFD IGEDTLTPVS PRYDVAGSRF TGTIEQVVVR LDSAP
|
| |