Gene Information |
Locus tag | Swit_2516 |
Symbol | |
ID | 5198192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 2792968 |
End bp | 2795328 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640582064 |
Product | sulfatase |
Protein accession | YP_001263013 |
Protein GI | 148555431 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.424858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGAC CGGGGCGGTC GATGCGGGCA GTGATGCGCA TGCTGGCGGG CGGATCGCTG CTCGCGGCGA ACCTGGCCTT CGCCGCCGCG CCGCCGCCCG TCGCGGGCGA TGCCGCGCGC CCCGTCGCCG ATCCGCCGGG CGCGCAGCAT TTCGCCGGGA TCACGCCCGC CACCTCCTCG GCGCCGTCCT GGCCGGAACA GCCCCGCGCT CCGAAGGGCG CGCCCAACGT CCTGCTGATC CTGACCGACG ACATCGGCTT CGGCGCGACC AGCACCTTCG GCGGCCCGGT CCCCACCCCG ACCTTCGACG CGCTGGCGAA GGAGGGGCTG CGCTACACCC AGTTCAACAC CACCGCGATC TGCTCGCCGA CCCGCGCGGC GCTGCTCACC GGGCGCAACC CGCACGCGGT GGGGATGGGC CATGTCGAGA ACACCTCGGC CGGCTACGAA GGCTATGATT CGGTCATCCC CAAATCCTCG GCCTTCATCT CGCGGATCCT GCGCCAGGGC GGCTACGCCA CCGCCGCCTT CGGCAAATGG CACCTGACCC CGCAATGGGA GCAGAGCCAG GCCGGCCCCT ATGATCGCTG GCCGCTGGGC CAGGGCTTCG AGAATTTCTA CGGCTTCCTC GCGTCCGACA ACAGCCAGTG GAACCCCACG CTGACGCAGG ACAACAGCTT CATCGACGTG GCGACGCCGC CCGGCTATCA TTTCGACGCC GACATGGCCG ACCATGCGAT CCGCTGGATC GCCGAGCGCA AGGCCAACGC GCCCGACAAG CCCTTCTTCG CCTATTATGC GCCGGGCACC GCGCACACGC CCAGCCACGC GCCCAGGGAA TGGCTCGACC GCTTCAAGGG CCGTTTCGAC AAGGGCTGGG ACGCGCTGCG CGAGGAGATC TTCGCGCGGC AGAAGGCGCT GGGCGTGATC CCCGCCGGCA CCGTGCTGAC CGCGCGCCCG AAGGAGCTGC CCGCCTGGAG CTCGCTCTCC GCCGACCAGC GCCGCCTCTA TGCGCGCTAC ATGGAGGCCT GGGCGGCGTC GATCGCCTAT ATGGATTCGC AGATCGGCCG GGTGATCCAG TCGCTCAAGG ACAGCGGCCA ATATGACAAC ACGCTGATCA TCTACATCCA GGGCGACAAC GGATCGAGCG CGGAGGGCGG CATGGACGGG CTGCTGTTCC AGCAGTCGCT GCTCAACAAC CATGCCGAGC AGCGCGCCTA CGCGCTCAGC CGGATCGACG ACATCGGCGG GCCCGACCTC TATCCGCTCT TTCCCGCCGG CTGGGGCTGG GCGACCGCCG CGCCGTTCAA ATATTACAAG CAGATCGCCT CCTATTTCGG CGGCACCCGC AACGGCATGG TGATGAGCTG GCCGAAGGGG ATCGCCGACA AGGGCGGCAT CCGCGGCCAA TATCATTTCG TCACCGACAT CATGCCGACC ATCCTGGAGG CGACCGGCGT CGCCCGGCCC GCGTCGGTCG ACGGGGTCGA GCAGCGTCCG CTCGACGGGG TGAGCATGGC GTACAGCTTC ACCGCCCCCG ACGCGCCGTC GCACCGGCGG ATGCAGGTGT ACGAGATGGT CGATTCGCGC GGCATCTATC TCGACGGCTG GTTCGCCTCG ATGCGGCCGA CCCGGCAGCC CTGGACCTTC GACAAGCCCG CCGATCCCGA CAAGACGCCG TGGGAGCTGT ACGACGTCCG CGCCGATTTC AGCCAGGCGC ACGACCTGGC GGCGCGCTAT CCCGATCGGC TGCGGACGAT GCAGCAGCTC TTCTGGGCGG AGGCCGGACG CAACCACATC 
CTGCCGATCC TCGAGCCGAC GATCCTGCCC ACCGGCCGCC CCTCGCTCGG CGCCGGCCGC ACCGGCTTCC GCTATCCGCA GGGGATCACC CGCATCCCCG AGGACAGCGC GCCGCACATC ATCGGCCGTT CCTACAGCGT GTCGACGACG ATCGACGTGG CGGGCGCGCC GAACGGCGTC CTCGTCGCGC AGGGCGGCCG CTTCGGCGGC TGGGCGCTCT ACTTCCGCGA CGGGCGGCTG ACCTACCACC AGAACGCGCT CGACCCGCGC CAGTATCGCG TCGTCAGCGA TCGCGCGATC GGGCCCGGGC GGCACAGCGT CGAAATGCTG TTCGAGGCCG ACAGCCATGC CCGCGGCGCG GGCGGCACGG TCAGCTTCAC GATCGACGGC CAGCCGGCGG GACAGGGCCA TGTGCCGCTG ACCCTCGGCG GGTGGATCTC GCACATCGAG GGGCTCGACG TCGGCGTCGA CACCGGCACG CCGGTCTCCG ACGAGTACCG ATCGCATGAC AGCGCGTTCA ACGGCCGGTT CGATGCACTG GAGATGAAAC TGATCGATTG A
|
Protein sequence | MARPGRSMRA VMRMLAGGSL LAANLAFAAA PPPVAGDAAR PVADPPGAQH FAGITPATSS APSWPEQPRA PKGAPNVLLI LTDDIGFGAT STFGGPVPTP TFDALAKEGL RYTQFNTTAI CSPTRAALLT GRNPHAVGMG HVENTSAGYE GYDSVIPKSS AFISRILRQG GYATAAFGKW HLTPQWEQSQ AGPYDRWPLG QGFENFYGFL ASDNSQWNPT LTQDNSFIDV ATPPGYHFDA DMADHAIRWI AERKANAPDK PFFAYYAPGT AHTPSHAPRE WLDRFKGRFD KGWDALREEI FARQKALGVI PAGTVLTARP KELPAWSSLS ADQRRLYARY MEAWAASIAY MDSQIGRVIQ SLKDSGQYDN TLIIYIQGDN GSSAEGGMDG LLFQQSLLNN HAEQRAYALS RIDDIGGPDL YPLFPAGWGW ATAAPFKYYK QIASYFGGTR NGMVMSWPKG IADKGGIRGQ YHFVTDIMPT ILEATGVARP ASVDGVEQRP LDGVSMAYSF TAPDAPSHRR MQVYEMVDSR GIYLDGWFAS MRPTRQPWTF DKPADPDKTP WELYDVRADF SQAHDLAARY PDRLRTMQQL FWAEAGRNHI LPILEPTILP TGRPSLGAGR TGFRYPQGIT RIPEDSAPHI IGRSYSVSTT IDVAGAPNGV LVAQGGRFGG WALYFRDGRL TYHQNALDPR QYRVVSDRAI GPGRHSVEML FEADSHARGA GGTVSFTIDG QPAGQGHVPL TLGGWISHIE GLDVGVDTGT PVSDEYRSHD SAFNGRFDAL EMKLID
|
| |
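The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon, which is consistent with the values listed above. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record for Swit_2516.
length_bp = gene_length(2792968, 2795328)
print(length_bp)                  # 2361
print(protein_length(length_bp))  # 786
```

Passing the Gene sequence field to `gc_percent` as a single string would allow the same kind of check against the GC content field. The function strips the spaces between the 10-base blocks before counting.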