Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bmul_3570 |
Symbol | |
ID | 5770205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010086 |
Strand | + |
Start bp | 508506 |
End bp | 510041 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641317873 |
Product | choline-sulfatase |
Protein accession | YP_001583546 |
Protein GI | 161520119 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC CGACTCCCAA CATCCTGATC CTGATGGCCG ACCAGCTGAC GCCGTTCGCG CTGCCGGCAT ACGGTAACCG CGTGGCGCGC ACGCCGACGC TCGACCGGCT CGCCGCGCAA GGCGTGGTGT TCGACGCCGC TTACTGCGCG AGCCCGCTCT GCGCGCCGTC GCGCTTCTCG CTGCTGACCG GCAAGCTGCC GTCCGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACGC TGACGTTCGC GCACTATCTG CGCGCGGCCG GCTATCGGAC GATGCTGTCC GGCAAGATGC ACTTCTGCGG GCCCGATCAG CTGCACGGCT TCGAGGAGCG GCTGACGACC GACATCTATC CGGCCGATTT CGGCTGGGTG CCCGACTGGG ATCGCCCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG CTCGGTGCTC GACGCGGGCC CGTGCGTGCG CACGAACCAG CTCGACTTCG ACGACGAAGT GACGTTCGCC GCAAAGCAGA AGCTCTACGA CGTCGCGCGC GAGCGCGCCG CGGGCCGCGA CGAACGGCCG TTCTGCATGG TCGTGTCGCT GACGCACCCG CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACAGCGACGA CGAGATCGAC ATGCCGGCCG TGCAGCTCGG CGCATCGGAC AGCGATCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAGA ACGACCGCAC GCCGCCGAGC GACGCGCAGA TCCGCGCGGC GCGGCGTGCC TACTACGGCG CGACGTCGTA CGTCGACGCG CAGTTCGGCA GCGTGCTCGC GGCGCTCGAA CAGTGCGGAT TCGCCGACGA TACGATCGTG ATCGTCACGT CCGATCACGG CGACATGCTC GGCGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTCCATG CACCGCAGCG CTTCGACGCC GCGCGCGTGC GCGGTCCCGT GTCGCATCTC GATCTACTGC CGACGCTCGT CGAACTCGCG AGCGCGACGC CGGAAGGCGG CTGGCCCGAC GCGGTGGACG GCGCCAGTCT CGTCCCGCAT CTGCGCGGCA CGGCCGCACA CGATGTCGCG CTCGGCGAAT ATCTGGCCGA AGGCGCGATC GCGCCGATCG TGATGATCCG GCGTGGCGAC TGGAAGTACG TGCATTGCCC GGCCGATCCG GAGCAGCTGT ACAACCTGTC CGACGATCCG CGCGAACTGA CGAATCTCGC GGGCGCGCCG GAGGCGGCCG ACGTGCTGGC CGCCTTCCGC GCGGAAGCGG CACGGCGCTG GAACCTGCCC GAACTCGACC GGCAGGTGCG CGCGAGCCAG CGTCGGCGCC GCTTTCATTA CGCGGCGACG ACGCAAGGGC GGATCCAGCC GTGGGACTGG CAGCCTTTCA CCGATGCGAG CCAGCGCTAT ATGCGCAATC ACATCGAACT CGACACGCTC GAAGCGATGG CGCGCTTTCC GCGCGTCGGG CACTGA
|
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAAGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDRPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR ERAAGRDERP FCMVVSLTHP HDPYAITREY WDLYSDDEID MPAVQLGASD SDPHSQRLRF VCENDRTPPS DAQIRAARRA YYGATSYVDA QFGSVLAALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPQRFDA ARVRGPVSHL DLLPTLVELA SATPEGGWPD AVDGASLVPH LRGTAAHDVA LGEYLAEGAI APIVMIRRGD WKYVHCPADP EQLYNLSDDP RELTNLAGAP EAADVLAAFR AEAARRWNLP ELDRQVRASQ RRRRFHYAAT TQGRIQPWDW QPFTDASQRY MRNHIELDTL EAMARFPRVG H
|
| |