Gene Bmul_3570 details


Gene Information

Locus tag: Bmul_3570
Symbol:
ID: 5770205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia multivorans ATCC 17616
Kingdom: Bacteria
Replicon accession: NC_010086
Strand:
Start bp: 508506
End bp: 510041
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 68%
IMG OID: 641317873
Product: choline-sulfatase
Protein accession: YP_001583546
Protein GI: 161520119
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
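
The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted as a residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 508506
end_bp = 510041

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not code for a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1536 511
```

Both results match the Gene Length and Protein Length fields.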


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACC CGACTCCCAA CATCCTGATC CTGATGGCCG ACCAGCTGAC GCCGTTCGCG 
CTGCCGGCAT ACGGTAACCG CGTGGCGCGC ACGCCGACGC TCGACCGGCT CGCCGCGCAA
GGCGTGGTGT TCGACGCCGC TTACTGCGCG AGCCCGCTCT GCGCGCCGTC GCGCTTCTCG
CTGCTGACCG GCAAGCTGCC GTCCGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG
GCGCAAACGC TGACGTTCGC GCACTATCTG CGCGCGGCCG GCTATCGGAC GATGCTGTCC
GGCAAGATGC ACTTCTGCGG GCCCGATCAG CTGCACGGCT TCGAGGAGCG GCTGACGACC
GACATCTATC CGGCCGATTT CGGCTGGGTG CCCGACTGGG ATCGCCCGAC CGAGCGGCCG
AGCTGGTATC ACAACATGAG CTCGGTGCTC GACGCGGGCC CGTGCGTGCG CACGAACCAG
CTCGACTTCG ACGACGAAGT GACGTTCGCC GCAAAGCAGA AGCTCTACGA CGTCGCGCGC
GAGCGCGCCG CGGGCCGCGA CGAACGGCCG TTCTGCATGG TCGTGTCGCT GACGCACCCG
CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACAGCGACGA CGAGATCGAC
ATGCCGGCCG TGCAGCTCGG CGCATCGGAC AGCGATCCGC ATTCGCAGCG GCTGCGCTTC
GTCTGCGAGA ACGACCGCAC GCCGCCGAGC GACGCGCAGA TCCGCGCGGC GCGGCGTGCC
TACTACGGCG CGACGTCGTA CGTCGACGCG CAGTTCGGCA GCGTGCTCGC GGCGCTCGAA
CAGTGCGGAT TCGCCGACGA TACGATCGTG ATCGTCACGT CCGATCACGG CGACATGCTC
GGCGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG
ATCGTCCATG CACCGCAGCG CTTCGACGCC GCGCGCGTGC GCGGTCCCGT GTCGCATCTC
GATCTACTGC CGACGCTCGT CGAACTCGCG AGCGCGACGC CGGAAGGCGG CTGGCCCGAC
GCGGTGGACG GCGCCAGTCT CGTCCCGCAT CTGCGCGGCA CGGCCGCACA CGATGTCGCG
CTCGGCGAAT ATCTGGCCGA AGGCGCGATC GCGCCGATCG TGATGATCCG GCGTGGCGAC
TGGAAGTACG TGCATTGCCC GGCCGATCCG GAGCAGCTGT ACAACCTGTC CGACGATCCG
CGCGAACTGA CGAATCTCGC GGGCGCGCCG GAGGCGGCCG ACGTGCTGGC CGCCTTCCGC
GCGGAAGCGG CACGGCGCTG GAACCTGCCC GAACTCGACC GGCAGGTGCG CGCGAGCCAG
CGTCGGCGCC GCTTTCATTA CGCGGCGACG ACGCAAGGGC GGATCCAGCC GTGGGACTGG
CAGCCTTTCA CCGATGCGAG CCAGCGCTAT ATGCGCAATC ACATCGAACT CGACACGCTC
GAAGCGATGG CGCGCTTTCC GCGCGTCGGG CACTGA
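
The GC content field can be recomputed from a sequence listed in 10-base blocks like the one above. The sketch below is one way to do that; `gc_content` is a hypothetical helper, not a function of any database tool, and it assumes the sequence contains only A, C, G, and T apart from whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# The first two blocks of the gene sequence, used only as a small example.
print(round(gc_content("ATGACGAACC CGACTCCCAA"), 1))  # 55.0
```

Passing the full gene sequence as one string gives a value to compare against the GC content field.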
 
Protein sequence
MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS 
LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAAGYRTMLS GKMHFCGPDQ LHGFEERLTT
DIYPADFGWV PDWDRPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR
ERAAGRDERP FCMVVSLTHP HDPYAITREY WDLYSDDEID MPAVQLGASD SDPHSQRLRF
VCENDRTPPS DAQIRAARRA YYGATSYVDA QFGSVLAALE QCGFADDTIV IVTSDHGDML
GERGLWYKMT FFEGGCRVPL IVHAPQRFDA ARVRGPVSHL DLLPTLVELA SATPEGGWPD
AVDGASLVPH LRGTAAHDVA LGEYLAEGAI APIVMIRRGD WKYVHCPADP EQLYNLSDDP
RELTNLAGAP EAADVLAAFR AEAARRWNLP ELDRQVRASQ RRRRFHYAAT TQGRIQPWDW
QPFTDASQRY MRNHIELDTL EAMARFPRVG H
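
The protein sequence follows from the gene sequence under translation table 11. The sketch below is a minimal illustration: it assumes the standard codon-to-amino-acid assignments, which table 11 shares with table 1, and it does not handle the alternative start codons of table 11, which this gene does not need because it starts with ATG. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Amino acids for all 64 codons, with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACGAACC CGACTCCCAA CATCCTGATC"))  # MTNPTPNILI
```

The final codons of the gene, GGG CAC TGA, translate to G and H followed by a stop, which matches the end of the protein sequence.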