Gene Bphy_3290 details


Gene Information

Locus tag: Bphy_3290
Symbol:
ID: 6244721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 217678
End bp: 219219
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 62%
IMG OID: 642595080
Product: choline-sulfatase
Protein accession: YP_001859492
Protein GI: 186472150
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
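
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Bphy_3290.
length = gene_length(217678, 219219)
print(length)                  # 1542
print(protein_length(length))  # 513
```

Both results agree with the Gene Length and Protein Length fields listed in the record.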


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTTA ATACAAAGCA GAATATCCTT ATCTTGATGG CAGACCAGAT GACACCGTTC 
GCGTTGCGCG CGTATGGCAA TCAGGTCTCG CTGACACCGC GCATCGATGC GCTCGCGAAA
GAAGGCGTCG TGTTCGATTC GGCTTATTGC GCGAGCCCGT TGTGCGCGCC GGCCCGCTTT
TCGATGATGG CGGGCAAACG GCCCGCTGCA ATTGGTGCTT ACGATAACGC CGCCGAATTG
CCGGCGCAGA CGCTGACCTT CGCGCACTAT CTGCGTGCGG CGGGCTATCG AACGATCCTC
TCCGGCAAAA TGCATTTCTG CGGCCCTGAC CAGTTGCATG GCTTCGAAGA ACGCTTGACG
ACCGACATCT ATCCCGCCGA TTTCGGCTGG GTGCCGGACT GGGATCGCCC GGACGTGCGG
CCGAGCTGGT ATCACAACAT GAGTTCGGTG CTGGATGCGG GACCGTGCGT GCGCACGAAC
CAGCTGGATT TCGACGACGA AGTCACTTAC ACGACGCGCC AGAAGCTATA CGACATCGTG
CGTGAGCGCG CGGCGGGCGG CGACGCGCGG CCGTTTTGCG TGGTCGCGTC GCTGACGCAC
CCGCATGATC CTTACGCGAT ACCGCAGCAG TACTGGGACA TGTATCGCGA TGAAGAGATC
GACATGCCGT GCGTGACGCT CACACGCGAT GAAAGCGATC CCCACTCGAA GCGCCTGCGC
GACGTCTACG AAGCAGACCT CACGCCGCCC ACGGCGCAGC AGATCCGCGA TGCGCGGCAC
GCGTATTACG GCGCGCTATC GTATGTCGAT GCGCAATTCG GCGCGATTCT CGACACGCTC
AAAGCAACGG GACTCGCCGA CGACACGATC GTCATCGTCA CGTCGGATCA CGGCGAAATG
CTCGGCGAGC GCGGACTCTG GTACAAGATG ACCTGCTTCG AAGGCGGCGT GCGCGTGCCG
CTGATCGTGC ACGCGCCGAA GCAGTTTCGC GCGCACCGAG TGGCGGCGTC GGTGTCGCAT
GTCGACCTTT TGCCCACGCT GCTCGAAATG GCAACCGGCG CACGCCGTGC GGAGTGGCCG
GATACCATCG ACGGACGCAG CCTCGTGCCG CATCTGCGCA ACGACGGCGG GCACGACGAA
GCGATCGTCG AATACTTCGC CGAAGGTGCT ATTGCGCCGA TGGTGATGAT CCGGCGCGGT
CAGTACAAGT TCATTCACAC GCCCGTCGAT CCCGACCAGC TTTACGATCT CGCCAGCGAT
CCACGAGAAC GTGCCAATCT GGCGCAGGAT CCGGCAGCGG CCACGCTGGT CGAAGCCTTT
CGCAAAGAAG TCACGCAGCG CTGGGACATT CCCGCACTGC ATCAAGCGGT ACTCGCAAGC
CAGCGCCGTC GCCGCTTCCA CTTCGAAGCG ACGACACAAG GCGCGATCCG CTCATGGGAC
TGGCAGCCGT TCAACGATGC GAGCCAGCGC TATATGCGCA ATCACATCGA ACTCGACACG
CTGGAAGCGA TGGCGCGTTA TCCGCGCGTC GTCTCTCGCT GA
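
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and the GC percentage computed; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAGCCTTA ATACAAAGCA GAATATCCTT ATCTTGATGG CAGACCAGAT GACACCGTTC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base sample only
```

Passing the full sequence instead of the sample line gives a value that can be compared with the GC content field in the record.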
 
Protein sequence
MSLNTKQNIL ILMADQMTPF ALRAYGNQVS LTPRIDALAK EGVVFDSAYC ASPLCAPARF 
SMMAGKRPAA IGAYDNAAEL PAQTLTFAHY LRAAGYRTIL SGKMHFCGPD QLHGFEERLT
TDIYPADFGW VPDWDRPDVR PSWYHNMSSV LDAGPCVRTN QLDFDDEVTY TTRQKLYDIV
RERAAGGDAR PFCVVASLTH PHDPYAIPQQ YWDMYRDEEI DMPCVTLTRD ESDPHSKRLR
DVYEADLTPP TAQQIRDARH AYYGALSYVD AQFGAILDTL KATGLADDTI VIVTSDHGEM
LGERGLWYKM TCFEGGVRVP LIVHAPKQFR AHRVAASVSH VDLLPTLLEM ATGARRAEWP
DTIDGRSLVP HLRNDGGHDE AIVEYFAEGA IAPMVMIRRG QYKFIHTPVD PDQLYDLASD
PRERANLAQD PAAATLVEAF RKEVTQRWDI PALHQAVLAS QRRRRFHFEA TTQGAIRSWD
WQPFNDASQR YMRNHIELDT LEAMARYPRV VSR