Gene Bphy_5555 details


Gene Information

Locus tag: Bphy_5555
Symbol:
ID: 6247220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010625
Strand:
Start bp: 17165
End bp: 18850
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 58%
IMG OID: 642597274
Product: sulfatase
Protein accession: YP_001861677
Protein GI: 186470359
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
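
As a consistency check on the coordinates listed above, the following minimal Python sketch recomputes the gene and protein lengths. It assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 17165
end_bp = 18850

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon,
# which is not counted as part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1686 561
```

Both values agree with the Gene Length and Protein Length fields.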


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.708907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTTC CAACCTTTCT TGCCGGACTG GCAAGACCCG TAACACGCGC TCCTTTTGCA 
CTCGTTGCAG CCAGTCTGTT GAGTGTCTCG GTCAGCGGCC AGGCGCAGAC CGACTCGCAG
CAGACGCAAA AGCCCAATAT CCTGCTGATC GTGGGCGACG ACGTGGGCTG GGGCGATCTG
GGTGCTTACG GCGGAGGTGA GGGGCGAGGC ATTCCTGCTC CGAACCTGGA CAGGCTGGCC
GATGAAGGCA TGACTTTCTT CGACTTTTAC GGTCAGCCCA GTTGCACTCC TGGCCGCGCC
GCGCTTCAGA CCGGGCGAAA TCCTAATCGA AGCGGGATGA CAACCGTGGC GTTTCAGGGG
CAAGGAGGCG GGCTCCCGCA CGCTGAATGG ACACTGGCGT CAGTATTGAA GCTCGCACAC
TACAACACGT ATTTCACCGG CAAGTGGCAT CTCGGCGAGG CCGACTATGC GTTGCCGAAT
ACGCAGGGCT ACGACGACAT GAAGTATGTC GGCCTGTATC ACCTGAATGC GTACACGTAT
GCCGACCCGA AGTGGTTTCC GGACATGGAC CAGCAAACCC GGGACATGTT CGTCAAGGTG
ACGACAGGAA TGCTTTCAGG CAAGGCCGGC CAGAAGGCGC ATGAGGACTT CAAGGTGAAT
GGCCAATACC AGAACGAGCC CGAAAAAGGC GTTGTCGGCA TACCGTTCGT GGATGCATAC
ATCGAAAAGG CAGCACTCGA AGATATCGAC GACGCCGCGC AGCGTGGACA ACCGTTCTTC
ATCAATGTGA ACTTCATGAA AGTTCACCAG CCGAATCTTC CGCACCCGGA TTACATCGGC
AAATCGCTGT CGAAGTCCAA ATATGCGGAT TCGATCGTCG AGCTCGACGC ACGTGTCGGT
CACATCATGG ACAAGCTGCG TGAGAAAGGA CTCGACAAGA ACACCCTCGT CTTCTTCACG
ACCGACAATG GCGCATGGCA GGACGTGTAC CCTGACGCGG GATACACGCC GTTCCGAGGC
GCGAAGGGGA CCGACCGGGA AGGTGGCGCG CGGGTGCCGG CGATCGCCTG GTGGCCCGGG
AAGATCAAGC CGCATTCGAG GAACTTCGAC ATCGTCGGCG GACTCGATTG CATGGCGACA
TTTGCCGCAC TCGCAGGTGT CGATCTGCCG AAGAACGATC GCGAAGGCAA GCCGATTATT
TTCGACAGTT TCGACATGTC ACCGGTCCTG TTCGGCACCG GCAAGAGCAA GCGCAACTCG
TGGTTCTATT TCACCGAGAA CGAAATGACG CCAGGTGCTG TACGCGTCGG CCAATTCAAG
GCGGTGTTCA ATCTGCGTGG AGACGCGGGG GCGGATACCG GCGGGCTCGC TGTCGACTCA
AATCTCGGCT GGAAAGGCCC CGACAAGTAC GTTGCGACAG TTCCCCAGGT GTTCGATCTG
TACCAGGACC CGCAGGAGCG CTACGACATC TTCATGAACA ACTATACAGA GCACACGTGG
ACGCTTCCGA CGTTCGGCGC CGCAGTGAAA GAGCTGATGC AGTCGTACGT GAAATACCCT
CCGCGCAAGG CGCAAAGCGA AGCGTACTCA GGTCCGATTA CCCTCAGTCA GTACGAACGC
TTTAAATATA TCCGCGATGA ACTTCAGAAG AACGGCTTCA GCATTCCGAT GCCAAGCGGA
AACTGA
 
Protein sequence
MQLPTFLAGL ARPVTRAPFA LVAASLLSVS VSGQAQTDSQ QTQKPNILLI VGDDVGWGDL 
GAYGGGEGRG IPAPNLDRLA DEGMTFFDFY GQPSCTPGRA ALQTGRNPNR SGMTTVAFQG
QGGGLPHAEW TLASVLKLAH YNTYFTGKWH LGEADYALPN TQGYDDMKYV GLYHLNAYTY
ADPKWFPDMD QQTRDMFVKV TTGMLSGKAG QKAHEDFKVN GQYQNEPEKG VVGIPFVDAY
IEKAALEDID DAAQRGQPFF INVNFMKVHQ PNLPHPDYIG KSLSKSKYAD SIVELDARVG
HIMDKLREKG LDKNTLVFFT TDNGAWQDVY PDAGYTPFRG AKGTDREGGA RVPAIAWWPG
KIKPHSRNFD IVGGLDCMAT FAALAGVDLP KNDREGKPII FDSFDMSPVL FGTGKSKRNS
WFYFTENEMT PGAVRVGQFK AVFNLRGDAG ADTGGLAVDS NLGWKGPDKY VATVPQVFDL
YQDPQERYDI FMNNYTEHTW TLPTFGAAVK ELMQSYVKYP PRKAQSEAYS GPITLSQYER
FKYIRDELQK NGFSIPMPSG N
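
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It is a minimal example, not the database's own method: the `translate` helper and the codon table construction are written for illustration. The amino acid assignments used here are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGCAGCTTC CAACCTTTCT TGCCGGACTG GCAAGACCCG TAACACGCGC TCCTTTTGCA"
print(translate(first_row))  # prints: MQLPTFLAGLARPVTRAPFA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same helper should stop at the final TGA codon.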