Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_5555 |
Symbol | |
ID | 6247220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010625 |
Strand | + |
Start bp | 17165 |
End bp | 18850 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642597274 |
Product | sulfatase |
Protein accession | YP_001861677 |
Protein GI | 186470359 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.708907 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTTC CAACCTTTCT TGCCGGACTG GCAAGACCCG TAACACGCGC TCCTTTTGCA CTCGTTGCAG CCAGTCTGTT GAGTGTCTCG GTCAGCGGCC AGGCGCAGAC CGACTCGCAG CAGACGCAAA AGCCCAATAT CCTGCTGATC GTGGGCGACG ACGTGGGCTG GGGCGATCTG GGTGCTTACG GCGGAGGTGA GGGGCGAGGC ATTCCTGCTC CGAACCTGGA CAGGCTGGCC GATGAAGGCA TGACTTTCTT CGACTTTTAC GGTCAGCCCA GTTGCACTCC TGGCCGCGCC GCGCTTCAGA CCGGGCGAAA TCCTAATCGA AGCGGGATGA CAACCGTGGC GTTTCAGGGG CAAGGAGGCG GGCTCCCGCA CGCTGAATGG ACACTGGCGT CAGTATTGAA GCTCGCACAC TACAACACGT ATTTCACCGG CAAGTGGCAT CTCGGCGAGG CCGACTATGC GTTGCCGAAT ACGCAGGGCT ACGACGACAT GAAGTATGTC GGCCTGTATC ACCTGAATGC GTACACGTAT GCCGACCCGA AGTGGTTTCC GGACATGGAC CAGCAAACCC GGGACATGTT CGTCAAGGTG ACGACAGGAA TGCTTTCAGG CAAGGCCGGC CAGAAGGCGC ATGAGGACTT CAAGGTGAAT GGCCAATACC AGAACGAGCC CGAAAAAGGC GTTGTCGGCA TACCGTTCGT GGATGCATAC ATCGAAAAGG CAGCACTCGA AGATATCGAC GACGCCGCGC AGCGTGGACA ACCGTTCTTC ATCAATGTGA ACTTCATGAA AGTTCACCAG CCGAATCTTC CGCACCCGGA TTACATCGGC AAATCGCTGT CGAAGTCCAA ATATGCGGAT TCGATCGTCG AGCTCGACGC ACGTGTCGGT CACATCATGG ACAAGCTGCG TGAGAAAGGA CTCGACAAGA ACACCCTCGT CTTCTTCACG ACCGACAATG GCGCATGGCA GGACGTGTAC CCTGACGCGG GATACACGCC GTTCCGAGGC GCGAAGGGGA CCGACCGGGA AGGTGGCGCG CGGGTGCCGG CGATCGCCTG GTGGCCCGGG AAGATCAAGC CGCATTCGAG GAACTTCGAC ATCGTCGGCG GACTCGATTG CATGGCGACA TTTGCCGCAC TCGCAGGTGT CGATCTGCCG AAGAACGATC GCGAAGGCAA GCCGATTATT TTCGACAGTT TCGACATGTC ACCGGTCCTG TTCGGCACCG GCAAGAGCAA GCGCAACTCG TGGTTCTATT TCACCGAGAA CGAAATGACG CCAGGTGCTG TACGCGTCGG CCAATTCAAG GCGGTGTTCA ATCTGCGTGG AGACGCGGGG GCGGATACCG GCGGGCTCGC TGTCGACTCA AATCTCGGCT GGAAAGGCCC CGACAAGTAC GTTGCGACAG TTCCCCAGGT GTTCGATCTG TACCAGGACC CGCAGGAGCG CTACGACATC TTCATGAACA ACTATACAGA GCACACGTGG ACGCTTCCGA CGTTCGGCGC CGCAGTGAAA GAGCTGATGC AGTCGTACGT GAAATACCCT CCGCGCAAGG CGCAAAGCGA AGCGTACTCA GGTCCGATTA CCCTCAGTCA GTACGAACGC TTTAAATATA TCCGCGATGA ACTTCAGAAG AACGGCTTCA GCATTCCGAT GCCAAGCGGA AACTGA
|
Protein sequence | MQLPTFLAGL ARPVTRAPFA LVAASLLSVS VSGQAQTDSQ QTQKPNILLI VGDDVGWGDL GAYGGGEGRG IPAPNLDRLA DEGMTFFDFY GQPSCTPGRA ALQTGRNPNR SGMTTVAFQG QGGGLPHAEW TLASVLKLAH YNTYFTGKWH LGEADYALPN TQGYDDMKYV GLYHLNAYTY ADPKWFPDMD QQTRDMFVKV TTGMLSGKAG QKAHEDFKVN GQYQNEPEKG VVGIPFVDAY IEKAALEDID DAAQRGQPFF INVNFMKVHQ PNLPHPDYIG KSLSKSKYAD SIVELDARVG HIMDKLREKG LDKNTLVFFT TDNGAWQDVY PDAGYTPFRG AKGTDREGGA RVPAIAWWPG KIKPHSRNFD IVGGLDCMAT FAALAGVDLP KNDREGKPII FDSFDMSPVL FGTGKSKRNS WFYFTENEMT PGAVRVGQFK AVFNLRGDAG ADTGGLAVDS NLGWKGPDKY VATVPQVFDL YQDPQERYDI FMNNYTEHTW TLPTFGAAVK ELMQSYVKYP PRKAQSEAYS GPITLSQYER FKYIRDELQK NGFSIPMPSG N
|
| |