Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_3290 |
Symbol | |
ID | 6244721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010623 |
Strand | - |
Start bp | 217678 |
End bp | 219219 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642595080 |
Product | choline-sulfatase |
Protein accession | YP_001859492 |
Protein GI | 186472150 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTTA ATACAAAGCA GAATATCCTT ATCTTGATGG CAGACCAGAT GACACCGTTC GCGTTGCGCG CGTATGGCAA TCAGGTCTCG CTGACACCGC GCATCGATGC GCTCGCGAAA GAAGGCGTCG TGTTCGATTC GGCTTATTGC GCGAGCCCGT TGTGCGCGCC GGCCCGCTTT TCGATGATGG CGGGCAAACG GCCCGCTGCA ATTGGTGCTT ACGATAACGC CGCCGAATTG CCGGCGCAGA CGCTGACCTT CGCGCACTAT CTGCGTGCGG CGGGCTATCG AACGATCCTC TCCGGCAAAA TGCATTTCTG CGGCCCTGAC CAGTTGCATG GCTTCGAAGA ACGCTTGACG ACCGACATCT ATCCCGCCGA TTTCGGCTGG GTGCCGGACT GGGATCGCCC GGACGTGCGG CCGAGCTGGT ATCACAACAT GAGTTCGGTG CTGGATGCGG GACCGTGCGT GCGCACGAAC CAGCTGGATT TCGACGACGA AGTCACTTAC ACGACGCGCC AGAAGCTATA CGACATCGTG CGTGAGCGCG CGGCGGGCGG CGACGCGCGG CCGTTTTGCG TGGTCGCGTC GCTGACGCAC CCGCATGATC CTTACGCGAT ACCGCAGCAG TACTGGGACA TGTATCGCGA TGAAGAGATC GACATGCCGT GCGTGACGCT CACACGCGAT GAAAGCGATC CCCACTCGAA GCGCCTGCGC GACGTCTACG AAGCAGACCT CACGCCGCCC ACGGCGCAGC AGATCCGCGA TGCGCGGCAC GCGTATTACG GCGCGCTATC GTATGTCGAT GCGCAATTCG GCGCGATTCT CGACACGCTC AAAGCAACGG GACTCGCCGA CGACACGATC GTCATCGTCA CGTCGGATCA CGGCGAAATG CTCGGCGAGC GCGGACTCTG GTACAAGATG ACCTGCTTCG AAGGCGGCGT GCGCGTGCCG CTGATCGTGC ACGCGCCGAA GCAGTTTCGC GCGCACCGAG TGGCGGCGTC GGTGTCGCAT GTCGACCTTT TGCCCACGCT GCTCGAAATG GCAACCGGCG CACGCCGTGC GGAGTGGCCG GATACCATCG ACGGACGCAG CCTCGTGCCG CATCTGCGCA ACGACGGCGG GCACGACGAA GCGATCGTCG AATACTTCGC CGAAGGTGCT ATTGCGCCGA TGGTGATGAT CCGGCGCGGT CAGTACAAGT TCATTCACAC GCCCGTCGAT CCCGACCAGC TTTACGATCT CGCCAGCGAT CCACGAGAAC GTGCCAATCT GGCGCAGGAT CCGGCAGCGG CCACGCTGGT CGAAGCCTTT CGCAAAGAAG TCACGCAGCG CTGGGACATT CCCGCACTGC ATCAAGCGGT ACTCGCAAGC CAGCGCCGTC GCCGCTTCCA CTTCGAAGCG ACGACACAAG GCGCGATCCG CTCATGGGAC TGGCAGCCGT TCAACGATGC GAGCCAGCGC TATATGCGCA ATCACATCGA ACTCGACACG CTGGAAGCGA TGGCGCGTTA TCCGCGCGTC GTCTCTCGCT GA
|
Protein sequence | MSLNTKQNIL ILMADQMTPF ALRAYGNQVS LTPRIDALAK EGVVFDSAYC ASPLCAPARF SMMAGKRPAA IGAYDNAAEL PAQTLTFAHY LRAAGYRTIL SGKMHFCGPD QLHGFEERLT TDIYPADFGW VPDWDRPDVR PSWYHNMSSV LDAGPCVRTN QLDFDDEVTY TTRQKLYDIV RERAAGGDAR PFCVVASLTH PHDPYAIPQQ YWDMYRDEEI DMPCVTLTRD ESDPHSKRLR DVYEADLTPP TAQQIRDARH AYYGALSYVD AQFGAILDTL KATGLADDTI VIVTSDHGEM LGERGLWYKM TCFEGGVRVP LIVHAPKQFR AHRVAASVSH VDLLPTLLEM ATGARRAEWP DTIDGRSLVP HLRNDGGHDE AIVEYFAEGA IAPMVMIRRG QYKFIHTPVD PDQLYDLASD PRERANLAQD PAAATLVEAF RKEVTQRWDI PALHQAVLAS QRRRRFHFEA TTQGAIRSWD WQPFNDASQR YMRNHIELDT LEAMARYPRV VSR
|
| |