Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bmul_5440 |
Symbol | |
ID | 5770569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010087 |
Strand | + |
Start bp | 137228 |
End bp | 137839 |
Gene Length | 612 bp |
Protein Length | 203 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641319736 |
Product | histidinol-phosphate phosphatase family protein |
Protein accession | YP_001585402 |
Protein GI | 161522473 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases |
TIGRFAM ID | [TIGR00213] D,D-heptose 1,7-bisphosphate phosphatase [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR01656] histidinol-phosphate phosphatase family domain [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00690711 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCTGA AGCACGATCC GCGTCCGCGC GGCGCCGTGC TGCTCGACAA GGACGGCACG CTGCTCGACG ACGTGCCGTA CAACGTCGAT CCGGCCCGCA TGCGGCTCGC GCCGGGCGCC GCGCGTGCGC TGCGCTCGCT GGCGTCGACC GGCATGCCGA TCGCGGTCGT CTCGAACCAG CCGGGCGTCG CGCTCGGCCG CTTCACCGAA TCTCAGCTTG GCGCGGTCCG CCAGCGTCTT GCCGAACTGT TCGAAGAGAA CGGCGCGGCG CTGGCGGATT TTTTTTACTG CCCGCATCAC CCGCAGGGCA GCGTGCCGCG CTATGCATGC GACTGTCTGT GCCGCAAGCC GAGGCCCGGC ATGCTGCGCC GCGCGGTCGC CGCGCTCGGC GTCGAGGCCG GCGCGAGCTG GATGATCGGC GACATACTCG ACGACATCGA AGCCGGCCGC GCGGCGCAGT GCCGCACGAT CCTCGTCGAT CGCGGCTACG AGACGGAATG GCGGATCGAC GCGACACGCA CGCCGCACTT CGTCGTCGAC CGGCTCGATC TCGCGGCCGA CATCGTCGTG CGCGAAACCG CGCGCCGTCA CGGCTCGCGG GTGCGGCAAT GA
|
Protein sequence | MALKHDPRPR GAVLLDKDGT LLDDVPYNVD PARMRLAPGA ARALRSLAST GMPIAVVSNQ PGVALGRFTE SQLGAVRQRL AELFEENGAA LADFFYCPHH PQGSVPRYAC DCLCRKPRPG MLRRAVAALG VEAGASWMIG DILDDIEAGR AAQCRTILVD RGYETEWRID ATRTPHFVVD RLDLAADIVV RETARRHGSR VRQ
|
| |