Gene Information |
Locus tag | BURPS1106A_2018 |
Symbol | |
ID | 4901476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1981570 |
End bp | 1982466 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640135248 |
Product | GHMP kinase ATP-binding subunit |
Protein accession | YP_001066283 |
Protein GI | 126453654 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
| |
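The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The record does not state either convention. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1981570
END_BP = 1982466
GENE_LENGTH_BP = 897
PROTEIN_LENGTH_AA = 298


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1982466 - 1981570 + 1 gives 897 bp, and 897 / 3 - 1 gives 298 aa, which agrees with the Gene Length and Protein Length fields.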
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAGC CCATGCGTTC ACTCATCCCG GCCGCGCGAA CGGCGTCGGG CATCTCGTAC TGCTCGTTCG GCGAACTGCT GCAGGGCGTG CTGCTCGAGG ACGACGCCGA TTTTCTCGTC ACGCTGCCGA TCCAGAAATA CTCGGTGAGC ACGTTCGTGC CGGACCCCGG CAGCACCGAG ATCGTCACGC AGCCGAGCGG CAAGACGAAG GCCGCGGCGC TCGCGAAGGT GATTCTCGGG CGCTACGGGC ACAACGTCGG CGGGACGTTC TACATCGACT GCGACATCCC GATCGGCAAG GGGCTCAGCA GCTCGTCCGC CGACCTGCTC GCGACCGCGC GCGCGATCGA GGTCTACATG GGCCGCGAGC TGCCGCTCGG CGAGCTGTGC CGCGACATGA GCGGCATCGA GCCGACCGAC GGCGTGATGT TCGCGGAATC GGTCGTCTAC CTGCAGCGCA AGGGGGTGCT GTGGTCGCGG CTCGGGCGCC TGTCCGGCAT CCAGATCCTG TCGCTCGACG AGGGCGGCAC GATCGACACG CTCGAGTATC ACCGCCGCGC GCGCGCGCAT CGGCACAACG CCGAGCATCG CAGCGAGTTC AACGAACTGC TCGAGCGGAT CGTCGCGGCG TTCGGCGCGC GCGATCTCGA CGAGATCGGC AGGGTGTCGA CCCGCAGCGC GTACATCAAC CAGAAGATCA ATCCGAAGCG CCATCTGGCC TCGGTTCACG ACGTATGCCA GGCGACGCGG GGGCTCGGGC TCGTGACGGC GCACAGCGGC ACGTGCATCG GCATCCTGTA CGAAACCGGG CGGGCGGGGC ACCGCGAAAA CCTCGCGCGG GCAGCCGATG CGCTCTCGGG ATACGGAAAC ATCAAGGTCT ATGACACGAT CCAATAG
|
Protein sequence | MSEPMRSLIP AARTASGISY CSFGELLQGV LLEDDADFLV TLPIQKYSVS TFVPDPGSTE IVTQPSGKTK AAALAKVILG RYGHNVGGTF YIDCDIPIGK GLSSSSADLL ATARAIEVYM GRELPLGELC RDMSGIEPTD GVMFAESVVY LQRKGVLWSR LGRLSGIQIL SLDEGGTIDT LEYHRRARAH RHNAEHRSEF NELLERIVAA FGARDLDEIG RVSTRSAYIN QKINPKRHLA SVHDVCQATR GLGLVTAHSG TCIGILYETG RAGHRENLAR AADALSGYGN IKVYDTIQ
|
| |