Gene Information |
Locus tag | BURPS668_1998 |
Symbol | |
ID | 4881993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1976127 |
End bp | 1977011 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640127926 |
Product | GHMP kinase ATP-binding subunit |
Protein accession | YP_001059033 |
Protein GI | 126438622 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.188336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTCAC TCATCCCGGC CGCGCGAACG GCGTCGGGCA TCTCGTACTG CTCGTTCGGC GAACTGCTGC AGGGCGTGCT GCTCGAGGAC GACGCCGATT TTCTCGTCAC GCTGCCGATC CAGAAATACT CGGTGAGCAC GTTCGTGCCG GACCCCGGCA GCACCGAGAT CGTCACGCAG CCGAGCGGCA AGACGAAGGC CGCGGCGCTC GCGAAGGTGA TTCTCGGGCG CTATGGGCAC AACGTCGGCG GGACGTTCTA CATCGACTGC GACATCCCGA TCGGCAAGGG GCTCAGCAGC TCGTCCGCCG ACCTGCTCGC GACCGCGCGC GCGATCGAGG TCTACATGGG CCGCGAGCTG CCGCTCGGCG AGCTGTGCCG CGACATGAGC GGCATCGAGC CGACCGACGG CGTGATGTTC GCGGAATCGG TCGTCTACCT GCAGCGCAAG GGGGTGCTGT GGTCGCGGCT CGGGCGCCTG TCCGGCATCC AGATCCTGTC GCTCGACGAG GGTGGCACGA TCGACACGCT CGAGTATCAC CGCCGCGCGC GCGCGCATCG GCACAACGCC GAGCATCGCA GCGAGTTCAA CGAACTGCTC GAGCGGATCG TCGCGGCGTT CGGCACGCGC GATCTCGACG AGATCGGCAG GGTGTCGACC CGCAGCGCGT ACATCAACCA GAAGGTCAAT CCGAAGCGCC ATCTGGCCTC GGTTCACGAC GTATGCCAGG CGACGCGGGG GCTCGGGCTC GTGACGGCGC ACAGCGGCAC GTGCATCGGC ATCCTGTACG AAACCGGGCG GGCGGGGCAC CGCGAAAACC TCGCGCGGGC GGCCGATGCG CTCTCGGGAT ACGGAAACAT CAAGGTCTAT GACACGATCC AATAG
|
Protein sequence | MRSLIPAART ASGISYCSFG ELLQGVLLED DADFLVTLPI QKYSVSTFVP DPGSTEIVTQ PSGKTKAAAL AKVILGRYGH NVGGTFYIDC DIPIGKGLSS SSADLLATAR AIEVYMGREL PLGELCRDMS GIEPTDGVMF AESVVYLQRK GVLWSRLGRL SGIQILSLDE GGTIDTLEYH RRARAHRHNA EHRSEFNELL ERIVAAFGTR DLDEIGRVST RSAYINQKVN PKRHLASVHD VCQATRGLGL VTAHSGTCIG ILYETGRAGH RENLARAADA LSGYGNIKVY DTIQ
|
| |
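The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon (the gene sequence above ends in TAG). Under those assumptions the reported values of 885 bp and 294 aa agree with the coordinates.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring spaces and any character other than A, C, G, T."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
length = gene_length(1976127, 1977011)
print(length, protein_length(length))
```

The `gc_percent` helper accepts the space-grouped sequence exactly as it is printed in the Sequence block, so the gene sequence can be pasted in unchanged to compare against the reported GC content.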