Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1601 |
Symbol | |
ID | 4886592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1533704 |
End bp | 1534762 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640131540 |
Product | hydroxyethylthiazole kinase, putative |
Protein accession | YP_001062597 |
Protein GI | 126443386 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2145] Hydroxyethylthiazole kinase, sugar kinase family |
TIGRFAM ID | [TIGR00694] hydroxyethylthiazole kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.963992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGGC CGAGCAAGCC GCGGGGCCGC GGCCCGGCAT CGACGACGCG CGAAGATCGT TGTCGGCTTG ACGACGATCG CTGCGCACGC GTGCCGGAAC GTTTACAGTG CGGCGGCCGC CCTTTCCCAC CGCCATTGCG GCGAGCGCCG ACGGGCAGCG GGCATGCGAT GCGCAAGCGC ACGCCGTCGA GCGCGCGAGC GCGCGCGAAC GCAACTTGGA ACGACACCAT GGAATCGATC AGCTGGAACA CCCCGTCCGT TCGCGACGCG CTCGCCGCCG TCAAGCGCGA CGCGCCGTTC GTCTACGGAC TCACGAACTA TGTTGCCGCG AACCTGAGCG CGAACGTGCT GCTCGCCGTC GGCGCGGCGC CCGCGATCGG CGCCGCGGCC GACTGGCCGG CGCGTTTCGG CGCCGGCGCG AACGCGCTGT GGATCAACAC GGCCGCGCTG ATGAGCAGCG GTGCCGACAC GCTGCTCACG GCCGCGCGCG CCGCGTCGAA AGCCGGCACG CGCTGGGTGC TCGATCCGGT CGCGCTCGGC GCGGGCGCGC CCGAATACGA CGCGATCGTG CGCGATCTGC TCGCCCTGCG GCCGACCGTG ATTCGCGGCA ACGCCAGCGA GCTGATCGCG CTCGCGGGCG GCACGGCGGC CGGCAAGGGC GTCGACACGA CCGCGAGCCC GGAAAGCGCG CTCGCGTTCA TCGGCGATCT GGCGCGGCGC AGCGGCGCCG TCGTCGCGGT GAGCGGCCCG ACCGACTACG TGACGGACGG CGTCGCGACA CTCGCCGTCG CGGGCGGCGA TGCCCGCCTC ACGCGTGTGA CGGGCGCCGG ATGCGCGCTC GGCGCGCTGA TCGCGGCGCT GCTCGCGCAA CGCGGCGCGG CGCTCGCCGC CGCGAGCGCC GCGCACGCGA TTTATGCGAC CGCCGCCGAG CGCGCGGCGG ACGCGCGCGG CACCGCATCG TTCGCGGTGC GCTTCGTCGA CGAACTGTCG CTGCTCGATC CCGCCGAATC GTCGCGCGAT CGCTCGGCCG GGCAGATCGG CGCGAAACGG CGCGAGTGA
|
Protein sequence | MPRPSKPRGR GPASTTREDR CRLDDDRCAR VPERLQCGGR PFPPPLRRAP TGSGHAMRKR TPSSARARAN ATWNDTMESI SWNTPSVRDA LAAVKRDAPF VYGLTNYVAA NLSANVLLAV GAAPAIGAAA DWPARFGAGA NALWINTAAL MSSGADTLLT AARAASKAGT RWVLDPVALG AGAPEYDAIV RDLLALRPTV IRGNASELIA LAGGTAAGKG VDTTASPESA LAFIGDLARR SGAVVAVSGP TDYVTDGVAT LAVAGGDARL TRVTGAGCAL GALIAALLAQ RGAALAAASA AHAIYATAAE RAADARGTAS FAVRFVDELS LLDPAESSRD RSAGQIGAKR RE
|
| |