Gene BURPS1106A_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2018 
Symbol 
ID4901476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1981570 
End bp1982466 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content67% 
IMG OID640135248 
ProductGHMP kinase ATP-binding subunit 
Protein accessionYP_001066283 
Protein GI126453654 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAGC CCATGCGTTC ACTCATCCCG GCCGCGCGAA CGGCGTCGGG CATCTCGTAC 
TGCTCGTTCG GCGAACTGCT GCAGGGCGTG CTGCTCGAGG ACGACGCCGA TTTTCTCGTC
ACGCTGCCGA TCCAGAAATA CTCGGTGAGC ACGTTCGTGC CGGACCCCGG CAGCACCGAG
ATCGTCACGC AGCCGAGCGG CAAGACGAAG GCCGCGGCGC TCGCGAAGGT GATTCTCGGG
CGCTACGGGC ACAACGTCGG CGGGACGTTC TACATCGACT GCGACATCCC GATCGGCAAG
GGGCTCAGCA GCTCGTCCGC CGACCTGCTC GCGACCGCGC GCGCGATCGA GGTCTACATG
GGCCGCGAGC TGCCGCTCGG CGAGCTGTGC CGCGACATGA GCGGCATCGA GCCGACCGAC
GGCGTGATGT TCGCGGAATC GGTCGTCTAC CTGCAGCGCA AGGGGGTGCT GTGGTCGCGG
CTCGGGCGCC TGTCCGGCAT CCAGATCCTG TCGCTCGACG AGGGCGGCAC GATCGACACG
CTCGAGTATC ACCGCCGCGC GCGCGCGCAT CGGCACAACG CCGAGCATCG CAGCGAGTTC
AACGAACTGC TCGAGCGGAT CGTCGCGGCG TTCGGCGCGC GCGATCTCGA CGAGATCGGC
AGGGTGTCGA CCCGCAGCGC GTACATCAAC CAGAAGATCA ATCCGAAGCG CCATCTGGCC
TCGGTTCACG ACGTATGCCA GGCGACGCGG GGGCTCGGGC TCGTGACGGC GCACAGCGGC
ACGTGCATCG GCATCCTGTA CGAAACCGGG CGGGCGGGGC ACCGCGAAAA CCTCGCGCGG
GCAGCCGATG CGCTCTCGGG ATACGGAAAC ATCAAGGTCT ATGACACGAT CCAATAG
 
Protein sequence
MSEPMRSLIP AARTASGISY CSFGELLQGV LLEDDADFLV TLPIQKYSVS TFVPDPGSTE 
IVTQPSGKTK AAALAKVILG RYGHNVGGTF YIDCDIPIGK GLSSSSADLL ATARAIEVYM
GRELPLGELC RDMSGIEPTD GVMFAESVVY LQRKGVLWSR LGRLSGIQIL SLDEGGTIDT
LEYHRRARAH RHNAEHRSEF NELLERIVAA FGARDLDEIG RVSTRSAYIN QKINPKRHLA
SVHDVCQATR GLGLVTAHSG TCIGILYETG RAGHRENLAR AADALSGYGN IKVYDTIQ