Gene BURPS1106A_2562 details


Gene Information

Locus tag: BURPS1106A_2562
Symbol:
ID: 4900334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2516561
End bp: 2517760
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 69%
IMG OID: 640135789
Product: GHMP kinase domain-containing protein
Protein accession: YP_001066816
Protein GI: 126452895
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase
TIGRFAM ID:
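
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the start and end coordinates are inclusive and that the final codon is a stop codon that contributes no residue.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2516561
end_bp = 2517760
gene_length_bp = 1200
protein_length_aa = 399

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.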


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0349401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATCA TCGAGAACAA CGAAGACCTC GGCGTACTGC GGTTCATCAC GGCGGGCAGC 
GTCGACGACG GCAAGCGCGC GTTGATCGGG CGACTGCTGT ACGACGGCGA GGCGATGGAG
GCCGAGGCGC GCCGCGCACC GCCGCGCGAG ACGCGGATGC GCGCCGGCGG CGCCGCGCCC
GAACCCGCGT CGGCCCCTCG CACGCACGTC GCGCCCGATC TCGATGCGAT GGCGGGCGGG
CCGTCGATCG CCGCGGGCGG CCATGTCGGC CGCGACGGCG GGCGCACGTG CGCGAAGCCC
GACAGGACGC GCGCGGCGAG CGCGCACGCA CCGCCCCGGC GCCTGTTCTC GATCGGCCGC
GCGCCGGCCA CGTTCGGCGA ACTGGTTCAG GGGCGCGAGC CCGCGTCCGG CGACGATTTT
CTGATCACGC TGCCGATCAC GCTGAGCTCG ACTGCCCGAT TCTGCCGGTT TCGCGATTCC
GATCGCCTGT ATGTCTTTCC GGCGAGCAAG AAGAAATCGC TGAAGGCCGC CGCGCTCTTT
CTCGAACGAT TCGGCATCCT GACGGGCGGC GTCCTGCAGA TCTGCAGCGA CGTGTCCGAG
GGCAAGGGGC TCGCGAGCTC GTCGTCCGAC ATCGTCGCGA CGCTGCGCGC GCTCGCCGCG
TGTTTCGACA TCCCGCTTTC TCCCGCCGAC ATGTGCGCGA TCATTCGCGA GATCGAGCCG
ACCGACGGCG TGATGTTCGA CGAATCGGTC GCGTTCTTCC ATCGCCGGGT CGAGCTCGGC
AAGGTGATGG GCCGGCTGCC GAAAATCTGC ATTCTCGCGA TCGACGAGGG CGGCACGATC
GATACCGTCG AGTACAACTG CCATCGCTTC GAGTTTTCCC ACGAGGAAGC GGATCAGTAC
GCGGCGCTGC TCGCCGACGT CGACGCGGCG ATCTCGCGCA GCGATGTGCG GCAGATCGGG
CGCGCGGCGA CGCTCAGCGC GCAAATGCAC CAGAAGCGCA ACCCGAAGCG AACGCTGCGG
CAGCTCGAAG CGCTGATGCG CGAAGTCGGC GCGCACGGCA TCGTCAATTG CCACAGCGGC
ACGTTCATCG GCCTGTGCTT CGATGCGTCG GGCCCCGACG CGCTCGACAC GATCGCGCGT
GCCGAGCGCA CGCTGCGCGA CGCGCTTGGC CAACCCATCT CGCGCTTCTT CACCAGGTGA
 
Protein sequence
MSIIENNEDL GVLRFITAGS VDDGKRALIG RLLYDGEAME AEARRAPPRE TRMRAGGAAP 
EPASAPRTHV APDLDAMAGG PSIAAGGHVG RDGGRTCAKP DRTRAASAHA PPRRLFSIGR
APATFGELVQ GREPASGDDF LITLPITLSS TARFCRFRDS DRLYVFPASK KKSLKAAALF
LERFGILTGG VLQICSDVSE GKGLASSSSD IVATLRALAA CFDIPLSPAD MCAIIREIEP
TDGVMFDESV AFFHRRVELG KVMGRLPKIC ILAIDEGGTI DTVEYNCHRF EFSHEEADQY
AALLADVDAA ISRSDVRQIG RAATLSAQMH QKRNPKRTLR QLEALMREVG AHGIVNCHSG
TFIGLCFDAS GPDALDTIAR AERTLRDALG QPISRFFTR
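
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is built by hand from the standard codon assignments, which translation table 11 shares for elongation. Alternative start codons, which table 11 permits, are not handled; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate in-frame DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 60 bases of the gene sequence in this record.
# Both slices start on a codon boundary (offsets 0 and 1140).
first_bases = "ATGAGCATCA TCGAGAACAA CGAAGACCTC"
last_bases = "GCCGAGCGCA CGCTGCGCGA CGCGCTTGGC CAACCCATCT CGCGCTTCTT CACCAGGTGA"

print(translate(first_bases))  # MSIIENNEDL
print(translate(last_bases))   # AERTLRDALGQPISRFFTR
```

The two outputs correspond to the first ten and last nineteen residues of the protein sequence listed above. Passing the full 1200-base gene sequence to the same function would allow the whole protein to be compared.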