Gene BURPS1106A_0224 details

Gene Information

Locus tag: BURPS1106A_0224
Symbol:
ID: 4902262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 207385
End bp: 208416
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 69%
IMG OID: 640133454
Product: serine/threonine protein kinase
Protein accession: YP_001064507
Protein GI: 126454046
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID:
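
The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the Gene Information fields of this record.
START_BP = 207385
END_BP = 208416


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1032 343"
```

Under these assumptions the result agrees with the reported Gene Length (1032 bp) and Protein Length (343 aa).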


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.304793
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACG CCACTTCCGA ATCCAACGCA GCCGGCGCTT GCGCCGGCTT GCCTTTTGCA 
GGGCTCACGC CGGAGCGCGT GCTCGACGCA CTCGACAGCG TGCTGATTCC CGCCGGTTCG
CGCACCGACG GGCGCCTGCT CGCGCTCAAC AGCTACGAAA ACCGCGTCTA TCAGGCCGGC
ATCGAGGACG GCGCGCCGAT CGTCGCGAAA TTCTATCGTC CGCAGCGCTG GTCGAACGAC
GCGATCCTCG AAGAGCATAC GTTCGTCGCC GAGCTGGCCG CGCGCGAGAT TCCGGCCGTG
CCGCCGCTCG CCTTCGACGG CCGCACCCTG CACGAATTCG ACGGTTTTCG CTTCGCGATC
TTCGAGCGGC GCGGCGGGCG CGCGCCGGAG CTCGACCGGC GCGATACGCT CGAATGGCTC
GGCCGCTTCA TCGGGCGCAT CCACGCGGTC GGCGCGACCA AGCCGTACGC CGCGCGGCCG
ACGCTCGACC TCCGCACGTT CGGCTACGAG CCGCGCGATT TCCTGATGTC GCACGACTTC
GTGCCGGACG ACGTTCGGCC TGCTTACGAA GCGGCGGTCG CGCTCGCGCT GGAAGGCGTC
GAGCGCGCGT ACGAGCGCGC GGGCGACGTG CGGATATTGC GCGCGCATGG CGACTGCCAT
CCGAGCAACG TGCTGTGGAC CGACGCGGGC CCGCACTTCG TCGATTTCGA CGACAGCCGG
ATGGCGCCCG CCGTGCAGGA TCTGTGGCTG CTCCTGCCCG GAGACCGGCC GGGCGCGTCG
CGCGCGCTCA CCGATCTGCT CGCGGGCTAC GAGGACTTCT GCGAATTCGA TCCGCGCGAG
CTGCATCTGA TCGAGGCGCT GCGCACGCTG CGGCTCATCC ATTACGCGGC GTGGCTCGCG
CGCCGCTGGG ACGATCCCGC GTTCCCCGCC GCGTTTCCGT GGTTCAACAC GCATCGCTAT
TGGGAAGCGC GCGTGCTCGA ATTGCGCGAG CAGATCGGCG CGATGCAGGA AGGGCCGTTG
TGGCCCGTGT GA
 
Protein sequence
MNDATSESNA AGACAGLPFA GLTPERVLDA LDSVLIPAGS RTDGRLLALN SYENRVYQAG 
IEDGAPIVAK FYRPQRWSND AILEEHTFVA ELAAREIPAV PPLAFDGRTL HEFDGFRFAI
FERRGGRAPE LDRRDTLEWL GRFIGRIHAV GATKPYAARP TLDLRTFGYE PRDFLMSHDF
VPDDVRPAYE AAVALALEGV ERAYERAGDV RILRAHGDCH PSNVLWTDAG PHFVDFDDSR
MAPAVQDLWL LLPGDRPGAS RALTDLLAGY EDFCEFDPRE LHLIEALRTL RLIHYAAWLA
RRWDDPAFPA AFPWFNTHRY WEARVLELRE QIGAMQEGPL WPV
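
The sequences above are displayed in space-separated blocks of 10, so they need to be cleaned before any calculation such as GC content. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first displayed line of the gene sequence is used as sample input, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence only (sample input, not the full gene).
first_line = "ATGAACGACG CCACTTCCGA ATCCAACGCA GCCGGCGCTT GCGCCGGCTT GCCTTTTGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

To compare against the reported GC content, paste the complete gene sequence listing into `clean_sequence` in place of `first_line`.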