Gene BURPS1106A_2346 details

Gene Information

Locus tag: BURPS1106A_2346
Symbol:
ID: 4901009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2321543
End bp: 2323138
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 77%
IMG OID: 640135574
Product: carbohydrate kinase family protein
Protein accession: YP_001066609
Protein GI: 126452602
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
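
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2321543, 2323138)
print(length, protein_length_aa(length))  # 1596 531
```

Both values agree with the Gene Length and Protein Length fields reported in this record.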


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.121214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCTGC CTGCCGCCTC CCCGCTGCCC CGTTCGCTCG AACCGTTCGA TGCGCCGCCG 
ATTCATGCGA GCGCGCCGCT CCTGAGCGTC GCCGAGCTGC GCGACATCGA AACCGCGGCG
GCCGCCGCGC TGCCGCCCCA CACGCTGATG GAGCGCGCGG GCAAATCGGC CGCGCAGTGG
CTCGCCGCGC GGCTCGTGAG CGACCCGCGG CCCGTGTGGT TCGCGGTCGG CCCGGGCAAC
AACGGCGGCG ACGCGCTCGT CGCCGCGGCC GAGCTGCGCC GGCTCGGCTT CGCGGCCGAC
GCCTGGATGC CGATCGAGGT GAAGCCTGAC GACGCGCGCT GGGCGCTCGA GCGCGCGCGC
GCGGCGAACG TGCCGATCGA CGAGGCGGCG CCCGAATCGT TCGACGGCTA CGGCTGGCTC
GTCGACGGGC TGTTCGGCAT CGGCCTCGCA CGGCCGCTCG ACGGCGCGTT CGCCGCGATC
GCGCAGCGCA TCGCGGCGCG CGCGCGGCAC ACCGGCCGCG TGCTCGCGCT CGACGTGCCG
AGCGGCCTCG ACAGCGACAC CGGCGCGCGC GTCGGCGGCG GGACCGCCGT CACGGCCACC
TGCACGCTGT CGTTCATCGC CGCGAAGCCC GGCCTCTATA CCGGCGACGG GCGCGACCTC
GCGGGCGAAA TCCATGTCGC GCCCCTCGAT CTCGGCGAGC CGCCCGCGCC CGCGATCCGG
CTGAACGCGC CCGAGCTCTT CGAGGCGCGC CTGCCCGAGC GCGCGTTCGC ATCGCACAAG
GGCACGTACG GCAGCCTCGG GATCGTCGGC GGAGACACGG GCATGTGCGG CGCGCCGATC
CTCGCCGCGC GCGCGGCGCT CTTCGCCGGC GCGGGCAAGG TCCATGTCGG CTTCGTCGGC
ACGGGCGCGC CGCCGTACGA TCCGCCGTAT CCGGAGCTGA TGCTGCATCC GGCCGACGCG
CTGCCGAGCG CGTCGCTCAC CGCGCTCGCG ATCGGCTGCG GGCTCGGCGC GAGCGAGCGC
GCCGCGCGCG TGCTCGCGGC GCTGCTGCCG CTCGATGCGC CGAAGCTCAT CGACGCCGAC
GCGCTGAATC TGATCGCGAC GACGCCCGCG CTCGCGGCGA CGCTCGCCGC GCGCGGCCGC
ACAGGCGACG CCGCCGTCCT CACGCCGCAT CCGCTCGAGG CCGCGCGCCT GCTCGCCACC
GACGCGGCCG ACGTCCAGCG CGACCGCGTC GCCGCCGCGC GCGCGCTCTG CGCGCGCTTC
TCGGCGGTCG TCGTGCTGAA AGGGTCCGGC ACCGTGATCG CGGCGCCGGA CGGCCGCCTC
GCGATCAATC CGACCGGCAA CGCGGCGCTC GCCACCGGCG GCACGGGCGA CGTGCTGGGC
GGCCTGATCG GCGCGTTTCT TGCGCAGCGG ATGCCGCGCT ACGAAGCGGC GCTCGCGGGC
GTCTACCTGC ACGGGCTCGC CGCCGAGCGG CTGTGCGCGG CGGGCGCGGG CCCGGCCGGC
CTCGCCGCGG GCGAACTCGC GCCCGCCGTG CGCGCGCTCG TCAATCGGCT GTTTTATACG
CGGCCCGCCG CGCCGGACGA AGCGCCGCTA TACTGA
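
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, of how a GC fraction could be computed from such a blocked sequence. The helpers `clean_sequence` and `gc_fraction` are illustrative names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence (60 bases in six blocks).
first_line = "ATGATCCTGC CTGCCGCCTC CCCGCTGCCC CGTTCGCTCG AACCGTTCGA TGCGCCGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence through the same two functions gives a value that can be compared with the GC content field of the record.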
 
Protein sequence
MILPAASPLP RSLEPFDAPP IHASAPLLSV AELRDIETAA AAALPPHTLM ERAGKSAAQW 
LAARLVSDPR PVWFAVGPGN NGGDALVAAA ELRRLGFAAD AWMPIEVKPD DARWALERAR
AANVPIDEAA PESFDGYGWL VDGLFGIGLA RPLDGAFAAI AQRIAARARH TGRVLALDVP
SGLDSDTGAR VGGGTAVTAT CTLSFIAAKP GLYTGDGRDL AGEIHVAPLD LGEPPAPAIR
LNAPELFEAR LPERAFASHK GTYGSLGIVG GDTGMCGAPI LAARAALFAG AGKVHVGFVG
TGAPPYDPPY PELMLHPADA LPSASLTALA IGCGLGASER AARVLAALLP LDAPKLIDAD
ALNLIATTPA LAATLAARGR TGDAAVLTPH PLEAARLLAT DAADVQRDRV AAARALCARF
SAVVVLKGSG TVIAAPDGRL AINPTGNAAL ATGGTGDVLG GLIGAFLAQR MPRYEAALAG
VYLHGLAAER LCAAGAGPAG LAAGELAPAV RALVNRLFYT RPAAPDEAPL Y
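
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own pipeline: it builds the codon table from the compact NCBI layout (bases ordered T, C, A, G), whose amino acid assignments are the same for translation table 11 and the standard code. It assumes the input starts at an ATG, so the alternative start codons that table 11 permits are not handled. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATCCTGC CTGCCGCCTC CCCGCTGCCC"))  # MILPAASPLP
```

The output matches the first block of the protein sequence, and the final codons of the gene (CTA TAC TGA) translate to the closing residues L and Y followed by a stop.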