Gene BURPS1106A_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1006 
Symbol 
ID4901159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp985122 
End bp986018 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content70% 
IMG OID640134236 
Productputative dihydrodipicolinate synthase 
Protein accessionYP_001065287 
Protein GI126454975 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR00674] dihydrodipicolinate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAACC TCTTGCAAGG CATCATCGCC TACCCCGTCA CGCCCTTCTC GCCGGACGGC 
CGGCTCGACA CGGCCGCGCT CGGCGCGCTC ATCGAACGCC TGATCGCGAG CGGCGTGCAC
GGCATCGCGC CGCTCGGCAG CACCGGCGAA AGCGCCTATC TGTCCGACGC CGAATGGGAA
GCCGCCGCGT CGGCCTCGAT TCGCGCGGTC GAGCGCCGCG TGCCGACCGT CGTCGGCATT
TCCGATCTCA CCACCGCGAA CGCGGTGCGC CGCGCGAAAT TCGCCGAACA GGCGGGCGCG
GACGCGGTCA TGGTGCTGCC CGTGTCGTAC TGGCGGCTCG ACGACGAAGC GATCGTCGGC
CACTACCGCG CGATCGGCGA CGCGATCGGC ATTCCGATCA TGCTGTACAA CAACCCGGCG
ACGAGCGGCA TCGACATGTC GCCCGAGCTG ATCGCGCGCA TCTTCCGCAC GGTCGACAAC
GTGACGATGG TCAAGGAGAG CACGGGCGAC ATCAAGCGCA TGCACCGGCT CGCGCAACTG
GGCGACGGCG CGATCCCGTT CTACAACGGC AGCAATCCGA TGGCGCTCGC CGCGCTCGCG
GCCGGCGCGG CCGGCTGGTG CACCGCCGCG CCGAACCTGA ACGCGCGCCT GCCGCTCGCG
TTATACGACG CGATGCGCGC AAGCGATCTC GACACGGCGC GCGCCGTCTT TCATCGACAG
TTGCCGCTGT TGCAGTTCAT CGTCTCGGGC GGGCTGCCCG TCACGGTGAA GGCCGGGCTG
CGGCTCGCGG GCTTCGACGC GGGCGAGCCG CGCAAGCCGC TGCGCCCGCT CGACGAAGCG
CGCACGCGCG AGCTCGCCGC GATTCTCGAC GCGCTGCGCG ACACCGCGCA CGCGTGA
 
Protein sequence
MSNLLQGIIA YPVTPFSPDG RLDTAALGAL IERLIASGVH GIAPLGSTGE SAYLSDAEWE 
AAASASIRAV ERRVPTVVGI SDLTTANAVR RAKFAEQAGA DAVMVLPVSY WRLDDEAIVG
HYRAIGDAIG IPIMLYNNPA TSGIDMSPEL IARIFRTVDN VTMVKESTGD IKRMHRLAQL
GDGAIPFYNG SNPMALAALA AGAAGWCTAA PNLNARLPLA LYDAMRASDL DTARAVFHRQ
LPLLQFIVSG GLPVTVKAGL RLAGFDAGEP RKPLRPLDEA RTRELAAILD ALRDTAHA