Gene BURPS668_A1443 details

Gene Information

Locus tag: BURPS668_A1443
Symbol:
ID: 4888395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1349182
End bp: 1350483
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 68%
IMG OID: 640131382
Product: putative UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_001062440
Protein GI: 126443121
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
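
The coordinate and length fields above can be cross-checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive (the record does not say so) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1349182
end_bp = 1350483
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                        # length is a whole number of codons
print(expected_protein_length == protein_length_aa)   # codons minus stop equals protein length
```

Under these assumptions all three checks hold: the span is 1302 bp, which is 434 codons, or 433 residues plus one stop codon.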


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAACC TGATCGTCCA CGGCGGCGCC CCGCTTCGCG GAGAAATCAC GCCGTCCGCC 
AACAAGAACG CCGTCCTGCC CATCCTGTGC GCGACGCTCC TCACCGACCG GCCGCTGCGG
CTCGTCGGCG TGCCGGACAT CACCGACGTG CGCAAGATCC TCGACATCTT CCGCACGCTC
GGCAGCGACG TCTCGATCGA TTACGCGAGC GGCGTGCTCG ATCTGCACCA TCGCGCGACC
GCGTTCGATC CGGCCGTCCA CCGGCTGCCG GAGGAGATGC GCTCGTCGAT CATGCTGGTG
CCGCCGCTGC TCGCGCGCTT CGGCGTCGCG CGGCTCGAGA ACGACGTAAA GGGCTGCACG
CTCGGCGTGC GCGAGATCGA TCCGCACGTC GAAGTGTTCG AGCGCTTCGG CGCGCGCATC
GAGCGCACGT CCGATTCGCT GATCGTGCGC GCCGACGGCC CGCTCACGCC GAATCATCAC
TGGCTCGACT ACGCGTCCGT CACGACGACC GAGAACTTCG TGCTGTGCGC CGCGTCGGCG
AACGGCACGT CGACGCTCGT CAATGCCGCG TCGGAGCCGC ACGTGCAGGA GTTCTGCCGG
TTCCTCGCGA TGCTCGGCGT GCCGATCGAG GGCATCGGCA CATCGCACCT GAGCGTTCAG
GGCGGGCGCG CGCTCGCGGG CGGCGAATAC CGCTTCAACG AGGACTTTCA CGAAATCGCG
ACGTTTCTCG CGCTCGGCGC GATCACGGGC GGCGACATCG CGGTGCGCAA CGGCTCGCCC
GAGCAGTTTC CGCTGATCGA TCGGACCTTC GCGAAATTCG GCGTGCAGGT CACGCACGAG
AACGGCTGGT CGCACGCGCT GCGCGACGGC CCGCTGAAGG TCAAGCAGCC GTTCACGCGC
AACATCCTGA CGAAAGTCGA GGCCGCGCCG TGGCCCTACC TGCCCGTCGA TCTGCTGCCG
ATCTTCATCG CGCTCGGCGT GCAGGCGCAA GGCAGCGTGA TGTTCTGGAA CAAGGTGTAT
GACGGCGCGA TGGGCTGGAC GGGCGAGCTG TCGAAGTTCG GCGCGCACGT GTTCCTGTCC
GATCCGCATC GGCTGATCAC GTTCGGCGGG CTGCCGCTCA GCCCGGCGCG CGTCGAGAGC
CCGTACATCA TCCGCGTCGC GATCGCGCTG CTGATGGTCG CCGCGAGCAT CGACGGACGC
TCGGAGATCC TGAACGCACA GCCGATCCGG CGCGCGCATC CGCACTTCGT CGAGAACCTG
CGCTCGGTCG GCGCGAACGT CGAGTGGACG AGCGGCGAAT GA
 
Protein sequence
MSNLIVHGGA PLRGEITPSA NKNAVLPILC ATLLTDRPLR LVGVPDITDV RKILDIFRTL 
GSDVSIDYAS GVLDLHHRAT AFDPAVHRLP EEMRSSIMLV PPLLARFGVA RLENDVKGCT
LGVREIDPHV EVFERFGARI ERTSDSLIVR ADGPLTPNHH WLDYASVTTT ENFVLCAASA
NGTSTLVNAA SEPHVQEFCR FLAMLGVPIE GIGTSHLSVQ GGRALAGGEY RFNEDFHEIA
TFLALGAITG GDIAVRNGSP EQFPLIDRTF AKFGVQVTHE NGWSHALRDG PLKVKQPFTR
NILTKVEAAP WPYLPVDLLP IFIALGVQAQ GSVMFWNKVY DGAMGWTGEL SKFGAHVFLS
DPHRLITFGG LPLSPARVES PYIIRVAIAL LMVAASIDGR SEILNAQPIR RAHPHFVENL
RSVGANVEWT SGE
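
The sequence blocks above are wrapped in groups of 10 characters, so they need light cleaning before use. The following Python sketch shows one possible way to strip the grouping, compute GC content, and translate the gene sequence. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tooling. The codon table is the standard one, which translation table 11 shares for all codon assignments (table 11 differs in the start codons it permits; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order. Translation table 11 uses the same
# assignments; it differs only in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCGAACC TGATCGTCCA CGGCGGCGCC CCGCTTCGCG GAGAAATCAC GCCGTCCGCC"
print(translate(first_line))  # MSNLIVHGGAPLRGEITPSA
```

The 20 residues printed for the first line match the first two groups of the protein sequence (MSNLIVHGGA PLRGEITPSA). Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.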