Gene BURPS1106A_3134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3134 
Symbol 
ID4901346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3055999 
End bp3057069 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID640136360 
ProductNAD-dependent epimerase/dehydratase family protein 
Protein accessionYP_001067372 
Protein GI126453040 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.983056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTTGACG GGAAAAAGAT CCTCGTGACC GGTGGGGCTG GCTTTATCGG CTGCGCGATA 
TCGGAGCGAC TCGCAGCGCG TGCAAGCCGC TACGTCGTAA TGGACAACTT GCATCCGCAG
ATCCATGCGA GCGCGGTTCG TCCTGGCGCG CTTCACGAGA AAGCGGAACT CGTCGTTGCC
GACGTCACGG ACGCCGGTGC ATGGGATGCG CTGCTGAGCG ATTTTCAACC GGAAATCATC
ATACATCTGG CCGCCGAAAC GGGCACGGGC CAATCGCTGA CGGAAGCGAG TCGGCATGCG
CTCGTCAACG TCGTCGGCAC CACGCGGCTG ACGGACGCGC TCGTCAAGCA CGGCATCGTG
GTCGAGCACA TTCTGCTGAC GAGCAGCCGC GCGGTCTATG GCGAAGGGGC ATGGCAGAAG
GACGATGGCA CGATCGTTTA TCCCGGCCAA CGCGGGCGCG CCCAGCTCGA GGCTGCGCAA
TGGGATTTCC CGGGGATGAC GATGCTGCCT TCGCGTGCGG ACCGTACCGA GCCGCGGCCG
ACGAGCGTCT ATGGTGCAAC GAAGCTCGCG CAGGAACACG TACTGCGTGC ATGGTCGCTC
GCAACGAAAA CGCCGCTGTC GATTTTGCGT CTGCAGAACG TTTATGGCCC GGGTCAATCG
TTGACTAACT CCTATACCGG CATCGTCGCG CTTTTCTCTC GGCTTGCTCG CGAAAAGAAG
GTGATTCCGC TCTATGAAGA CGGCAATGTG ACGCGCGATT TTGTCAGTAT CGACGATGTG
GCGGACGCCA TTGTCGCGAC GTTGGTGCGC ACGCCGGAAG CACTCTCTCT TTTCGATATC
GGCTCCGGAC AGGCGACGAG CATTCTCGAT ATGGCTCGAA TCATCGCGGC GCATTACGGC
GCTCCCGAGC CGCAGATCAA CGGTGCATTC CGCGACGGAG ATGTACGACA CGCGGCGTGC
GACTTGAGCG AATCGTTGGC GAACCTTGGA TGGAAGCCGC AGTGGTCGCT CAAACGCGGG
ATCGGCGAAT TGCAGACCTG GATCGCGCAA GAGCTTGATC GCAAGAACTA G
 
Protein sequence
MVDGKKILVT GGAGFIGCAI SERLAARASR YVVMDNLHPQ IHASAVRPGA LHEKAELVVA 
DVTDAGAWDA LLSDFQPEII IHLAAETGTG QSLTEASRHA LVNVVGTTRL TDALVKHGIV
VEHILLTSSR AVYGEGAWQK DDGTIVYPGQ RGRAQLEAAQ WDFPGMTMLP SRADRTEPRP
TSVYGATKLA QEHVLRAWSL ATKTPLSILR LQNVYGPGQS LTNSYTGIVA LFSRLAREKK
VIPLYEDGNV TRDFVSIDDV ADAIVATLVR TPEALSLFDI GSGQATSILD MARIIAAHYG
APEPQINGAF RDGDVRHAAC DLSESLANLG WKPQWSLKRG IGELQTWIAQ ELDRKN