Gene BURPS1106A_0999 details


Gene Information

Locus tag: BURPS1106A_0999
Symbol:
ID: 4903007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 978462
End bp: 979541
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 72%
IMG OID: 640134228
Product: NAD-dependent epimerase/dehydratase family protein
Protein accession: YP_001065279
Protein GI: 126452148
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
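
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 978462
END_BP = 979541


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1080 359"
```

Under these assumptions the result matches the listed Gene Length (1080 bp) and Protein Length (359 aa).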


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGCAATGCG CGCACGGCGG CTACAATGCA TTCCTACGGA TCAACCGATA CATCATGATT 
GCGACACGAA TCCTGCGCCG GCCGCGCGTA TTGATCGTCG GCTGCGGCGA TGTCGGCACG
CGCTGCGCCG CGCAACTGCG CGCGCGGCGC GAGAACCTGC GCATCCTCGC GCTGACGAGC
CGGCGTTCGC GCTGCGTCGA GCTTCGGGCG GCGGGCGTCG TGCCCGTCGT CGGCGATCTG
GATGCGCGCG CGACGCTTAA GCGGATCGCG CGCGTCGCGC CCGTCGTGCT GCATCTCGCG
CCGCCGCAGG CCACGGGCGA CGTCGATCGC CGCACGCAGG CGCTCGTCGC CGCGCTCGCG
TCGCCGCGGC GGCCGTCGCG TCAACTCGCG CCGGCATACG GCAGGCTGCG CGCGTGGCGG
ACCGCCGCCA GATCGGCTCG GCCGCCTTTT CAGGCATCGG CTATTGTACC CGACGCCCTG
CCGCGCCCCG TCGTCGTCTA TGCGAGCACG AGCGGCGTCT ATGGCGATTG CGGCGGCGCG
CGGGTTGACG AAACGCGTGC GGTGCGGCCC GCGAATCCGC GCGCGCGGCG GCGCGTGTCG
GCCGAGCGCC AGTTGCGCCG CGCGACCGCG CGCGGCGCGC TGTCCGCGCG CATCGTGCGG
ATCCCCGGCA TCTACGCGGC GAACCGGCTG CCGCTCGCGC GGCTCGAGAA GGGGACGCCG
GCCCTCGTCG AGGCCGACGA CGTCTATACG AACCATATCC ACGCCGACGA TCTCGCGTCG
ATTCTGTTGC GCGCCGCCGT GCGCGGCAAG CCCGCGCGGG TCGTTCATGC GAGCGACGAC
ACCGAGCTGA AGATGGGAGA TTACTTCGAG CGGGTGGCGC GCGCGTTCGG CCTGCGCAGC
CCGCCGCGCA TCGCGCGCGC CGAGGCGGAG CGGCAGCTCG AGCCGATGCT GCTGTCGTTC
ATGCGCGAAT CGCGGCGGCT CGCGAACGCG AGAATGAAGC GCGAATTGCG CATCGCGCTG
CGTTACCCGA GCGTCGACGA CTTTCTGCGC ACCGTATCCG CGCCGCGTCC GCTCAAGTGA
 
Protein sequence
MQCAHGGYNA FLRINRYIMI ATRILRRPRV LIVGCGDVGT RCAAQLRARR ENLRILALTS 
RRSRCVELRA AGVVPVVGDL DARATLKRIA RVAPVVLHLA PPQATGDVDR RTQALVAALA
SPRRPSRQLA PAYGRLRAWR TAARSARPPF QASAIVPDAL PRPVVVYAST SGVYGDCGGA
RVDETRAVRP ANPRARRRVS AERQLRRATA RGALSARIVR IPGIYAANRL PLARLEKGTP
ALVEADDVYT NHIHADDLAS ILLRAAVRGK PARVVHASDD TELKMGDYFE RVARAFGLRS
PPRIARAEAE RQLEPMLLSF MRESRRLANA RMKRELRIAL RYPSVDDFLR TVSAPRPLK
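
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one possible way to do that, not the database's own method. It assumes the whitespace-grouped blocks are pasted in as plain strings. It relies on translation table 11 using the standard amino-acid assignments while allowing alternative start codons such as GTG, which are read as methionine at the first position. `START_CODONS` lists only a subset of those initiators, and all names are illustrative.

```python
# Standard amino-acid assignments in NCBI codon order (T, C, A, G).
# Translation table 11 shares these assignments; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at '*'."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still encode Met at position 1
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = "GTGCAATGCG CGCACGGCGG CTACAATGCA TTCCTACGGA TCAACCGATA CATCATGATT"
    print(translate(first_line))  # prints "MQCAHGGYNAFLRINRYIMI"
```

Passing the full gene sequence to `translate` and `gc_content` should allow a comparison with the protein sequence and the GC content listed in the record.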