Gene BURPS1106A_0216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0216 
Symbol 
ID4902601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp202592 
End bp203731 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content66% 
IMG OID640133446 
Productaldo/keto reductase family oxidoreductase 
Protein accessionYP_001064499 
Protein GI126454356 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGCACG ATTCGCGACG GTTGGCTAAC ATGGCCCGCT GGGGTACGTC CGGCCGGAAG 
GATCCGGCCT CGTTCAACGT CTCGACGGGA ATTCAGATGG CCTACGAAGC AGCTTCAGAA
CGCTATGCGG ACATGCAGTA TCGCGTGAGC GGCAAATCCG GGCTCAAATT GCCGGCGCTT
TCGCTCGGCT TGTGGCACAA CTTCGGCGAC ACGACGCCGA TCTCGACGCA GCGCGAGATC
CTGCGCACCG CATTCGATCT CGGCATCACG CACTTCGATC TCGCGAACAA CTACGGGCCG
CCGTACGGCA GCGCCGAAAC GAACTTCGGC CGGCTGCTGC GCGAGGATTT CAAGCCGTAT
CGCGACGAGC TGCTGATTTC GACGAAAGCC GGCTGGGACA TGTGGCCCGG CCCGTACGGC
AGCGGCGGCG GCTCGCGCAA GTACGTGCTC GCGAGCCTCG ACCAGAGCTT GCGGCGCATG
GGGCTCGACT ATGTCGACAT CTTCTATTCG CACCGCTTCG ACGCGCACAC GCCGCTCGAG
GAAACCGCGA GCGCGCTCGC GAGCGCCGTG CAGCAGGGCA AGGCGCTCTA CGTCGGGGTC
TCGTCGTATT CGGCGGCGAG CACGCGCGAG ATCGCGAAGC TGCTCGCCGA ATACAAGGTG
CCGCTGCTGA TCCACCAGCC CGCGTACAAC ATGCTCAACC GCTGGATCGA GCGCGAGCTG
CTCGACGCGC TCGACGAGAC GGGCTCGGGC TGCATCGCGT TCACGCCGCT CGCGCAGGGG
CTTCTGACCT CGAAGTATCT GAACGGCGTG CCGGCGGATG CGCGGATCAA CAAGCCGGGC
GGCGGATCGC TGAAGGAAGC TCACCTGAGC GCGGAGAACC TCGAGCACGT GCGCAAGCTG
AACGAGATCG CGCAGCGGCG CGGCCAGAGC CTCGCGCAGA TGGCGCTTGC CTGGGTGCTG
CGCGATTCGC GCGTCACGTC CGCGTTGATC GGTGCGAGCC GCGCGGAGCA GGTGCGCGAG
AACGTCGCGG CGCTCGCCCA TCTCGCGTTC AGCGACGACG AGATCGCCGA GATCGACCGC
TATGCGACCG AAGGCGGGAT CAATCTGTGG GAAAAGCCGT CCACCGATCA GGCGATCTGA
 
Protein sequence
MLHDSRRLAN MARWGTSGRK DPASFNVSTG IQMAYEAASE RYADMQYRVS GKSGLKLPAL 
SLGLWHNFGD TTPISTQREI LRTAFDLGIT HFDLANNYGP PYGSAETNFG RLLREDFKPY
RDELLISTKA GWDMWPGPYG SGGGSRKYVL ASLDQSLRRM GLDYVDIFYS HRFDAHTPLE
ETASALASAV QQGKALYVGV SSYSAASTRE IAKLLAEYKV PLLIHQPAYN MLNRWIEREL
LDALDETGSG CIAFTPLAQG LLTSKYLNGV PADARINKPG GGSLKEAHLS AENLEHVRKL
NEIAQRRGQS LAQMALAWVL RDSRVTSALI GASRAEQVRE NVAALAHLAF SDDEIAEIDR
YATEGGINLW EKPSTDQAI