Gene BURPS1106A_A1242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1242 
Symbol 
ID4903718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1176330 
End bp1177820 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content71% 
IMG OID640144348 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_001075277 
Protein GI126456955 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGCAT TCGACTCATC GCTCGTGCCG TCGGGCGACA TCCTGATCGG CGGAGAGTGG 
CGCCGCGGCC GCGGCGCGAC GACGCCGAGC TTCTATCCGG CCGACGGCTC GCTCAACACC
GAGATCCACA TGGCCGACGC GGCCGACGCG CGCGAAGCGG TGCAGGCGGC CGACGCCGCG
TGGCGCCGCG CGGACTGGGC GGGCCTGAAG CCGCATCAGC GCGCGGACGT GCTGTACCGC
ATCGCCGATC TGATTCACGC GCATCGCGAG GCGCTCGCGC AACTGCAGCG GCGCGACAAC
GGCAAGCCGA TCAACGAGAC GCGCGCGCTC GTCGCGAGCG CGGCGAGCAC GTTCCGCTAT
TTCGCCGCGT GCGCGCAGAC GCTCGACGAA GCGCTGACGC CGTCGCGCGG CGATTATCTG
TCGATGAGCG TGCACGAGCC GCTCGGCGTC GTCGCGGCGA TCACGCCGTG GAATTCGCCG
ATCGCCTCCG ATGCGCAGAA GCTCGGCCCG GCGCTCGCGG CGGGCAACGC CGTCGTGCTG
AAGCCGGCCG AGGTGACGCC GCTCGCGTCG CTCGCGCTCG CGCGGCTCTG CGAGCAGGCG
GGCGTGCCGC GCGGCGTGAT CTCGGTGCTG CCGGGCAAGG GTTCGGTGAT CGGCGATGCG
CTCGTGCGCG ATCCGCTCGT GAAGAAGGTG TCGTTTACGG GCGGCACCGA GGTGGGCCGG
GGCATCGCGC GGCTCGCGGC CGAGAAGCTG ATGCCGCTGT CGCTCGAACT GGGCGGCAAG
TCGCCGACGA TCGTGTTCGA CGACGCCGAG CTCGATCACG CGGTCAACGG CGTGTTGTAC
GGCATCTTCA GCTCGTCGGG CGAATCGTGC ATCGCGGGCT CGCGCCTGTT CGTCCAGCGC
TCGATCTACG GCGCGTTCGT CGCGCGCCTC GTCGAAGCGG CGCGCAAGCT GCGCGTCGGC
GATCCGGCGA GCGAGCGCAC GCAGATGGGC CCGCTCATCA CCGCGCGGCA TCGCGACACG
GTCGAGCGCT ACGTCGCGCT CGGCCGCGAC GAGGGCGCCC GCGTGCTGTG CGGCGGCGAG
CGGCCGACAG GCGAGGGCCG CGACGCGGGC TTCTTCTATC TGCCGACGAT TCTCGACGGC
CTGTCGAACC ACGCACGCAT TTGCCGGGAG GAAATCTTCG GGCCGGTGCT CGTCGCGCTG
CCGTTCGACG ACGAAGCGGC GCTCGTCGCC GACGCGAACG ACAGCGTGTT CGGGCTTGCC
GCCGGCATCT GGACGCGCGA CTACAAGCGC GCGTGGCGCG TCGCGCGCGC GCTCGACGCG
GGCACCGTGT GGATCAACAC GTACAAGCAG TTCTCGATCT CGACGCCGTT CTCGGGCCGG
AAGGAAAGCG GGATGGGCCG CGAGAAGGGC AGCCTCGGGA TTCGCGAGTA CATGCAGCAG
AAGAGCCTCT ACTGGGGCTT GAACGATTCG CCGCTGCCGT GGGCGAACTG A
 
Protein sequence
MTAFDSSLVP SGDILIGGEW RRGRGATTPS FYPADGSLNT EIHMADAADA REAVQAADAA 
WRRADWAGLK PHQRADVLYR IADLIHAHRE ALAQLQRRDN GKPINETRAL VASAASTFRY
FAACAQTLDE ALTPSRGDYL SMSVHEPLGV VAAITPWNSP IASDAQKLGP ALAAGNAVVL
KPAEVTPLAS LALARLCEQA GVPRGVISVL PGKGSVIGDA LVRDPLVKKV SFTGGTEVGR
GIARLAAEKL MPLSLELGGK SPTIVFDDAE LDHAVNGVLY GIFSSSGESC IAGSRLFVQR
SIYGAFVARL VEAARKLRVG DPASERTQMG PLITARHRDT VERYVALGRD EGARVLCGGE
RPTGEGRDAG FFYLPTILDG LSNHARICRE EIFGPVLVAL PFDDEAALVA DANDSVFGLA
AGIWTRDYKR AWRVARALDA GTVWINTYKQ FSISTPFSGR KESGMGREKG SLGIREYMQQ
KSLYWGLNDS PLPWAN