Gene BURPS668_0888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0888 
Symbol 
ID4882051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp868598 
End bp870049 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content73% 
IMG OID640126816 
Productputative vanillin dehydrogenase 
Protein accessionYP_001057939 
Protein GI126438947 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.359253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGACA TCTCGATGCT GATCGGCGGC GAGCGTCGCC CGGCCACGGG CGGCGCGACG 
TTCGAGCGCC GCAATCCGCT CGACGGGGAG CTCGCGACGC GCGCACCCGC CGCGACCGCC
GCCGACGCGC GCGCGGCCGT GGACGCCGCA TCCGCCGCGT TCGCGCCGTG GGCCGCGCTC
GGCCCGAGCG CGCGCCGCGC GCTGCTGCTG AAGGCCGCTG CCGCGCTCGA GGGCAAGCGC
GACGCGTTCA TCGCCGCGAT GGCGGCCGAG ACGGGCGCAT CGGCGATCTG GGCACGGTTC
AACGTCGAGC TCGCCGCGAA CGGCCTCGTC GAGGCGGCCG CGCTGACGAC GCAGATCGGC
GGCGAGCTGA TTCCGTCCGA CGTGCCGGGC TCGCTCGCGA TGGGCGTGCG GCAGCCGGCG
GGCGTCGTGC TCGGCATCGC GCCGTGGAAC GCACCCGTGA TCCTCGGCGT GCGTGCGCTC
GCGCTGCCGC TCGCATGCGG CAACACGGTG GTGTTCAAGG GCTCGGAGCT GTGCCCGGCC
ACGCACGGCC TCATCGCCGA CGCGCTGCAC GAAGCGGGGC TGCCTCGCGG CGTCGTGAAT
TTCGTGACGA ACGCGCCCGC CGATGCCGGC GCCGTCGTCG ACGCGATGAT CGCGCACCCG
GCCGTGCGCC GCGTGAACTT CACGGGCTCG ACGCGGGTGG GCCGGATCAT CGCCGAGCGC
TGCGCACGGC ATCTGAAGCC CGCCGTGCTC GAGCTCGGCG GCAAGGCGCC GTTCGTCGTG
CTCGACGACG CCGATCTCGA CGCGGCCGTC GCGGCGGCTG CGTTCGGCGC GTTCGCGAAT
TCCGGGCAGA TCTGCATGTC GACCGAGCGG ATCATCGTCG ACGAGCGGAT CGCCGACGCG
TTCGTCGCGA AGCTCGCCGA CAAGGCCGCG TCGCTGCCGT TGGGCGATCC GCGCAACGGG
CCCGTCGTGC TCGGCTCGGT GATCGACGCA CAGACCGTCG AGCGCTGCAA CGCGCTCATC
GACGACGCGC TCGCGAAAGG CGCGGTGCTG CGCTGCGGCG GCAAGGCCGA CAGCACGCTG
ATGCCCGCGA CGCTCGTCGA CCGCGTGACG CCCGCGATGC GCCTCTACGC GGAGGAATCG
TTCGGGCCGG TGAAGGGCAT CGTGCGCGTC GCGGGCGAGG AGGCGGCGAT CGCGTGCGCG
AACGACAACG CGTTCGGCCT GTCGTCGGCC GTGTTCAGCC GCGACGTCGC ACGCGCGATG
CGCGTTGCCG CGCGGATCGA AGCGGGCATC TGCCACGTGA ACGGGCCGAC CGTTCACGAC
GAGGCGCAGA TGCCGTTCGG CGGCATGAAG GACAGCGGCT TCGGCCACTT CGGCGGCAAG
GCGGGCATCG CCGAGTTCAC CGATCTGCGC TGGATCACCG TGCAGACGGC CCCGCGCCAC
TATCCGTTCT GA
 
Protein sequence
MQDISMLIGG ERRPATGGAT FERRNPLDGE LATRAPAATA ADARAAVDAA SAAFAPWAAL 
GPSARRALLL KAAAALEGKR DAFIAAMAAE TGASAIWARF NVELAANGLV EAAALTTQIG
GELIPSDVPG SLAMGVRQPA GVVLGIAPWN APVILGVRAL ALPLACGNTV VFKGSELCPA
THGLIADALH EAGLPRGVVN FVTNAPADAG AVVDAMIAHP AVRRVNFTGS TRVGRIIAER
CARHLKPAVL ELGGKAPFVV LDDADLDAAV AAAAFGAFAN SGQICMSTER IIVDERIADA
FVAKLADKAA SLPLGDPRNG PVVLGSVIDA QTVERCNALI DDALAKGAVL RCGGKADSTL
MPATLVDRVT PAMRLYAEES FGPVKGIVRV AGEEAAIACA NDNAFGLSSA VFSRDVARAM
RVAARIEAGI CHVNGPTVHD EAQMPFGGMK DSGFGHFGGK AGIAEFTDLR WITVQTAPRH
YPF