Gene BURPS1106A_A1068 details

Gene Information

Locus tag: BURPS1106A_A1068
Symbol: vdh
ID: 4906139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1024281
End bp: 1025846
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 75%
IMG OID: 640144174
Product: vanillin dehydrogenase
Protein accession: YP_001075103
Protein GI: 126458039
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
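
The start, end, gene length, and protein length reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function name cds_lengths is illustrative, not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1024281, 1025846)
print(gene_len, protein_len)  # 1566 521
```

Under these assumptions the result agrees with the listed gene length of 1566 bp and protein length of 521 aa.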


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGGC ATGCGGCGGC GGCATGCCGC AGGCGGCGCG CGGCGCGGCA ACGAGGCACG 
CCGCGCGGCG CGCGACGACT GAAGTTCGCA GCACATCGAA GAGACAAGGA GACATCGATG
ATCGACAGAC GGATGCTGAT CGGCGGCGCC TGGTGCGAGG CCGAACACGG CGCGACGTTC
GAGCGGCGCG ATCCGGTGAC GGGCGCGCTC GCGTCGCGCG CGCCGGCCGC GAGCGCCGCC
GACGCCGAGC GCGCGGTGGC CGCCGCGCAC GCGGCGTTTC CCGCGTGGGC CGCGCTCGCG
CCGACCGAGC GCCGCAGGCG CCTGCTGAAG GCGGCCGACC TGATGGACGC GCGCGGTGCG
GCGTTCGTCG CGGCGGGCGT CGCGGAAACG GGCGCGACGC CCGCGTGGAT CGGCTTGAAC
GTCGCGCTCG CGGCGAACGT GCTGCGCGAG GCGGCATCGA TGGCGACGCG GATCTCGGGC
GACGTGATGC CGTCCGACGT GCCCGGCAAT CTCGCGCTCG CGGTGCGCGC GCCGTGCGGC
GTCGTGCTCG GCATCGCCCC GTGGAACGCG CCCGTGATCC TCGGCACGCG CGCGCTCGCG
ATGCCGCTCG CGTGCGGCAA TACCGTCGTG CTGAAGGCGT CCGAGCTGTG CCCCGGCGTG
CATGCGCTGA TCGGCGCGGC GCTGCACGAC GCGGGGCTCG GCGACGGCGT CGTCAACGTG
CTCACGCACG CGGCCGCCGA CGCGCCCGCG CTCGTCGAGC GCCTGATCGT CGATCCGCGC
GTGCGGCGCG TGAACTTCAC GGGTTCGACG CACGTCGGGC GGATCGTCGC GCGGCTCGCA
GCCGAGCATC TGAAGCCCGC GCTGCTCGAA CTCGGCGGCA AGGCGCCCGT CGTCGTGCTC
GACGACGCCG ATCTCGACGC GGCCGTCGAC GCGATCGCGT TCGGCGCGTT CTTCAATCAA
GGGCAGATCT GCATGTCGAC CGAGCGCGTG ATCGCCGCGC GCGCGATCGC CGACGCGCTC
GTCGACAAGC TCGCCGCGAA GGCGCGCACG CTCGCCGCGG GCGATCCGCG CGCGGGCCTG
CCGCTCGGCG CGATGGTGAG CCGCGACGCG GCCGCGCGCG CGGCCGCGCT CGTCGACGAC
GCGGCGTCGC GCGGCGCCGC GCTGCCGCTC GGCTGCCGCG TCGACGGCGC GATCATGCAG
CCGACGATCG TCGATCGCGT GACGCCCGAC ATGCGGCTCT ATCGCGAGGA ATCGTTCGCG
CCCGTCGTCG CGGTGCTGCG CGCGGGCGAC GACGAACACG CGATCGCGCT CGCGAACGAC
AGCGCGTTCG GGCTCGCGGC GAGCGTGTTC GGCCGCGATC TCGCGCGGGC GCTGGCGGTG
GCGCGGCGCA TCGAATCGGG GATCTGCCAC GTGAACGGGC CGACCGTCCA CGACGAAGCG
CAGATGCCGT TCGGCGGCGT GAAGGCGAGC GGCTACGGGC GCTTCGGCGG CGCGGCGTCG
ATCGCGGAAT TCACCGAACT GCGCTGGCTC ACCGTGCAAA CCGCGCCGCG CGCGTATCCG
ATCTGA
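
The GC content reported in the Gene Information section can be recomputed from the sequence listing above. This is a minimal sketch that ignores the spaces and line breaks of the 10-base block layout. How the source database computes or rounds its figure is not stated, so small differences are possible. The function name gc_percent is illustrative.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any non-ACGT characters are skipped, so the
    block-formatted listing can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_percent("ATGAAGCGGC"))  # 60.0
```

Passing the full 1566-base listing to gc_percent gives the value to compare against the reported 75%.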
 
Protein sequence
MKRHAAAACR RRRAARQRGT PRGARRLKFA AHRRDKETSM IDRRMLIGGA WCEAEHGATF 
ERRDPVTGAL ASRAPAASAA DAERAVAAAH AAFPAWAALA PTERRRRLLK AADLMDARGA
AFVAAGVAET GATPAWIGLN VALAANVLRE AASMATRISG DVMPSDVPGN LALAVRAPCG
VVLGIAPWNA PVILGTRALA MPLACGNTVV LKASELCPGV HALIGAALHD AGLGDGVVNV
LTHAAADAPA LVERLIVDPR VRRVNFTGST HVGRIVARLA AEHLKPALLE LGGKAPVVVL
DDADLDAAVD AIAFGAFFNQ GQICMSTERV IAARAIADAL VDKLAAKART LAAGDPRAGL
PLGAMVSRDA AARAAALVDD AASRGAALPL GCRVDGAIMQ PTIVDRVTPD MRLYREESFA
PVVAVLRAGD DEHAIALAND SAFGLAASVF GRDLARALAV ARRIESGICH VNGPTVHDEA
QMPFGGVKAS GYGRFGGAAS IAEFTELRWL TVQTAPRAYP I
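
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation. Table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons. Because this gene begins with ATG, the sketch does not handle alternative starts, which is a simplifying assumption. The names CODON_TABLE and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listing (60 bases, 20 codons).
first_line = "ATGAAGCGGC ATGCGGCGGC GGCATGCCGC AGGCGGCGCG CGGCGCGGCA ACGAGGCACG"
print(translate(first_line))  # MKRHAAAACRRRRAARQRGT
```

The output matches the first 20 residues of the protein sequence, and the final six bases of the gene (ATCTGA) translate to the terminal isoleucine followed by a stop codon.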