Gene BURPS1106A_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0226 
Symbol 
ID4901930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp209085 
End bp210503 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID640133456 
Productputative coniferyl aldehyde dehydrogenase 
Protein accessionYP_001064509 
Protein GI126452218 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAACG ATCTGCCCGG TTTAGCCACG CTCGACGCGC TGTTGCGCGA CCAGCGCGCC 
GCCTATCTGC GCGCGCCGTA CCCGTCGTGG GCGACGCGCG CCGACCATCT GCGCGCGCTG
CGCAAGATGC TGCTCGAGAA CCGCGATGCG CTCGCCGCCG CGATCAACGC GGACTTCGGC
CATCGCGCGA AGGAAGAAGT GCTGATGTCC GAGATCTGGC TCGCGAAAGA GGAAATCGAC
GAGGCGCTCA AGCACGGCAA GCGCTGGATC AAGCCGAAGA GCCGGACGAT GAACAAGTGG
CTGCGCCCCG CGCGCGCGAA GGTGATGCCG CAGCCGCTCG GCGTCGTCGG CATCGTCGTG
CCGTGGAACT ACCCGGTGCT GCTCGCCGCG GGCCCGCTCA TCTGCGCGCT CGCCGCCGGC
AATCGCGCGA TCGTCAAGAT GTCCGAACTG ACGCCGCGCA CGTCGCAGCT GTTCGAGGAA
CTGATCTCGA AAACCTTCGC GCGCGATCAC GTCGCGGTGG TCAACGGCGA TGCGCAAATC
GGCGCGGCGT TCAGCGGGCT GCCGTTCGAT CATCTGCTCT TCACCGGCTC GACGAACGTC
GGCCGGCACG TGATGCGCGC GGCCGCCGAG CACCTCACGC CCGTCACGCT CGAGCTGGGC
GGCAAGTCGC CCGTGATCGT CGGGCCGCGC GCGCGCTTCG ACGCGGCGGT CGACGCCGTC
ATCACCGGCA AGACGCTGAA CGCGGGCCAG ACCTGCATCG CGCCCGACTA TGTGCTCGTG
CCGCGCGGCA AGGAAGCCGA ATTCGTCGCG CGCGCGCGCG CGCGGATGGC TCGGCTCTAT
CCGAATCTGT CGACGAACCC GGACTATACG TCGCTCATCT CCGAGCGCCA CTTCGCACGG
CTGCAGCGGC TCGCGAGCGA AGCGCAGCAG GCGGGCGCGC AACTCCATCC GCTCACGGAC
GCGGCGCCCG ATCCCGCGCT GCGCCGCCTG CCGCCCGTGC TCGTCACGCA GGCGCCCGAT
GCATCGCAGT TGATGCAGGA AGAGATCTTC GGGCCGCTGC TGCCGATCGT TCCGTACGAC
ACGCTCGACG ATGCGATCGC CTACGTGAAC GCGCGGCCGC GGCCGCTCGC GCTGTATCTG
TTCGACGAAG ACCGCACGAC CATCGAGCGC GTGATGCGCG ACACGATCTC GGGCGGCGTG
ACGGTCAACG ACACGCTGAT GCACATCGCG TGCGGCACGC TGCCGTTCGG CGGCGTCGGC
GCGAGCGGGA TGGGCGCGTA CCACGGCTAC GACGGCTTCG TCACGTTCTC GAAGATGAAG
CCCGTGCTCA CGCAGCCGCG CCTGAACACG CGCGCGATGA TCGCGCCGCC GTACGGCAAG
CGCTTCGCGG CGATCCTCAA GCTGATGCTG AAGTTCTGA
 
Protein sequence
MKNDLPGLAT LDALLRDQRA AYLRAPYPSW ATRADHLRAL RKMLLENRDA LAAAINADFG 
HRAKEEVLMS EIWLAKEEID EALKHGKRWI KPKSRTMNKW LRPARAKVMP QPLGVVGIVV
PWNYPVLLAA GPLICALAAG NRAIVKMSEL TPRTSQLFEE LISKTFARDH VAVVNGDAQI
GAAFSGLPFD HLLFTGSTNV GRHVMRAAAE HLTPVTLELG GKSPVIVGPR ARFDAAVDAV
ITGKTLNAGQ TCIAPDYVLV PRGKEAEFVA RARARMARLY PNLSTNPDYT SLISERHFAR
LQRLASEAQQ AGAQLHPLTD AAPDPALRRL PPVLVTQAPD ASQLMQEEIF GPLLPIVPYD
TLDDAIAYVN ARPRPLALYL FDEDRTTIER VMRDTISGGV TVNDTLMHIA CGTLPFGGVG
ASGMGAYHGY DGFVTFSKMK PVLTQPRLNT RAMIAPPYGK RFAAILKLML KF