Gene BURPS668_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0214 
Symbol 
ID4882138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp203241 
End bp204659 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID640126142 
Productputative coniferyl aldehyde dehydrogenase 
Protein accessionYP_001057267 
Protein GI126438854 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAACG ATCTGCCCGG TTTAGCCACG CTCGACGCGC TGTTGCGCGA CCAGCGCGCC 
GCCTATCTGC GCGCGCCGTA CCCGTCGTGG GCGACGCGCG CCGACCACCT GCGCGCGCTG
CGCAAGATGC TGCTCGAGAA CCGCGATGCG CTCGCCGCCG CGATCAACGC GGACTTCGGC
CATCGCGCGA AGGAAGAAGT GCTGATGTCC GAGATCTGGC TCGCGAAAGA GGAAATCGAC
GAGGCGCTCA AGCACGGCAA GCGCTGGATC AAGCCGAAGA GCCGGACGAT GAACAAGTGG
CTGCGCCCCG CGCGCGCGAA GGTGATGCCG CAGCCGCTCG GCGTCGTCGG CATCGTCGTG
CCGTGGAACT ACCCGGTGCT GCTCGCCGCG GGCCCGCTCA TCTGCGCGCT CGCCGCCGGC
AATCGCGCGA TCGTCAAGAT GTCCGAACTG ACGCCGCGCA CGTCGCAGCT GTTCGAGGAA
CTGATCTCGA AAACCTTCGC GCGCGATCAC GTCGCGGTGG TCAACGGCGA TGCGCAAATC
GGCGCGGCGT TCAGCGGGCT GCCGTTCGAT CATCTGCTCT TCACCGGCTC GACGAACGTC
GGCCGGCACG TGATGCGCGC GGCCGCCGAG CACCTCACGC CCGTCACGCT CGAGCTGGGC
GGCAAGTCGC CCGTGATCGT CGGGCCGCGC GCGCGCTTCG ACGCGGCGGT CGACGCCGTC
ATCACCGGCA AGACGCTGAA CGCGGGCCAG ACCTGCATCG CGCCCGACTA TGTGCTCGTG
CCGCGCGGCA AGGAAGCCGA ATTCGTCGCG CGCGCGCGCG CGCGGATGGC TCGGCTCTAT
CCGAATCTGT CGACGAACCC GGACTATACG TCGATCATCT CCGAGCGCCA CTTCGCACGG
CTGCAGCGGC TCGCGAGCGA AGCGCAGCAG GCGGGCGCGC AACTCCATCC GCTCACGGAC
GCGGCGCCCG ATCCCGCGCT GCGCCGCCTG CCGCCCGTGC TCGTCACGCA GGCGCCCGAT
GCGTCGCAGT TGATGCAGGA AGAGATCTTC GGGCCGCTGC TGCCGATCGT TCCGTACGAC
ACGCTCGACG ATGCGATCGC CTACGTGAAC GCGCGGCCGC GACCGCTCGC GCTGTATCTG
TTCGACGAAG ACCGCACGAC CATCGAGCGC GTGATGCGCG ACACGATCTC GGGCGGCGTG
ACGGTCAACG ACACGCTGAT GCACATCGCG TGCGGCACGC TGCCGTTCGG CGGCGTCGGC
GCGAGCGGGA TGGGCGCGTA CCACGGCTAC GACGGCTTCG TCACGTTCTC GAAGATGAAG
CCCGTGCTCA CGCAGCCGCG CCTGAACACG CGCGCGATGA TCGCGCCGCC GTACGGCAAG
CGCTTCGCGG CGATCCTCAA GCTGATGCTG AAGTTCTGA
 
Protein sequence
MKNDLPGLAT LDALLRDQRA AYLRAPYPSW ATRADHLRAL RKMLLENRDA LAAAINADFG 
HRAKEEVLMS EIWLAKEEID EALKHGKRWI KPKSRTMNKW LRPARAKVMP QPLGVVGIVV
PWNYPVLLAA GPLICALAAG NRAIVKMSEL TPRTSQLFEE LISKTFARDH VAVVNGDAQI
GAAFSGLPFD HLLFTGSTNV GRHVMRAAAE HLTPVTLELG GKSPVIVGPR ARFDAAVDAV
ITGKTLNAGQ TCIAPDYVLV PRGKEAEFVA RARARMARLY PNLSTNPDYT SIISERHFAR
LQRLASEAQQ AGAQLHPLTD AAPDPALRRL PPVLVTQAPD ASQLMQEEIF GPLLPIVPYD
TLDDAIAYVN ARPRPLALYL FDEDRTTIER VMRDTISGGV TVNDTLMHIA CGTLPFGGVG
ASGMGAYHGY DGFVTFSKMK PVLTQPRLNT RAMIAPPYGK RFAAILKLML KF