Gene BMA10247_3050 details


Gene Information

Locus tag: BMA10247_3050
Symbol:
ID: 4894414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009080
Strand:
Start bp: 3014018
End bp: 3015637
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 67%
IMG OID: 640151698
Product: aldehyde dehydrogenase family protein
Protein accession: YP_001082570
Protein GI: 126448839
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
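
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3014018
end_bp = 3015637

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1620, matching "Gene Length"
print(protein_length)  # 539, matching "Protein Length"
```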


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCGCG GCGCGGCGTG CTGGCACGGT TTTGTCGTAA CGAACGGCGT CGCGCACGTC 
GCGCGGCGCA TCACCGATAC CGACATGCGA GGAGCAACGA TGAATCACGC GGACATGCAA
CATCTGAACA TCGAATTCCC GTACCGCAAG CAGTACGGGA ATTTCATCGG CGGCGAATGG
GTCGCCCCGG TCGGCGGCGA GTATTTCGAC AACGTCTCGC CCGTCACCGG CCGGCCGTTC
ACCGCGATCC CTCGCTCGCG CGAAGCCGAC ATCGAGCTCG CGCTCGACGC CGCTCACGCG
GCCAAGGCGG GCTGGGCCGC GAAGGGCGCG GCCGAGCGCG CGAACGTGCT GCTGAGGATC
GCCGACCGGA TGGAGGCGAA CCTCACGCGC CTCGCCGTCG CCGAGACGAT CGACAACGGC
AAGCCGCTGC GCGAAACCAC CGCAGCCGAC GTGCCGCTCG CGATCGACCA CTTCCGCTAC
TTCGCGGGCT GCATCCGCGC GCAGGAAGGC TCGATCGCCG ATATCGGCGG CGACATGGTG
GCCTACCACT TCCACGAGCC GCTCGGCGTC GTCGGCCAGA TCATCCCGTG GAACTTCCCG
CTGCTGATGG CCGCGTGGAA GCTCGCGCCG GCGCTCGCGG CCGGCAACTG CGTCGTGCTC
AAGCCGGCCG AGCAGACGCC CGCGTCGATC CTCGTGTTCG CCGAGCTGAT CCAGGATCTG
CTGCCGCCCG GCGTGCTCAA CATCGTCAAC GGCTTCGGCC TCGAGGCCGG CAAGCCGCTC
GCGTCGAGCA AGCGGATCGC GAAGATCGCG TTCACGGGCG AGACGTCGAC GGGCCGCCTC
ATCATGCAGT ACGCGAGCGA GAACCTGATT CCCGTCACGC TCGAGCTGGG CGGCAAGAGC
CCGAATATTT TCTTCGCCGA CGTGATGGAT CGCGACGACA GCTACTTCGA CAAGGCGCTC
GAAGGCTTCG CGATGTTCGC GCTGAACCAG GGCGAAGTCT GCACGTGCCC ATCGCGCGCG
CTCGTCGAGG AGAGCATCTA CGATCGCTTC ATCGAACGCG CGCTCAAGCG CGTCGAGGCG
ATCAAGCAGG GCCATCCGCT CGATTCGCAG ACGATGATCG GCGCGCAGGC GTCGGCCGAG
CAGCTCGAGA AGATCCTGTC GTACATCGAC ATCGGCCGCG GCGAAGGCGC GCAATGCCTG
ACGGGCGGCG AGCGCAACGT GCTCGGCGGC GAGCTCGCCG AAGGCTATTA CGTGAAGCCG
ACCGTGTTCC GCGGCCACAA CAAGATGCGC ATCTTCCAGG AAGAAATCTT CGGGCCGGTG
CTCGCGGTGA CGACGTTCAA GACCGAGGAG GAAGCGCTCG AGATCGCGAA CGACACGCTG
TACGGCCTGG GCGCCGGCGT CTGGACGCGC GACGGCAACC GCGCGTACCG CTTCGGCCGC
GGCATCCAGG CGGGCCGCGT GTGGACGAAC TGCTATCACG CGTATCCGGC GCACGCGGCG
TTCGGCGGCT ACAAGCAATC CGGCATCGGC CGCGAGACGC ACAAGATGAT GCTCGACCAC
TACCAGCAGA CGAAGAACCT GCTCGTCAGC TACAGCGAAA AGCCGCTCGG GTTCTTCTGA
 
Protein sequence
MHRGAACWHG FVVTNGVAHV ARRITDTDMR GATMNHADMQ HLNIEFPYRK QYGNFIGGEW 
VAPVGGEYFD NVSPVTGRPF TAIPRSREAD IELALDAAHA AKAGWAAKGA AERANVLLRI
ADRMEANLTR LAVAETIDNG KPLRETTAAD VPLAIDHFRY FAGCIRAQEG SIADIGGDMV
AYHFHEPLGV VGQIIPWNFP LLMAAWKLAP ALAAGNCVVL KPAEQTPASI LVFAELIQDL
LPPGVLNIVN GFGLEAGKPL ASSKRIAKIA FTGETSTGRL IMQYASENLI PVTLELGGKS
PNIFFADVMD RDDSYFDKAL EGFAMFALNQ GEVCTCPSRA LVEESIYDRF IERALKRVEA
IKQGHPLDSQ TMIGAQASAE QLEKILSYID IGRGEGAQCL TGGERNVLGG ELAEGYYVKP
TVFRGHNKMR IFQEEIFGPV LAVTTFKTEE EALEIANDTL YGLGAGVWTR DGNRAYRFGR
GIQAGRVWTN CYHAYPAHAA FGGYKQSGIG RETHKMMLDH YQQTKNLLVS YSEKPLGFF
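
The sequences above are displayed in space-separated blocks of 10 characters, 60 per line. Before any length or composition check, that display whitespace has to be removed. The following is a rough sketch of how that could be done; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove the display whitespace (spaces, newlines) from a sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGCACCGCG GCGCGGCGTG CTGGCACGGT TTTGTCGTAA CGAACGGCGT CGCGCACGTC"
cleaned = clean_sequence(first_line)

print(len(cleaned))          # 60 bases per full display line
print(cleaned[:3])           # ATG, the start codon
print(gc_fraction(cleaned))  # GC fraction of this line only, not the whole gene
```

Applied to the full gene sequence, the same helpers would allow a comparison against the reported gene length and GC content fields.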