Gene BMA10247_A1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A1801 
Symbol 
ID4889134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp1737952 
End bp1739517 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content75% 
IMG OID640148066 
Productaldehyde dehydrogenase (NADP) family protein 
Protein accessionYP_001078984 
Protein GI126445641 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGGC ATGCGGCGGC GGCATGCCGC AGGCGGCGCG CGGCGCGGCA ACGAGGCACG 
CCGCGCGGCG CGCGACGACT GAAGTTCGCA GCACATCGAA GAGACAAGGA GACATCGATG
ATCGACAGAC GGATGCTGAT CGGCGGCGCC TGGTGCGAGG CCGAACACGG CGCGACGTTC
GAGCGGCGCG ATCCGGTGAC GGGCGCGCTC GCGTCGCGCG CGCCGGCCGC GAGCGCCGCC
GACGCCGAGC GCGCGGTGGC CGCCGCGCAC GCGGCGTTTC CCGCGTGGGC CGCGCTCGCG
CCGACCGAGC GCCGCAGGCG CCTGCTGAAG GCGGCCGACC TGATGGACGC GCGCGGTGCG
GCGTTCGTCG CGGCGGGCGT CGCGGAAACG GGCGCGACGC CCGCGTGGAT CGGCTTGAAC
GTCGCGCTCG CGGCGAACGT GCTGCGCGAG GCGGCATCGA TGGCGACGCG GATCTCGGGC
GACGTGATGC CGTCCGACGT GCCCGGCAAT CTCGCGCTCG CGGTGCGCGC GCCGTGCGGC
GTCGTGCTCG GCATCGCCCC GTGGAACGCG CCCGTGATCC TCGGCACGCG CGCGCTCGCG
ATGCCGCTCG CGTGCGGCAA TACCGTCGTG CTGAAGGCGT CCGAGCTGTG CCCCGGCGTG
CATGCGCTGA TCGGCGCGGC GCTGCACGAC GCGGGGCTCG GCGACGGCGT CGTCAACGTG
CTCACGCACG CGGCCGCCGA CGCGCCCGCG CTCGTCGAGC GCCTGATCGC CGATCCGCGC
GTGCGGCGCG TGAACTTCAC GGGTTCGACG CACGTCGGGC GGATCGTCGC GCGGCTCGCA
GCCGAGCATC TGAAGCCCGC GCTGCTCGAA CTCGGCGGCA AGGCGCCCGT CGTCGTGCTC
GACGACGCCG ATCTCGACGC GGCCGTCGAC GCGATCGCGT TCGGCGCGTT CTTCAATCAA
GGGCAGATCT GCATGTCGAC CGAGCGCGTG ATCGCCGCGC GCGCGATCGC CGACGCGCTC
GTCGACAAGC TCGCCGCGAA GGCGCGCACG CTCGCCGCGG GCGATCCGCG CGCGGGCCTG
CCGCTCGGCG CGATGGTGAG CCGCGACGCG GCCGCGCGCG CGGCCGCGCT CGTCGACGAC
GCGGCGTCGC GCGGCGCCGC GCTGCCGCTC GGCTGCCGCG TCGACGGCGC GATCATGCAG
CCGACGATCG TCGATCGCGT GACGCCCGAC ATGCGGCTCT ATCGCGAGGA ATCGTTCGCG
CCCGTCGTCG CGGTGCTGCG CGCGGGCGAC GACGAACACG CGATCGCGCT CGCGAACGAC
AGCGCGTTCG GGCTCGCGGC GAGCGTGTTC GGCCGCGATC TCGCGCGGGC GCTGGCGGTG
GCGCGGCGCA TCGAATCGGG GATCTGCCAC GTGAACGGGC CGACCGTCCA CGACGAAGCG
CAGATGCCGT TCGGCGGCGT GAAGGCGAGC GGCTACGGGC GCTTCGGCGG CGCGGCGTCG
ATCGCGGAAT TCACCGAACT GCGCTGGCTC ACCGTGCAAA CCGCGCCGCG CGCGTATCCG
ATCTGA
 
Protein sequence
MKRHAAAACR RRRAARQRGT PRGARRLKFA AHRRDKETSM IDRRMLIGGA WCEAEHGATF 
ERRDPVTGAL ASRAPAASAA DAERAVAAAH AAFPAWAALA PTERRRRLLK AADLMDARGA
AFVAAGVAET GATPAWIGLN VALAANVLRE AASMATRISG DVMPSDVPGN LALAVRAPCG
VVLGIAPWNA PVILGTRALA MPLACGNTVV LKASELCPGV HALIGAALHD AGLGDGVVNV
LTHAAADAPA LVERLIADPR VRRVNFTGST HVGRIVARLA AEHLKPALLE LGGKAPVVVL
DDADLDAAVD AIAFGAFFNQ GQICMSTERV IAARAIADAL VDKLAAKART LAAGDPRAGL
PLGAMVSRDA AARAAALVDD AASRGAALPL GCRVDGAIMQ PTIVDRVTPD MRLYREESFA
PVVAVLRAGD DEHAIALAND SAFGLAASVF GRDLARALAV ARRIESGICH VNGPTVHDEA
QMPFGGVKAS GYGRFGGAAS IAEFTELRWL TVQTAPRAYP I