Gene Information |
Locus tag | BMA10247_A1801 |
Symbol | |
ID | 4889134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | - |
Start bp | 1737952 |
End bp | 1739517 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640148066 |
Product | aldehyde dehydrogenase (NADP) family protein |
Protein accession | YP_001078984 |
Protein GI | 126445641 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
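The length fields in this table can be cross-checked against the coordinates. Assuming Start bp and End bp are inclusive, 1-based positions (an assumption, since the record does not state its coordinate convention), the gene length is End bp - Start bp + 1, and for an unspliced CDS the protein length is the codon count minus the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1737952, 1739517)
print(length, protein_length(length))  # prints "1566 521"
```

Both results agree with the Gene Length (1566 bp) and Protein Length (521 aa) rows.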
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGGC ATGCGGCGGC GGCATGCCGC AGGCGGCGCG CGGCGCGGCA ACGAGGCACG CCGCGCGGCG CGCGACGACT GAAGTTCGCA GCACATCGAA GAGACAAGGA GACATCGATG ATCGACAGAC GGATGCTGAT CGGCGGCGCC TGGTGCGAGG CCGAACACGG CGCGACGTTC GAGCGGCGCG ATCCGGTGAC GGGCGCGCTC GCGTCGCGCG CGCCGGCCGC GAGCGCCGCC GACGCCGAGC GCGCGGTGGC CGCCGCGCAC GCGGCGTTTC CCGCGTGGGC CGCGCTCGCG CCGACCGAGC GCCGCAGGCG CCTGCTGAAG GCGGCCGACC TGATGGACGC GCGCGGTGCG GCGTTCGTCG CGGCGGGCGT CGCGGAAACG GGCGCGACGC CCGCGTGGAT CGGCTTGAAC GTCGCGCTCG CGGCGAACGT GCTGCGCGAG GCGGCATCGA TGGCGACGCG GATCTCGGGC GACGTGATGC CGTCCGACGT GCCCGGCAAT CTCGCGCTCG CGGTGCGCGC GCCGTGCGGC GTCGTGCTCG GCATCGCCCC GTGGAACGCG CCCGTGATCC TCGGCACGCG CGCGCTCGCG ATGCCGCTCG CGTGCGGCAA TACCGTCGTG CTGAAGGCGT CCGAGCTGTG CCCCGGCGTG CATGCGCTGA TCGGCGCGGC GCTGCACGAC GCGGGGCTCG GCGACGGCGT CGTCAACGTG CTCACGCACG CGGCCGCCGA CGCGCCCGCG CTCGTCGAGC GCCTGATCGC CGATCCGCGC GTGCGGCGCG TGAACTTCAC GGGTTCGACG CACGTCGGGC GGATCGTCGC GCGGCTCGCA GCCGAGCATC TGAAGCCCGC GCTGCTCGAA CTCGGCGGCA AGGCGCCCGT CGTCGTGCTC GACGACGCCG ATCTCGACGC GGCCGTCGAC GCGATCGCGT TCGGCGCGTT CTTCAATCAA GGGCAGATCT GCATGTCGAC CGAGCGCGTG ATCGCCGCGC GCGCGATCGC CGACGCGCTC GTCGACAAGC TCGCCGCGAA GGCGCGCACG CTCGCCGCGG GCGATCCGCG CGCGGGCCTG CCGCTCGGCG CGATGGTGAG CCGCGACGCG GCCGCGCGCG CGGCCGCGCT CGTCGACGAC GCGGCGTCGC GCGGCGCCGC GCTGCCGCTC GGCTGCCGCG TCGACGGCGC GATCATGCAG CCGACGATCG TCGATCGCGT GACGCCCGAC ATGCGGCTCT ATCGCGAGGA ATCGTTCGCG CCCGTCGTCG CGGTGCTGCG CGCGGGCGAC GACGAACACG CGATCGCGCT CGCGAACGAC AGCGCGTTCG GGCTCGCGGC GAGCGTGTTC GGCCGCGATC TCGCGCGGGC GCTGGCGGTG GCGCGGCGCA TCGAATCGGG GATCTGCCAC GTGAACGGGC CGACCGTCCA CGACGAAGCG CAGATGCCGT TCGGCGGCGT GAAGGCGAGC GGCTACGGGC GCTTCGGCGG CGCGGCGTCG ATCGCGGAAT TCACCGAACT GCGCTGGCTC ACCGTGCAAA CCGCGCCGCG CGCGTATCCG ATCTGA
|
Protein sequence | MKRHAAAACR RRRAARQRGT PRGARRLKFA AHRRDKETSM IDRRMLIGGA WCEAEHGATF ERRDPVTGAL ASRAPAASAA DAERAVAAAH AAFPAWAALA PTERRRRLLK AADLMDARGA AFVAAGVAET GATPAWIGLN VALAANVLRE AASMATRISG DVMPSDVPGN LALAVRAPCG VVLGIAPWNA PVILGTRALA MPLACGNTVV LKASELCPGV HALIGAALHD AGLGDGVVNV LTHAAADAPA LVERLIADPR VRRVNFTGST HVGRIVARLA AEHLKPALLE LGGKAPVVVL DDADLDAAVD AIAFGAFFNQ GQICMSTERV IAARAIADAL VDKLAAKART LAAGDPRAGL PLGAMVSRDA AARAAALVDD AASRGAALPL GCRVDGAIMQ PTIVDRVTPD MRLYREESFA PVVAVLRAGD DEHAIALAND SAFGLAASVF GRDLARALAV ARRIESGICH VNGPTVHDEA QMPFGGVKAS GYGRFGGAAS IAEFTELRWL TVQTAPRAYP I
|
| |
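The gene and protein sequences above are printed in space-separated blocks of ten characters. As a hedged illustration, the Python sketch below strips that layout and computes the GC fraction of a nucleotide string, which is one way to compare a sequence against the GC content row. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
example = clean_sequence("ATGAAGCGGC ATGCGGCGGC")
print(len(example), gc_fraction(example))
```

Passing the complete gene sequence through `clean_sequence` should also yield a string whose length matches the Gene Length row, which is a quick check that no blocks were lost in copying.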