Gene Information |
Locus tag | BURPS1106A_A0043 |
Symbol | |
ID | 4905843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 33094 |
End bp | 34674 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640143150 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001074086 |
Protein GI | 126456147 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
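The coordinate and length fields above are arithmetically related, and the relationship can be checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; both assumptions match the values listed in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 33094
end_bp = 34674

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1581
print(protein_length)   # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.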
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0253475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCA CCGGCGAGAT GTTGATTGGC GCGGCCGCGG TGCGCGGTAG CGAAGGCACG ATGCGCGCTT ACGCGCCGGC GCAGGGCGTC GAGCTCGAGC CGACGTTCGG CGCGGGCGGT GCGGCCGACG TCGATCGCGC GTGCCGCCTC GCGAACGCCG CTTTCGATCC CTTTCGTCAG GCGCCGCTCG AGACGCGCGC ACGCTTTCTC GAGGCGATCG CCGAGCGCAT CGTCGGGCTC GGCGATCCAT TGATCGAACG CGCGCACGCG GAATCGGCGC TGCCCGTCGC GCGGCTCGAA GGCGAGCGCG CGCGCACGGT CGGTCAGCTC AGGCTCTTCG CGGCGATCGT GCGCGACGGC CGCTGGCTGA GCGCGACGCT CGATTCCGCG CAGCCCGAGC GCAAGCCGCT GCCGCGCGCC GATCTGCGCT TGCAGAAGAT TCCCGTCGGC CCGGTCGCGG TGTTCGGCGC GAGCAATTTC CCGCTCGCGT TCTCGGTCGC GGGCGGCGAC ACCGCTTCGG CGTTCGCGGC CGGCTGCCCC GTCGTCGCGA AGGCGCACCC CGCGCATCTC GGCACGTCGG AGCTCGTCGG GCGCGCGATC CGGCAGGCTG TCGCCGATTG CGGTTTGCAC GAGGGCGTGT TCTCGCTCGT CGTCGGCGCG GGCAACGCGA TCGGCGAGGC GCTCGTCGCG CATCCCGCGA TCAGGGCGGT CGGCTTCACC GGCTCGCGCG CGGGCGGCCT TGCGCTGATG GGCGTTGCCG CGCGGCGGCA CGAGCCGATT CCGGTCTTCG CGGAAATGAG CAGCATCAAT CCGTTCTTCG TGTTGCCCGG CGCGTTGCGC GCACGCGGTG CGCAAATCGC GCAAGGCTTC GTCGAATCGC TGACGCTCGG CGTCGGGCAG TTCTGCACGA ACCCGGGGCT CGTCGTCGCG CTCGAAGGGC CCGACCTGAA GGCGTTCGTC GACGCGGCCG CGCAGGCGCT CTCGCAAAAG GGCGCGCAGA CGATGCTGAC CTCGGGCATC GCGTCGTCTT ACGAGAGCGC GGTCGCGGCG CGCCGCGCGG CCGCGGGCGT CAGCGAGGTC GCGCGCGGCG TGCGCAGCGA CGCGCGGAAC GCCGCGTTGC CCGCGCTCTT CACGACGACG CACACGCAGT TCGTCCAGAA CCCGCAGCTC GAAGCCGAGA TCTTCGGGCC GACGTCGCTC GTCGTCGCGT GCCGCGACAT CGACGAGATG ATCGCGCTTG CCGAGCATGT CGAGGGGCAA CTGAGCGCGA CGCTGCATCT CGAAGACGAC GATGTCGATC TGGCGCGCAA ACTGCTGCCG ACGCTCGAGC GCCGCGCCGG CCGCATCGTC GCGAACGGCT ATCCGACGGG CGTCGAGGTC GCGTACGCGA TGGTGCACGG CGGGCCGTTT CCGGCGACGT CGGACCCGCG CAGCACATCG GTGGGCGCGC TTGCGATCGA GCGCTTCCTG CGGCCCGTCT GCTATCAGGA TTTGCCGGCG GCGTTGTTGC CCGAGGCGCT CGCCGACGCG AATCCGCTCG GCCTCTGGCG CCTGCGCGAC GGCCAACTCG GCAAGGCGTG A
|
Protein sequence | MQITGEMLIG AAAVRGSEGT MRAYAPAQGV ELEPTFGAGG AADVDRACRL ANAAFDPFRQ APLETRARFL EAIAERIVGL GDPLIERAHA ESALPVARLE GERARTVGQL RLFAAIVRDG RWLSATLDSA QPERKPLPRA DLRLQKIPVG PVAVFGASNF PLAFSVAGGD TASAFAAGCP VVAKAHPAHL GTSELVGRAI RQAVADCGLH EGVFSLVVGA GNAIGEALVA HPAIRAVGFT GSRAGGLALM GVAARRHEPI PVFAEMSSIN PFFVLPGALR ARGAQIAQGF VESLTLGVGQ FCTNPGLVVA LEGPDLKAFV DAAAQALSQK GAQTMLTSGI ASSYESAVAA RRAAAGVSEV ARGVRSDARN AALPALFTTT HTQFVQNPQL EAEIFGPTSL VVACRDIDEM IALAEHVEGQ LSATLHLEDD DVDLARKLLP TLERRAGRIV ANGYPTGVEV AYAMVHGGPF PATSDPRSTS VGALAIERFL RPVCYQDLPA ALLPEALADA NPLGLWRLRD GQLGKA
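The gene and protein sequences above can be cross-checked by translating the DNA with translation table 11. The sketch below is one way to do that in plain Python, not the method used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the amino acid string is the standard codon assignment with bases in TCAG order, which table 11 shares with table 1 for elongation codons. Only short excerpts of the gene sequence are used here; the full sequence can be passed in the same way, since the 10-base spacing is stripped before processing.

```python
# Codon table built from the standard amino acid assignment (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCAGATCA CCGGCGAGAT GTTGATTGGC"
# Final 12 bases of the gene sequence (in frame, since 1581 is a multiple of 3).
last_bases = "GGCAAGGCGTGA"

print(translate(first_blocks))  # MQITGEMLIG
print(translate(last_bases))    # GKA
print(gc_fraction(first_blocks))
```

The two translated excerpts match the start (MQITGEMLIG) and end (GKA) of the protein sequence listed above. The GC fraction printed here covers only the 30-base excerpt, so it is not expected to equal the 72% reported for the whole gene.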
|
| |