Gene Information |
Locus tag | BURPS1106A_A1242 |
Symbol | |
ID | 4903718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1176330 |
End bp | 1177820 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640144348 |
Product | aldehyde dehydrogenase (NAD) family protein |
Protein accession | YP_001075277 |
Protein GI | 126456955 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGCAT TCGACTCATC GCTCGTGCCG TCGGGCGACA TCCTGATCGG CGGAGAGTGG CGCCGCGGCC GCGGCGCGAC GACGCCGAGC TTCTATCCGG CCGACGGCTC GCTCAACACC GAGATCCACA TGGCCGACGC GGCCGACGCG CGCGAAGCGG TGCAGGCGGC CGACGCCGCG TGGCGCCGCG CGGACTGGGC GGGCCTGAAG CCGCATCAGC GCGCGGACGT GCTGTACCGC ATCGCCGATC TGATTCACGC GCATCGCGAG GCGCTCGCGC AACTGCAGCG GCGCGACAAC GGCAAGCCGA TCAACGAGAC GCGCGCGCTC GTCGCGAGCG CGGCGAGCAC GTTCCGCTAT TTCGCCGCGT GCGCGCAGAC GCTCGACGAA GCGCTGACGC CGTCGCGCGG CGATTATCTG TCGATGAGCG TGCACGAGCC GCTCGGCGTC GTCGCGGCGA TCACGCCGTG GAATTCGCCG ATCGCCTCCG ATGCGCAGAA GCTCGGCCCG GCGCTCGCGG CGGGCAACGC CGTCGTGCTG AAGCCGGCCG AGGTGACGCC GCTCGCGTCG CTCGCGCTCG CGCGGCTCTG CGAGCAGGCG GGCGTGCCGC GCGGCGTGAT CTCGGTGCTG CCGGGCAAGG GTTCGGTGAT CGGCGATGCG CTCGTGCGCG ATCCGCTCGT GAAGAAGGTG TCGTTTACGG GCGGCACCGA GGTGGGCCGG GGCATCGCGC GGCTCGCGGC CGAGAAGCTG ATGCCGCTGT CGCTCGAACT GGGCGGCAAG TCGCCGACGA TCGTGTTCGA CGACGCCGAG CTCGATCACG CGGTCAACGG CGTGTTGTAC GGCATCTTCA GCTCGTCGGG CGAATCGTGC ATCGCGGGCT CGCGCCTGTT CGTCCAGCGC TCGATCTACG GCGCGTTCGT CGCGCGCCTC GTCGAAGCGG CGCGCAAGCT GCGCGTCGGC GATCCGGCGA GCGAGCGCAC GCAGATGGGC CCGCTCATCA CCGCGCGGCA TCGCGACACG GTCGAGCGCT ACGTCGCGCT CGGCCGCGAC GAGGGCGCCC GCGTGCTGTG CGGCGGCGAG CGGCCGACAG GCGAGGGCCG CGACGCGGGC TTCTTCTATC TGCCGACGAT TCTCGACGGC CTGTCGAACC ACGCACGCAT TTGCCGGGAG GAAATCTTCG GGCCGGTGCT CGTCGCGCTG CCGTTCGACG ACGAAGCGGC GCTCGTCGCC GACGCGAACG ACAGCGTGTT CGGGCTTGCC GCCGGCATCT GGACGCGCGA CTACAAGCGC GCGTGGCGCG TCGCGCGCGC GCTCGACGCG GGCACCGTGT GGATCAACAC GTACAAGCAG TTCTCGATCT CGACGCCGTT CTCGGGCCGG AAGGAAAGCG GGATGGGCCG CGAGAAGGGC AGCCTCGGGA TTCGCGAGTA CATGCAGCAG AAGAGCCTCT ACTGGGGCTT GAACGATTCG CCGCTGCCGT GGGCGAACTG A
|
Protein sequence | MTAFDSSLVP SGDILIGGEW RRGRGATTPS FYPADGSLNT EIHMADAADA REAVQAADAA WRRADWAGLK PHQRADVLYR IADLIHAHRE ALAQLQRRDN GKPINETRAL VASAASTFRY FAACAQTLDE ALTPSRGDYL SMSVHEPLGV VAAITPWNSP IASDAQKLGP ALAAGNAVVL KPAEVTPLAS LALARLCEQA GVPRGVISVL PGKGSVIGDA LVRDPLVKKV SFTGGTEVGR GIARLAAEKL MPLSLELGGK SPTIVFDDAE LDHAVNGVLY GIFSSSGESC IAGSRLFVQR SIYGAFVARL VEAARKLRVG DPASERTQMG PLITARHRDT VERYVALGRD EGARVLCGGE RPTGEGRDAG FFYLPTILDG LSNHARICRE EIFGPVLVAL PFDDEAALVA DANDSVFGLA AGIWTRDYKR AWRVARALDA GTVWINTYKQ FSISTPFSGR KESGMGREKG SLGIREYMQQ KSLYWGLNDS PLPWAN
|
| |
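The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the final codon of the gene sequence is a stop codon that is not counted in the protein length. The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1176330
END_BP = 1177820
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks hold for the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content. The leading GTG codon is read as the start methionine under translation table 11, which is why the protein sequence begins with M.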