Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0907 |
Symbol | |
ID | 4886478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 885007 |
End bp | 886479 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640130847 |
Product | aldehyde dehydrogenase (NAD) family protein |
Protein accession | YP_001061906 |
Protein GI | 126444500 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03250] putative phosphonoacetaldehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.140416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTCAATC ACCGCTCCGC CGCTTTCGCG CGCGTTGCCC GCGCGCTCGA CGCCCGCGGC AAGACGTCGT CCGGCGCGAC GCTCATCGTC AGTCATCCGT ACAATCGGGA AACGGTTGCC GAGCTGCCGC TCGACAGCGG CGACGATGCG CGCCGCAAGC TGCAGCGCGC CGCGCGCTTT CGCAGCGCGC TCAGCCGGCA CGAGCGCATC GCCGTGTTCG ACAAGGCGAT CGCGCTGCTC GCGGCGGAAA AGCGCGATGC GTCGATTCTC ATCACGCTCG AATCGGGCCT GTGCCGCAAG GACACGATGT ACGAGGTCGA TCGCGTGATC AACGTGCTGC ACGCGGCGAT CGCGGAACTG AACCGGGACG ACGGCCAGAC TTTCTCGTGC GACAACGCGA CGAGCGACGA GCGACGCAAG ATCTTCACGG TTCGCGAGCC GCTGCGCGGC GTCATCGTCG CGATCACGCC GTTCAACCAT CCGATGAACC AGGTCGCGCA CAAGATCTGC CCGGCGATCG CGTCGAACAA CCGGATCGTG CTCAAGCCGT CGGAGAAGAC GCCGCTGTCG GCGCTGTACC TGCTCGACCT GTTTCGCGAG GCGGGCCTGC CCGAGCCGAT GTTCGACGTC GTGATCGGCG AGCCAAACGC GCTCGGCGCG GCGCTCGTTT GCGACGAGCA TGTCGAGCTC GTCGCGTTTA CCGGCAGCGT CGCGGTCGGC AAGCGGATCG CGCAGATGGC CGGCTATCGG CGCACGGTGC TCGAACTCGG CGGCAACGAT CCGCTGATCG TGATGGAAGA CGCCGATCTC GAGCGCGCGG CGCAGCTCGC CGCCAAGGGC TCGTACAAGA ATTCCGGGCA GCGCTGCACG GCGGTCAAGC GCATTCTCGT CGAGCGCACG GTCGCGCGCC CCTTCACCGA GCTGCTCGTC GAGCACAGCC GCCGCTGGCA AACCGGCGAT CCGATGGACG AGCGCGTCGA CATCGGCACG CTGATCGACG ATGCGGCCGC GATCGAATGC GCGCGGCGCG TCGACGAAGC GCGCGACGCC GGCGCGCGCG TGCTGCTCGG CCACCAGCGC GACGGCGCGG CCTACGCGCC CACCGTGCTC GAGCGCGTCG GCCCGGCGCT GCGCCTCGTC CAGCAGGAAA CGTTCGGCCC GGTGTCGCCC GTGATCACGT TCTGCGGCCT CGACGAAGCA GTCGCGATCG CGAACAGCAC GCGCTATGGG CTATCGTCCG GCGTGTGCAC GAACCGGCTC GACTACATCA CGCACCTGAT CGCCCATCTC GACGTCGGCA CGGTCAACGT GTGGGAAGTG CCGGGCTTCC GGCTGGAAAG CACGCCGTTC GGCGGCGTCA AGGATTCGGG GCTCGGCAGC AAGGAAGGCA TGCAGGAGGC AATCAAGAAC TTCACGAACC TGAAGACCTA TTCGCTGCCG TGGGACACCC TCGCGCATCC GCACGCGGCA TGA
|
Protein sequence | MLNHRSAAFA RVARALDARG KTSSGATLIV SHPYNRETVA ELPLDSGDDA RRKLQRAARF RSALSRHERI AVFDKAIALL AAEKRDASIL ITLESGLCRK DTMYEVDRVI NVLHAAIAEL NRDDGQTFSC DNATSDERRK IFTVREPLRG VIVAITPFNH PMNQVAHKIC PAIASNNRIV LKPSEKTPLS ALYLLDLFRE AGLPEPMFDV VIGEPNALGA ALVCDEHVEL VAFTGSVAVG KRIAQMAGYR RTVLELGGND PLIVMEDADL ERAAQLAAKG SYKNSGQRCT AVKRILVERT VARPFTELLV EHSRRWQTGD PMDERVDIGT LIDDAAAIEC ARRVDEARDA GARVLLGHQR DGAAYAPTVL ERVGPALRLV QQETFGPVSP VITFCGLDEA VAIANSTRYG LSSGVCTNRL DYITHLIAHL DVGTVNVWEV PGFRLESTPF GGVKDSGLGS KEGMQEAIKN FTNLKTYSLP WDTLAHPHAA
|
| |