Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_2213 |
Symbol | dhaS |
ID | 3690442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 2471253 |
End bp | 2472725 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637728669 |
Product | aldehyde dehydrogenase |
Protein accession | YP_333608 |
Protein GI | 76809876 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00475065 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCAAC GATTTCAGCA GTACATCGAC GGTACGTTCG AACATGCCGC CAGCGAGTTC GACAGCATCG ATCCGGCGGG CGGTACTGTA TGGGCGCGCA TGCCGGCTGC GAGTGCGGAC GACGTGGATC GCGCGGTGCG TGCGGCACAT CGCGCGTTGA ATGAAGCGGC ATGGGCGAAC CTGACTGCGA GCGAACGCGG CAAGCTGCTG TATCGGCTTG CCGAATTGAT CGAACGCGAT GCGCTGCGCC TTGCCGAGCT CGAAACGCGG GACACCGGCA AGATTATCCG CGAAACGCGC AGTCAGATCG GCTATGTCGC CGAGTACTAC CGTTACTATG CGGGCGTGGC CGACAAGATT CAAGGCGCGT GGCTGCCTGT CGACAAGCCC GACATGGAAG TCACGCTGCG GCGTGAGCCG GTGGGCGTCG TTGCGGCCAT CGTCCCGTGG AATTCGCAGC TGTTTCTCTC CGCCGTGAAG GTTGGCCCCG CGCTCGCCGC GGGTTGCACG GTCGTGCTCA AGGCCTCCGA AGACGGCCCT GCGCCGTTGC TCGAATTCGC GCGGCTCGTG CATGAAGCGG GTTTTCCGAA GGGCGTCGTC AATATCGTGA CGGGATTCGG CAACGATTGC GGACGCACGT TGACGAGCCA CCCGCTCGTG TCGCACATCG CCTTCACGGG TGGACCCGAG ACAGCGCGCC ACGTCGTCCG CAACTCGGCC GACAACCTTG CGGCGATATC GCTGGAGCTC GGTGGCAAAT CTCCCGTGCT CGTGTTCGAC GATGCCGATC TGGAGAGCAC ATGCAATGCC GTGATCGCCG GCATCTTCGC CGCGACGGGG CAAAGCTGCG TGGCGGGTTC ACGCCTGATC GTGCAGCGGG GCATTCACGA TGCACTCGTC GAGCGGTTGA CCGCTCGGGC GCGCGCGATT CGCATCGGCG ATCCGCAAGA CATGGCAACC GAGATGGGTC CGCTCGCGAC GCGGCGTCAA CTCGAACATA TTCAGCGCGT ATTGGACGCC AGCATAGAAG CAGGCGGCCG CGTCGTCACC GGTGGATCTC AGCCGGAGGG ATTGGCTGCG GGCCACTACT TCCTGCCCAC GATCGTCGAC TGTCCGAACG CGAAAGTGCC CAGCGTTATG GAAGAGTTGT TCGGACCGGT CCTCAGCGTC GTGATGTTCG ATACGGAAGC CGAGGCCATC GCACTTGCGA ACGACACGAA GTATGGCCTG GCCTCCGGCG TATTCACGGG CGATTTGACG CGTGCTCATC GGCTCACGCG GGCGCTGCGC GCGGGGATCG TCTGGGTCAA TACCTATCGC GCCGTGTCGC CGATCGTGCC GTTCGGCGGC TATGGCTTGA GCGGCCTCGG CCGCGAAGGC GGTTTCGAGG CGGTGCTCGA ATACACGCGC ACGAAGTCCG TCTGGATTCG CACGTCGGAC GAGCCGATCG CCGATCCGTT CGTGATGCGC TGA
|
Protein sequence | MLQRFQQYID GTFEHAASEF DSIDPAGGTV WARMPAASAD DVDRAVRAAH RALNEAAWAN LTASERGKLL YRLAELIERD ALRLAELETR DTGKIIRETR SQIGYVAEYY RYYAGVADKI QGAWLPVDKP DMEVTLRREP VGVVAAIVPW NSQLFLSAVK VGPALAAGCT VVLKASEDGP APLLEFARLV HEAGFPKGVV NIVTGFGNDC GRTLTSHPLV SHIAFTGGPE TARHVVRNSA DNLAAISLEL GGKSPVLVFD DADLESTCNA VIAGIFAATG QSCVAGSRLI VQRGIHDALV ERLTARARAI RIGDPQDMAT EMGPLATRRQ LEHIQRVLDA SIEAGGRVVT GGSQPEGLAA GHYFLPTIVD CPNAKVPSVM EELFGPVLSV VMFDTEAEAI ALANDTKYGL ASGVFTGDLT RAHRLTRALR AGIVWVNTYR AVSPIVPFGG YGLSGLGREG GFEAVLEYTR TKSVWIRTSD EPIADPFVMR
|
| |