Gene Information |
Locus tag | BURPS668_A0057 |
Symbol | |
ID | 4886069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 46891 |
End bp | 48471 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640129998 |
Product | NAD-dependent aldehyde dehydrogenases |
Protein accession | YP_001061063 |
Protein GI | 126443729 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.641488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCA CCGGCGAGAT GTTGATTGGC GCGGCCGCGG TGCGCGGTAG CGAAGGCACG ATGCGCGCTT ACGCGCCGGC GCAGGGCGTC GAGCTCGAGC CGACGTTCGG CGCGGGCGGT GCGGCCGACG TCGATCGCGC GTGCCGCCTC GCGAACGCCG CTTTCGATCC CTTTCGTCAG GCGCCGCTCG AGACGCGCGC ACGCTTTCTC GAGGCGATCG CCGAGCGCAT CGTCGGGCTC GGCGATCCAT TGATCGAACG CGCGCACGCG GAATCGGCGC TGCCCGTCGC GCGGCTCGAA GGCGAGCGCG CGCGCACGGT CGGTCAGCTC AGGCTCTTCG CGGCGATCGT GCGCGACGGC CGCTGGCTGA GCGCGACGCT CGATTCCGCG CAGCCCGAGC GCAAGCCGCT GCCGCGCGCC GATCTGCGCT TGCAGAAGAT TCCCGTCGGC CCGGTCGCGG TGTTCGGCGC GAGCAATTTC CCGCTCGCGT TCTCGGTCGC GGGCGGCGAC ACCGCTTCGG CGTTCGCGGC CGGCTGCCCC GTCGTCGCGA AGGCGCACCC CGCGCATCTC GGCACGTCGG AGCTCGTCGG GCGCGCGATC CGGCAGGCTG TCGCCGATTG CGGCTTGCAC GAGGGCGTGT TCTCGCTCGT CGTCGGCGCG GGCAACGCGA TCGGCGAGGC GCTCGTCGCA CATCCCGCGA TCAGGGCGGT CGGCTTCACC GGCTCGCGCG CGGGCGGCCT TGCGCTGATG GGCGTTGCCG CGCGGCGGCG CGAGCCGATT CCGGTCTTCG CGGAAATGAG CAGCATCAAT CCGTTCTTCG TGTTGCCCGG CGCGTTGCGC GCACGCGGTG CGCAAATCGC GCAAGGCTTC GTCGAATCGC TGACGCTCGG CGTCGGGCAG TTCTGCACGA ACCCGGGGCT CGTCGTCGCG CTCGAAGGGC CCGACCTGAA GGCGTTCGTC GACGCGGCCG CGCAGGCGCT CTCGCAAAAG GGCGCGCAGA CGATGCTGAC CTCGGGCATC GCGTCGTCTT ACGAGAGCGC GGTCGCGGCG CGCCGCGCGG CCGCAGGCGT CAGCGAGGTC GCGCGCGGCG TGCGCAGCGA CGCGCGGAAC GCCGCGTTGC CCGCGCTCTT CACGACGACG CACACGCAGT TCGTCCAGAA CCCGCAGCTC GAAGCCGAGA TCTTCGGGCC GACGTCGCTC GTCGTCGCGT GCCGCGACAT CGACGAGATG ATCGCGCTTG CCGAGCATGT CGAGGGGCAA CTGAGCGCGA CGCTGCATCT CGAAGACGAC GATGTCGATC TGGCGCGCAA ACTGCTGCCG ACGCTCGAGC GCCGCGCCGG CCGCATCGTC GCGAACGGCT ATCCGACGGG CGTCGAGGTC GCGTACGCGA TGGTGCACGG CGGGCCGTTT CCGGCGACGT CGGACCCGCG CAGCACATCG GTGGGCGCGC TTGCGATCGA GCGCTTCCTG CGGCCCGTCT GCTATCAGGA TTTGCCGGCG GCGTTGTTGC CCGAGGTGCT CGCCGACGCG AATCCGCTCG GCCTCTGGCG CCTGCGCGAC GGCCAACTCG GCAAGGCGTG A
|
Protein sequence | MQITGEMLIG AAAVRGSEGT MRAYAPAQGV ELEPTFGAGG AADVDRACRL ANAAFDPFRQ APLETRARFL EAIAERIVGL GDPLIERAHA ESALPVARLE GERARTVGQL RLFAAIVRDG RWLSATLDSA QPERKPLPRA DLRLQKIPVG PVAVFGASNF PLAFSVAGGD TASAFAAGCP VVAKAHPAHL GTSELVGRAI RQAVADCGLH EGVFSLVVGA GNAIGEALVA HPAIRAVGFT GSRAGGLALM GVAARRREPI PVFAEMSSIN PFFVLPGALR ARGAQIAQGF VESLTLGVGQ FCTNPGLVVA LEGPDLKAFV DAAAQALSQK GAQTMLTSGI ASSYESAVAA RRAAAGVSEV ARGVRSDARN AALPALFTTT HTQFVQNPQL EAEIFGPTSL VVACRDIDEM IALAEHVEGQ LSATLHLEDD DVDLARKLLP TLERRAGRIV ANGYPTGVEV AYAMVHGGPF PATSDPRSTS VGALAIERFL RPVCYQDLPA ALLPEVLADA NPLGLWRLRD GQLGKA
|
| |
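The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


# Values taken from the record above (BURPS668_A0057).
start_bp, end_bp = 46891, 48471

length = gene_length(start_bp, end_bp)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields in the record.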