Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0186 |
Symbol | |
ID | 3850115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 215984 |
End bp | 217558 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637839859 |
Product | aldehyde dehydrogenase family protein |
Protein accession | YP_440744 |
Protein GI | 83720460 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTGA GCGGCGAACT GATGCTGGGC GGCGAGCGCG TCGCGCCCGG CGAGCGCGCC GCCGTGCGCG CGACCGATCC GGCGACAGGC GCGACGCTCG AGCCGCCGTT CGCGCTCGCG ACGCATGCGG ACGTCGCACG CGCGTGCGAG CTGGCGGCCG CCTCGTTCGA CGCGTACCGC GACACCGCCC CCGAGGCGCG CGCCGCGTTT CTCGAAGCGA TCGCAACCGA GATCGAAGCG CTCGGCGACG CGCTGATCGA ACGCGCGATC GCCGAAACCG CGTTGCCGCG CGCGCGCCTC GAAGGCGAGC GCGCGCGCAC CTGCGGCCAA TTGCGTTTGT TCGCGGCGGT CGTGCGCGCG GGCGATGCAT TTGGCGCGCG CATCGATCCC GCGCTGCCCG AGCGCCGGCC GCTGCCGCGC GCCGATCTGC GGATGCGGCG CATCGCGCTC GGCCCCGTCG CGGTGTTCGG CGCGAGCAAT TTCCCGCTCG CGTTCTCGGT CGCGGGCGGC GATACCGCGT CCGCGCTCGC GGCCGGCTGC CCGGTCGTCG TGAAGGCGCA TCCCGCGCAT CCGGGCACGT CCGAGCTCGT CGGCCGCGCG CTCGCCGCCG CGCTCGCGCG ATGCGGGCTG CCGGCGGGCG TGTTCTCGCT CGTCCAGGCT GACAACGACG TCGCGCTCGC GCTCGTCGCC GATGCGCGCA TCCAGGCGGT CGGCTTCACG GGCTCGCGCG CGGGCGGCCA GGCGCTGCTG CGCGTCGCGC AATCGCGTGC GCAGCCGATT CCGATGTATG GCGAACTGAG CGCGATCAAC CCGGTGTTCC TGCTGCCCGA CGCGCTCGCG CGGCGCGGCG GCGCGCTCGG CCGGCAATTC GTCGCGTCGC TCACGCTCGG CGCCGGGCAG TTCTGCACGA ATCCCGGGCT GCTGCTCGCG ATCGACGGGC CGGGCCTCGA TGCGTTCTCG AACGCCGCCG CCGACGCGCT CGTCACGAGC GTCGCGCAGC CGATGCTGAC GCCAGGCATT CACGCAGCGT ACGTGCGCGG CGTCGAGCGT CTCGCGGACT CGGACCATGT GCGGTGCGTC GCGCGCGGCG AGGCGAGCGA CCTGCCGAAT CGCGGATGCG CCGGCCTGTT CGACACGGAT GCGCGGCACT TCATCGCGCA GCCCGCGCTG CACGACGAGA TCTTCGGCGC GACGGCGCTC CTCGTGCGCT GCCGCGACGC GGCCGAGCTG CGCGCGGTTG CCGAAATGCT CGACGGCCAG TTGACGGCGA CCTTGCATCT CGACGACGGC GACGCGCCGC TCGCGCGCGC GCTGTTGCCG GTGCTTGAGC GCAAGGCGGG CCGCATCGTC GCGAACGGCT GGCCGACGGG CGTCGAAGTT TGTGACGCGA TGGTGCATGG CGGCCCGTGG CCCGCGACGA CGGATGCGCG CGCGACGTCG GTCGGCACCG CGGCGATCGA GCGCTTTCTG CGGCCCGTCT GCTATCAGGA CCTGCCCGCC GGGCTGCTGC CGCCCGCGCT GCGGGACGAC AACCCGCAGC GGCTGCGCCG CCTTGTCGAC GGAAACTGGG TCTGA
|
Protein sequence | MSVSGELMLG GERVAPGERA AVRATDPATG ATLEPPFALA THADVARACE LAAASFDAYR DTAPEARAAF LEAIATEIEA LGDALIERAI AETALPRARL EGERARTCGQ LRLFAAVVRA GDAFGARIDP ALPERRPLPR ADLRMRRIAL GPVAVFGASN FPLAFSVAGG DTASALAAGC PVVVKAHPAH PGTSELVGRA LAAALARCGL PAGVFSLVQA DNDVALALVA DARIQAVGFT GSRAGGQALL RVAQSRAQPI PMYGELSAIN PVFLLPDALA RRGGALGRQF VASLTLGAGQ FCTNPGLLLA IDGPGLDAFS NAAADALVTS VAQPMLTPGI HAAYVRGVER LADSDHVRCV ARGEASDLPN RGCAGLFDTD ARHFIAQPAL HDEIFGATAL LVRCRDAAEL RAVAEMLDGQ LTATLHLDDG DAPLARALLP VLERKAGRIV ANGWPTGVEV CDAMVHGGPW PATTDARATS VGTAAIERFL RPVCYQDLPA GLLPPALRDD NPQRLRRLVD GNWV
|
| |