Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0216 |
Symbol | |
ID | 4902601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 202592 |
End bp | 203731 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640133446 |
Product | aldo/keto reductase family oxidoreductase |
Protein accession | YP_001064499 |
Protein GI | 126454356 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTGCACG ATTCGCGACG GTTGGCTAAC ATGGCCCGCT GGGGTACGTC CGGCCGGAAG GATCCGGCCT CGTTCAACGT CTCGACGGGA ATTCAGATGG CCTACGAAGC AGCTTCAGAA CGCTATGCGG ACATGCAGTA TCGCGTGAGC GGCAAATCCG GGCTCAAATT GCCGGCGCTT TCGCTCGGCT TGTGGCACAA CTTCGGCGAC ACGACGCCGA TCTCGACGCA GCGCGAGATC CTGCGCACCG CATTCGATCT CGGCATCACG CACTTCGATC TCGCGAACAA CTACGGGCCG CCGTACGGCA GCGCCGAAAC GAACTTCGGC CGGCTGCTGC GCGAGGATTT CAAGCCGTAT CGCGACGAGC TGCTGATTTC GACGAAAGCC GGCTGGGACA TGTGGCCCGG CCCGTACGGC AGCGGCGGCG GCTCGCGCAA GTACGTGCTC GCGAGCCTCG ACCAGAGCTT GCGGCGCATG GGGCTCGACT ATGTCGACAT CTTCTATTCG CACCGCTTCG ACGCGCACAC GCCGCTCGAG GAAACCGCGA GCGCGCTCGC GAGCGCCGTG CAGCAGGGCA AGGCGCTCTA CGTCGGGGTC TCGTCGTATT CGGCGGCGAG CACGCGCGAG ATCGCGAAGC TGCTCGCCGA ATACAAGGTG CCGCTGCTGA TCCACCAGCC CGCGTACAAC ATGCTCAACC GCTGGATCGA GCGCGAGCTG CTCGACGCGC TCGACGAGAC GGGCTCGGGC TGCATCGCGT TCACGCCGCT CGCGCAGGGG CTTCTGACCT CGAAGTATCT GAACGGCGTG CCGGCGGATG CGCGGATCAA CAAGCCGGGC GGCGGATCGC TGAAGGAAGC TCACCTGAGC GCGGAGAACC TCGAGCACGT GCGCAAGCTG AACGAGATCG CGCAGCGGCG CGGCCAGAGC CTCGCGCAGA TGGCGCTTGC CTGGGTGCTG CGCGATTCGC GCGTCACGTC CGCGTTGATC GGTGCGAGCC GCGCGGAGCA GGTGCGCGAG AACGTCGCGG CGCTCGCCCA TCTCGCGTTC AGCGACGACG AGATCGCCGA GATCGACCGC TATGCGACCG AAGGCGGGAT CAATCTGTGG GAAAAGCCGT CCACCGATCA GGCGATCTGA
|
Protein sequence | MLHDSRRLAN MARWGTSGRK DPASFNVSTG IQMAYEAASE RYADMQYRVS GKSGLKLPAL SLGLWHNFGD TTPISTQREI LRTAFDLGIT HFDLANNYGP PYGSAETNFG RLLREDFKPY RDELLISTKA GWDMWPGPYG SGGGSRKYVL ASLDQSLRRM GLDYVDIFYS HRFDAHTPLE ETASALASAV QQGKALYVGV SSYSAASTRE IAKLLAEYKV PLLIHQPAYN MLNRWIEREL LDALDETGSG CIAFTPLAQG LLTSKYLNGV PADARINKPG GGSLKEAHLS AENLEHVRKL NEIAQRRGQS LAQMALAWVL RDSRVTSALI GASRAEQVRE NVAALAHLAF SDDEIAEIDR YATEGGINLW EKPSTDQAI
|
| |