Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0856 |
Symbol | |
ID | 4887987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 833983 |
End bp | 834984 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640130796 |
Product | dehydrogenase |
Protein accession | YP_001061855 |
Protein GI | 126442982 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.944789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACGA AGATGCATGC ATGGAGCGCG CAACACGTGC CGCCGCAGGG CGGAAAAGTC GCGGTCGTCA CGGGGGCCAA CAGCGGCCTC GGCTGGCAGA TCGCGCAAAC GCTCGCCGCC AAGGGCGCGC AAGTCGTGAT GGGCTGCCGG GATACGGCCA AGGGCGAACT GGCCGCGCAT GCGATCCGCA CCCGCTATCC GCGCGCCCGA ATCGAAGTCG AGGCGCTCGA TCTCGCCGAC CTCGCCAGCG TCTGCCGTTT CGCCGACGCC GTCGCCGATC GCCACGGCCG CGTCGACATT CTCTGCAACA ACGCGGGCGT GATGTTCCTG CCGCTGCGCC ACACGCGCGA TGGCTTCGAA ATGCAGATGG GCACGAACCA CCTCGGCCAC TTCGCGTTGA CGGGGCTGTT GCTGCCCGCG TTGCGCGCAT CGCACCGCGC GCGCGTCGTG ACGATGTCGA GCGGCTTCAA CCGGCTCGGC AAGATCCGCC TCGACAACAT GCTCGCCGAG CGCGGCTACA ACAAGTACCG CGCGTATTGC GACAGCAAGC TCGCGAACCT GATGTTCACG CTCGAGCTGC AGCGCCGCTT CGATCAAGCG TGCCTGCCGA TCCTGAGCGT GGCCGCGCAC CCCGGCTATG CGGCCACCCA CCTGCAGTTC GCGGGCCCCG AAATGGCGAA CTCGTCGCTC GGCACGTTCG CGATGCGCCT GTCGAACCGG CTCGTCGCCC AATCGGCCGA TGTCGGCGCG CTGCCCGCGA TCCATGCGGC GACGGCGGTC GACGTCGACG GCGGCGCATA CATCGGCCCG GCCCATCTCT GCGAGACGCG CGGCTATCCC GCCGAGGCAC GCATCCCGCG TCAGGCGCGC GACGTGCGCA TGGGCAAGCG CCTGTGGGAA AAATCCGAGC AACTGACCGG CGTGCGCTAT CTCGACACGC CGCCGCCGCC CGGTTCGCGC CGCCGCGCAT CGCGCGACGA CGCGACGTTC GGCGCGCTCT GA
|
Protein sequence | METKMHAWSA QHVPPQGGKV AVVTGANSGL GWQIAQTLAA KGAQVVMGCR DTAKGELAAH AIRTRYPRAR IEVEALDLAD LASVCRFADA VADRHGRVDI LCNNAGVMFL PLRHTRDGFE MQMGTNHLGH FALTGLLLPA LRASHRARVV TMSSGFNRLG KIRLDNMLAE RGYNKYRAYC DSKLANLMFT LELQRRFDQA CLPILSVAAH PGYAATHLQF AGPEMANSSL GTFAMRLSNR LVAQSADVGA LPAIHAATAV DVDGGAYIGP AHLCETRGYP AEARIPRQAR DVRMGKRLWE KSEQLTGVRY LDTPPPPGSR RRASRDDATF GAL
|
| |