Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1037 |
Symbol | |
ID | 4905772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1004469 |
End bp | 1005323 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640144143 |
Product | short chain dehydrogenase |
Protein accession | YP_001075073 |
Protein GI | 126457822 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCG TGCGCATCGC CGAACAAGAA TCGCGATCAC TCGGGAGACA CCCACGCATG GCTGCAATCG ATCTGCTGAA ACCGTACGCT GGGCTGCGCG TACTCGTGAC GGGCGGCGCG TCCGGCATCG GGCTCGCGAT CGCCGACGCG TTCGCCGAAT GCGGCGGCCA GGTCCATGTC TGCGACGCAT CCGGGAAGGC GCTCGCCGCG CTCGCGCAGC GCCCCTCGCG CGCCGCGCTC GGCACGACGC TCGCCGACGT CGCCGATGCG GCCGCGGTCG AGCGCGTGTT CGACGACGTC ACGCGCACGC TCGGCGGGCT CGACGTGCTC GTGAACAACG CCGGCATCGC CGGGCCGACG GGCGGCATCG ACGAGATCGA TCCCGCGCAA TGGGAACAGA CGGTCGCGGT CAACCTGAAC GCGCAGTTCC AGTTCGCGCG CCGCGCGGTG CCGATGCTGC GCGACGCGCC GCACGGCGGC GCGATCATCG CGCTGTCGTC GGTCGCGGGG CGTCTCGGCT ATGCGTTGCG CACGCCGTAC TCGGCCACGA AATGGGCCGT CGTCGGCCTC GTGAAAAGCC TCGCGATCGA GCTCGGCCCG CTCGGCATCC GCGTGAACGC GATCCAGCCG GGCATCGTGC GCGGCCCGCG CATCCGCCGC GTGATCGAGG CGCGCGCCGC GCAACTCGGC ATCGGCTACG ACGAGATGCA GGCGCGCTAT CTCGAGAAGA TCTCGCTGCG CCGGATGACC GATCCGGACG AGATCGCCGC GACCGCGCTG TTCCTCTGCT CGCCGGGCGG GCACGGGATT TCCGGGCAGG CGATTTCCGT CTGCGGCAAC GTCGAGGCGC TCTGA
|
Protein sequence | MRIVRIAEQE SRSLGRHPRM AAIDLLKPYA GLRVLVTGGA SGIGLAIADA FAECGGQVHV CDASGKALAA LAQRPSRAAL GTTLADVADA AAVERVFDDV TRTLGGLDVL VNNAGIAGPT GGIDEIDPAQ WEQTVAVNLN AQFQFARRAV PMLRDAPHGG AIIALSSVAG RLGYALRTPY SATKWAVVGL VKSLAIELGP LGIRVNAIQP GIVRGPRIRR VIEARAAQLG IGYDEMQARY LEKISLRRMT DPDEIAATAL FLCSPGGHGI SGQAISVCGN VEAL
|
| |