Gene Information |
Locus tag | BURPS1106A_2870 |
Symbol | |
ID | 4901811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2824644 |
End bp | 2825702 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640136096 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase family protein |
Protein accession | YP_001067117 |
Protein GI | 126454705 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.156686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCCG CACGGGCGAT GCGCGGCGCG ATGCGCGCGC GCCGCCCGCG CGGCGCAACG GAGACGGCAA TGCAGAAAAT CCTGGTCGCG CGCCCGATCT TTCCGGACGT GATCGAGCGG CTCAAGCAGT ATTTCGACGT CGACTGGAAC GACGGCGACG CGCTCGCCCC CGATGCGCTG AAGGCGCGCC TCGCGGACAA GGACGGCGCG CTGACGGCGG GCGACATGAT CGACGCGTCG GTGCTCGCGG CCGCGCCGCG GCTGCGCGTC GTGTCGAACA TGGCGGTCGG CTACAACAAC TTCGACATCG GCGCGTTCGA CGCCGCGCAC GTGCTCGGCA CCAACACGCC CGACGTGCTG ACCGAGACGA CGGCCGATTT CGGCTGGGCG CTGATGATGG CGGCCGCGCG GCGGATCACC GAATCCGAGC ACTGGCTGCG CGCGGGGCAA TGGCGCAAGT GGTCGTACGA CAGCTTTCTC GGCGCGGACA TTCACGGCGC GACGCTCGGC GTGCTCGGCA TGGGCCGCAT CGGCCAGGCG CTCGCGCGCC GCGCGCGCGG CTTCGGCATG CGCGTGATCT ATCACAACCG CTCGCGCGTC GCGCCCGAGA TCGAGGCCGA GCTCAACGCC GAATACGTGC CGAAGGCGGC GCTGCTCGTG CAAGCCGATC ACGTCGTGCT CGTGCTGCCG TACTCGGCGC AAAGTCATCA CACGATCGGC GCGGCCGAGC TCGCGCTGAT GAAGCCGAGC GCGACGCTCA CGAACATCGC GCGCGGCGGG ATCGTCGACG ACGCGGCGCT CGCCGACGCG CTGCGCGAGA AGCGGATCGC GGCGGCGGGC CTCGACGTGT TCGAAGGCGA GCCGAGCGTG CATCCGGCGC TGCTCGACGT GCCGAACGTC GTGCTGACGC CGCACATCGC GAGCGCGAGC GAAGGCACGC GCCGCGCGAT GGCGAATCTC GCGGCGGACA ACCTGATCGC GGCGCTCGGC GCGGGCCCGC GCGCGGGCCG CCCGCCGAAT CCGATCAATC CCGGCGTGCT GGGGAAGGCT CGCATATGA
|
Protein sequence | MTAARAMRGA MRARRPRGAT ETAMQKILVA RPIFPDVIER LKQYFDVDWN DGDALAPDAL KARLADKDGA LTAGDMIDAS VLAAAPRLRV VSNMAVGYNN FDIGAFDAAH VLGTNTPDVL TETTADFGWA LMMAAARRIT ESEHWLRAGQ WRKWSYDSFL GADIHGATLG VLGMGRIGQA LARRARGFGM RVIYHNRSRV APEIEAELNA EYVPKAALLV QADHVVLVLP YSAQSHHTIG AAELALMKPS ATLTNIARGG IVDDAALADA LREKRIAAAG LDVFEGEPSV HPALLDVPNV VLTPHIASAS EGTRRAMANL AADNLIAALG AGPRAGRPPN PINPGVLGKA RI
|
| |
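The coordinate, length, and sequence fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record: it assumes inclusive 1-based coordinates, assumes the final codon is a stop codon that is not counted in the protein length, and assumes that translation table 11 assigns amino acids the same way as the standard genetic code. The helper names (`span_length`, `expected_protein_length`, `translate`) are hypothetical.

```python
# Values copied from the record above.
START_BP = 2824644
END_BP = 2825702
GENE_LENGTH_BP = 1059
PROTEIN_LENGTH_AA = 352

# First 30 bases of the gene sequence and first 10 residues of the
# protein sequence, with the display spacing removed.
GENE_PREFIX = "ATGACGGCCGCACGGGCGATGCGCGGCGCG"
PROTEIN_PREFIX = "MTAARAMRGA"

# Codon table in T, C, A, G order. Assumption: table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    return gene_length_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate complete codons of a coding-strand DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

Although the record lists the strand as "-", the gene sequence is shown in coding orientation (it begins with ATG), so no reverse complement is applied before translating.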