Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_0918 |
Symbol | dgoA |
ID | 3689640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 954733 |
End bp | 955881 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637727374 |
Product | galactonate dehydratase |
Protein accession | YP_332331 |
Protein GI | 76809443 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.462959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA CCCGCCTCGA AACCTTCGTC GTGCCGCCTC GATGGCTGTT CCTGAAGATC GAGACCGACG CGGGCATCGT CGGCTGGGGC GAGCCGATCG TCGAAGGCCG CGCGCATACG GTCGAGGCGG CCGTGCACGA GCTCGCCGAC TACCTCGTCG GCCAGGATCC GCTTCGCATC GAGGACCACT GGCAGGTGAT GTACCGCGCG GGCTTCTACC GCGGCGGGCC GATCACGATG AGCGCGATCG CGGGCATCGA CCAGGCGCTC TGGGACATCA AGGGCAAGCA TCACGGCGCG CCCGTGCATG CGCTCCTCGG CGGCCCGGTG CGCGAGCGGA TCAAGGTGTA TTCGTGGATC GGCGGCGATC GGCCGAGCGA CGTCGCGAAC AACGCGCGCG CGGTCGTCGA ACGCGGCTTC CAGGCGGTGA AGATGAACGG CTCGGAAGAG CTGCAGATCG TCGACACCTT CGACAAGGTC GACAAGGTGA TCGCGAACGT CGCGGCGGTG CGCGACGCGG TGGGCCCGTA CGTCGGCATC GGCGTCGATT TCCACGGCCG CGTGCACAAG CCGATGGCGA AGGTACTCGC CAGGGAGCTC GATCCGTACA AGCTGATGTT CATCGAGGAG CCCGTGCTGT CGGAGAACGC CGAGGCGCTG CGCGACATCG CGAACCAGAC GAGCACGCCG ATCGCGCTCG GCGAGCGGCT CTACTCGCGC TGGGATTTCA AGCGCATTCT CGAAGGCGGC TACGTCGACA TCGTGCAGCC CGACGCGTCG CACGCGGGCG GGATCACCGA GTGCCGGAAG ATCGCGACGC TCGCGGAAAG CTACGACGTC GCGCTCGCGC TGCACTGCCC GCTCGGGCCG ATCGCGCTCG CCGCGTGCCT GCAGCTCGAC GCGGTCAGCT ACAACGCGTT CATTCAGGAG CAGAGCCTCG GCATTCACTA CAACCAGGGC AGCGATCTGC TCGACTATCT GCGCAACCCG GACGTGTTCC GCTACGCGGA CGGCTTCGTC GCGATTCCGC AGGGGCCCGG GCTCGGCATC GACGTCGACG AGGACAAGGT GTGCGAGATG GCGAAAACCG GGCACCGCTG GCGTAATCCG GTATGGCGGC ACGCGGACGG CAGCGTCGCC GAGTGGTGA
|
Protein sequence | MKITRLETFV VPPRWLFLKI ETDAGIVGWG EPIVEGRAHT VEAAVHELAD YLVGQDPLRI EDHWQVMYRA GFYRGGPITM SAIAGIDQAL WDIKGKHHGA PVHALLGGPV RERIKVYSWI GGDRPSDVAN NARAVVERGF QAVKMNGSEE LQIVDTFDKV DKVIANVAAV RDAVGPYVGI GVDFHGRVHK PMAKVLAREL DPYKLMFIEE PVLSENAEAL RDIANQTSTP IALGERLYSR WDFKRILEGG YVDIVQPDAS HAGGITECRK IATLAESYDV ALALHCPLGP IALAACLQLD AVSYNAFIQE QSLGIHYNQG SDLLDYLRNP DVFRYADGFV AIPQGPGLGI DVDEDKVCEM AKTGHRWRNP VWRHADGSVA EW
|
| |