Gene Information |
Locus tag | BTH_II1632 |
Symbol | |
ID | 3844951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1912115 |
End bp | 1913866 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637838933 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_439826 |
Protein GI | 83718062 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
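The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1912115, 1913866)
print(length, protein_length_aa(length))  # prints "1752 583"
```

Both values agree with the Gene Length (1752 bp) and Protein Length (583 aa) rows of the record.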
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.66884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCAT CGAAACCCAA GCTGCGCTCG GCCCAATGGT TCGGCACCCA TGACAAGAAC GGCTTCATGT ACCGGAGCTG GATGAAGAAT CAGGGCATTC CCGATCACGA ATTCGACGGC CGGCCGATCG TCGGCATCTG CAACACCTGG TCGGAGCTCA CGCCGTGCAA CGCGCACTTT CGCAAGCTCG CCGAGCACGT GAAGCGCGGC GTCTACGAGG CGGGCGGCTT TCCGGTCGAG TTCCCGGTGT TCTCGAACGG CGAATCGAAC CTGCGGCCGT CCGCGATGCT CACGCGCAAT CTCGCGTCGA TGGACGTCGA GGAGGCGATT CGCGGCAACC CGATCGACGC GGTCGTGCTG CTCGCCGGCT GCGACAAGAC GACCCCCGCG CTGCTGATGG GCGCGGCGAG CTGCGACGTG CCGGCGATCG TCGTGTCCGG CGGCCCGATG TTGAACGGCA AGCTCGACGG CAAGAACATC GGCTCCGGCA CCGCCGTCTG GCAACTGCAC GAAGCGCTGA AGGCGGGCGA GATCGACCTG CACCGCTTCC TGTCGGCGGA GGCCGGCATG TCGCGCTCGG CGGGCACCTG CAACACGATG GGCACGGCGT CGACGATGGC GTGCCTGGCC GAAGCGCTCG GCGTCGCGCT GCCGCACAAC GCGGCGATTC CGGCCGTCGA CGCGCGCCGC TACGTGCTCG CGCACATGTC GGGCATGCGC ATCGTCGGGA TGGCGCACGA AGGGCTCGTG CTGTCGAAGA TCCTCACGCG CGCGGCGTTC GAGAACGCGA TCCGCGTGAA CGCGGCGATC GGCGGCTCGA CGAACGCGGT GATCCATCTG AAGGCGATCG CCGGGCGGCT CGGCGTGCCG CTCGAGCTCG AGGACTGGCT GCGCCTCGGC CGCGGCACGC CGACGATCGT CGATCTGATG CCGTCCGGCC GGTTCCTGAT GGAGGAGTTC TATTACGCGG GCGGGCTGCC CGCCGTGCTG CGCCGGCTCG GCGAGGCGAA CCTGCTGCCG CATCCGGGCG CGCTGACCGT CAACGGCCAA TCGCTGTGGG ACAACGTGCG CGACGCGCCG AGCCACGACG ACGAGGTGAT CCGTCCGCTC GATCGGCCGC TGATCGCCGA CGGCGGCATC CGGATCTTGC GCGGCAATCT CGCGCCGCGC GGCGCGGTGC TGAAACCGTC CGCGGCGAGC CCGGAATTGC TGAAGCACCG CGGCCGCGCG GTCGTGTTCG AGAACTTCGA GCACTACAAG GCGACGATCG ACGACGAGGC GCTCGACGTC GACGCGAACT CGGTGCTCGT GCTGAAGAAC TGCGGCCCGC GCGGCTATCC GGGCATGGCC GAGGTCGGCA ACATGGGGCT GCCGCCGAAA CTGTTGCGGC AGGGCGTGAA GGACATGGTG CGGATCTCGG ATGCGCGGAT GAGCGGCACC GCGTACGGCA CGGTCGTGCT GCACGTCGCG CCGGAAGCGG CGGCGGGCGG CCCGCTCGCG GCGGTGCGCA ACGGCGACTG GATCGAGCTC GATGGCGAGG CGGGCACGCT CACGCTCGAC GTGAGCGACG ACGAGCTCGC GCGCCGGCTG TCGGATCACG ATCCGGCGAG CGCGCCGGGC GTCGCCGAGC ATGCGGCGGG CGGCGGCTAC GCGCGTCTTT ACGTCGACCA CGTGCTGCAG GCGGACGAGG GCTGCGACCT CGACTTCCTG GTCGGCCGGC GCGGCGCCGC GGTGCCGCGG CATTCGCACT GA
Protein sequence | MSASKPKLRS AQWFGTHDKN GFMYRSWMKN QGIPDHEFDG RPIVGICNTW SELTPCNAHF RKLAEHVKRG VYEAGGFPVE FPVFSNGESN LRPSAMLTRN LASMDVEEAI RGNPIDAVVL LAGCDKTTPA LLMGAASCDV PAIVVSGGPM LNGKLDGKNI GSGTAVWQLH EALKAGEIDL HRFLSAEAGM SRSAGTCNTM GTASTMACLA EALGVALPHN AAIPAVDARR YVLAHMSGMR IVGMAHEGLV LSKILTRAAF ENAIRVNAAI GGSTNAVIHL KAIAGRLGVP LELEDWLRLG RGTPTIVDLM PSGRFLMEEF YYAGGLPAVL RRLGEANLLP HPGALTVNGQ SLWDNVRDAP SHDDEVIRPL DRPLIADGGI RILRGNLAPR GAVLKPSAAS PELLKHRGRA VVFENFEHYK ATIDDEALDV DANSVLVLKN CGPRGYPGMA EVGNMGLPPK LLRQGVKDMV RISDARMSGT AYGTVVLHVA PEAAAGGPLA AVRNGDWIEL DGEAGTLTLD VSDDELARRL SDHDPASAPG VAEHAAGGGY ARLYVDHVLQ ADEGCDLDFL VGRRGAAVPR HSH
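The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before computing anything from them. As a minimal sketch, the helpers below (hypothetical names, not taken from the source database) strip that grouping and compute the GC fraction of a nucleotide string. The example uses only the first two blocks of the gene sequence; running it on the full sequence would be the way to compare against the GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungapped nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in the record.
example = "ATGTCGGCAT CGAAACCCAA"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))  # prints "20 0.5"
```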