Gene Information |
Locus tag | BTH_II0570 |
Symbol | mhpE |
ID | 3844932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 672769 |
End bp | 673812 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637837875 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_438770 |
Protein GI | 83717692 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
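The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed for BTH_II0570.
    length = gene_length(672769, 673812)
    print(length, protein_length(length))
```

Under these assumptions the listed coordinates give 1044 bp and 347 aa, in agreement with the Gene Length and Protein Length fields.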
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.739871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACTGA TCAGCGATGC GACCTTGCGC GACGGCAACC ACGCGATTCG TCACCAGTTG AGCGCCGCGC AGATACATGC GTATGCACGC GCGGCGGACG AAGCCGGCAT CGACATCGTC GAAGTCGGTC ACGGCAACGG TCTCGGCGGC TCGTCGTGCC TGCTCGGGCA GACGCCGATC GGCGATCGCC TGATGCTCGA GACCGCGCGC GCCGCGCTGC GCACGAGCCG GCTGGGCGTG CATTTCATCC CCGGGCTCGG CAAGGCGGCG GACATTGCGC TCGCGCTCGA GATCGGCGTC GACGTCGTGC GCGTCGCGAC GCATTGCACG GAAGCGAACG TGTCCGCGCG ATTCATCGAG CAGACCCGGG TGGCCGGCCG CACGGCGTTC GGCGTGCTGA TGATGTCGCA CATGGCGCCG TCCGACGTGC TGCTCGCGCA GGCGAAGTTG ATGGAGCGGT ACGGCGCGCA GGCGGTGGTG CTGATGGACA GCGCCGGCTA TTCGACGCCG TCGCTCGTGC GCGCGAAGGT CGAGCGCCTC GTCGACGGCC TCGATATCGA CGTCGGCTTT CATGCGCACA ACAACCTCGG CCTCGCGGTT GCGAACAGTC TCGTCGCGCT CGAAGCGGGG GCGCGCATCG TCGATGCGTG CGTGAAGGGC TTCGGCGCCG GCGCGGGCAA TACGCAGCTC GAAACGCTCG TCGCCGCGAT GGAGCGCGAA GGGCATGACA CACGCACGAC GTTCGAGCAT GTGATGGCGC TCGCGCGCGG CACCGAGGCG TTTCTCAATC CGAAGACGCC GCATATCCAG CCGGCGAACA TCGCGAGCGG GCTGTACGGA CTCTTCTCCG GCTACGTGCC GCATATCCAG AAAGCCGCGC AGGAATTCGG CGTGAACGAG TTCGAGCTGT ACAAGCGGCT CGCGGAGCGC AAGCTCGTGG CCGGACAAGA GGACATCATC ATCGAAGAGG CGAGCCGTCT CGCACGCGAA CGGGACGTGC AGCGCGCGAC GGACGGCGTG CGGATCAGCG AGCTGTCCGC GTGA
|
Protein sequence | MILISDATLR DGNHAIRHQL SAAQIHAYAR AADEAGIDIV EVGHGNGLGG SSCLLGQTPI GDRLMLETAR AALRTSRLGV HFIPGLGKAA DIALALEIGV DVVRVATHCT EANVSARFIE QTRVAGRTAF GVLMMSHMAP SDVLLAQAKL MERYGAQAVV LMDSAGYSTP SLVRAKVERL VDGLDIDVGF HAHNNLGLAV ANSLVALEAG ARIVDACVKG FGAGAGNTQL ETLVAAMERE GHDTRTTFEH VMALARGTEA FLNPKTPHIQ PANIASGLYG LFSGYVPHIQ KAAQEFGVNE FELYKRLAER KLVAGQEDII IEEASRLARE RDVQRATDGV RISELSA
|
| |
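The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the method used by the source database, of how one might strip the spacing, estimate GC content, and translate the CDS. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in the start codons it permits), and all function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest over T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the CDS starts with ATG. Alternative start codons allowed by
    table 11 (for example GTG or TTG) are not special-cased here.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence above.
    head = "ATGATACTGA TCAGCGATGC GACCTTGCGC"
    print(translate(head))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MILISDATLR`, which matches the first block of the listed protein sequence.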