Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2667 |
Symbol | aceE |
ID | 4900466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2628121 |
End bp | 2630793 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640135894 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001066920 |
Protein GI | 126454194 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTACG TCGCAGCCGA ACGCGACGAC GACGCGCAGG AAACCGTCGA ATGGCTCGAA GCGCTCGACG GCGTGATTTC GTCGGTCGGC CCCGGCCGCG CGCACTACCT GATCGAAAAG CAGATCGAAT TCGCGCGCAT GCACGGCGAG CACTTGCCGT TCTCCGCGAA CACCCCGTAC ATCAACACGA TTCCCGTCGA AGCCCAGGCG AAGATTCCGG GCGACCAGGA CATCGAGCAC CGGATCCGCT CGTACACGCG CTGGAACGCG ATCGCGATGG TGCTGCGCGC GGGCAAGCAC ACGAACGTCG GCGGCCACAT CGCGTCGTTC GCTTCGGCCG CGACGCTCTA TGACGTCGGC TACAACCACT TCTGGCACGC GCCGTCCGCC GAGCACGGCG GCGATCTCGT GTTCGTGCAG GGCCATTCGT CGCCCGGCAT TTACTCGCGC GCGTTCCTGC TCGGCCGCCT GACGGAAGAT CAGCTCGACA ACTTCCGCCA GGAAGTGGGC GGCAACGGCA TCTCGTCGTA TCCGCACCCG TGGCTGATGC CGGATTTCTG GCAGTTCCCG ACCGTGTCGA TGGGCCTCGG TCCCATCATG GCGATCTATC AGGCGCGCTT CATGAAGTAC CTGCAGGCGC GCGGGATCGT GAAGACGGAA GGCCGCAAGG TCTGGGCGTT CCTCGGCGAC GGCGAGACCG ACGAGCCGGA ATCGCTCGGC GCGATCGGCA TGGCGAGCCG CGAGAAGCTC GATAACCTCG TGTTCGTGAT CAACTGCAAC CTGCAGCGTC TCGACGGCCC GGTGCGCGGC AACGGCAAGA TCATCCAGGA GCTCGAATCG GAGTTCCGCG GCGCCGGCTG GAACGTGATC AAGGTGATCT GGGGCAGCCG CTGGGATGCG CTGTTCGCGC GCGACAAGAC GGGCGCGCTG ATGCGCCGGA TGATGGAAGC CGTCGACGGC GAGTATCAGA CGTACAAGTC GGAGTCGGGC GCGTACGTGC GCGAGCACTT CTTCAACACG CCGGAGCTGA AGGCGCTCGT CGCCGACTGG TCCGACGACG ACATCTGGAA CCTGAACCGC GGCGGCCACG ATCCGCACAA GATCTACGCG GCGTTCCACG AGGCGAGCAA TTCGAAGGGC GCGCCGACGG TGATCCTTGC GAAGACGATC AAGGGCTACG GGATGGGCGA AGCCGGCCAG GCGATGAACA TCACGCACCA GCAGAAGAAG TTGCCCGTCG AGCAACTGAA GAAGTTCCGC GACCAGTTCC GCCTGCCGAT CGCCGACGAC GTGATCGCCG ACGTGCCGTA CCTGAAGTTC GACGAAGGCT CGAAGGAACT CGAGTACATG CGCGCGCACC GCCAGGCGCT CGGCGGCTAT CTGCCGCAGC GCCGCCAGAA GGCGCAATCG CTGCCGGTGC CGGCGCTCGA CGCGTTCGAG CCGCTGCTCA AGGGCACGGG CGAAGGCCGC GAGATCTCGA CGACGATGGC GTTCGTGCGG ATCCTGAACA TCCTGCTGAA GGACAAGGCG CTCGGCAAGC GCGTCGTGCC GATCGTGCCG GACGAATCGC GCACGTTCGG CATGGAGGGC CTGTTCCGCC AGATCGGCAT CTGGAACCAG GAAGGCCAGA AGTATGTGCC GGAAGATTCC GATCAACTGA TGTTCTACAA GGAATCGGAA ACCGGCCAGA TCCTGCAGGA AGGCATCAAC GAAGCGGGCG GCATGTGCGA CTGGATCGCG GCGGCGACGT CGTACTCGAC GCACGGCGAG ATCATGGTGC CGTTCTATAT CTTCTATTCG ATGTTCGGCT TCCAGCGCAT CGGCGATCTC GCGTGGGCGG CGGGCGACAT GCGCTCGCGC GGTTTCCTGC TCGGCGGCAC CGCGGGCCGC ACGACGCTCA ACGGCGAAGG CCTGCAGCAC GAAGACGGCC ACTCGCTGAT GTGGGCGGCT TCGGTGCCGA ACTGCGTGAG CTACGATCCG ACGTTCGGCT ACGAGCTCGC CGTCATCGTG CAGGACGGCC TGCGCCGGAT GGTGCAGGAG CAGGAGGACG TCTATTACTA CGTGACGGTG ATGAACGAGA ACTACGAGCA CCCGGCGATT CCGCAGGGCG AGCACGTGGC GGCCGACATC ATCAAGGGCA TGTACGCGTT CAGGAAGGCC GACGCCGACA AGAAGGCGCC GCGCGTGCAA CTGCTCGGCG CGGGCACGAT CTTCAACGAA GTGATCGCCG CGGCGGACCT GCTGAAGAAC GACTGGGGCG TCGCCGCCGA TCTCTGGAGC GTGCCGAGCT TCACCGAGCT CGCGCGCGAA GGCCATGACG TCGAGCGCTG GAACCTGCTG CATCCGGCCG AGGCGCGCCG CCTGTCGCAC GTGCAGACGT GCCTGAAGGA CACGCAGGGC CCGGTGATCG CGTCGACCGA CTACGTCCGC GCGCTCGCCG ACCAGATCCG CGGCCAGATC GACCGCCGCT ACGTCGTGCT CGGCACCGAC GGCTTCGGCC GCTCGGACAC GCGCGGCGCG CTGCGCCACT TCTTCGAGGT GGACCGCTAC TGGGTCACGG TCGCGGCGCT CAACGCGCTC GCCGATGAAG GCACGATCGA GCGCAAGGTC GTCGCCGACG CCATCGCGAA GTACAACCTC GACCCGGCCA AGCCCAACCC GATGACGGTT TAA
|
Protein sequence | MKYVAAERDD DAQETVEWLE ALDGVISSVG PGRAHYLIEK QIEFARMHGE HLPFSANTPY INTIPVEAQA KIPGDQDIEH RIRSYTRWNA IAMVLRAGKH TNVGGHIASF ASAATLYDVG YNHFWHAPSA EHGGDLVFVQ GHSSPGIYSR AFLLGRLTED QLDNFRQEVG GNGISSYPHP WLMPDFWQFP TVSMGLGPIM AIYQARFMKY LQARGIVKTE GRKVWAFLGD GETDEPESLG AIGMASREKL DNLVFVINCN LQRLDGPVRG NGKIIQELES EFRGAGWNVI KVIWGSRWDA LFARDKTGAL MRRMMEAVDG EYQTYKSESG AYVREHFFNT PELKALVADW SDDDIWNLNR GGHDPHKIYA AFHEASNSKG APTVILAKTI KGYGMGEAGQ AMNITHQQKK LPVEQLKKFR DQFRLPIADD VIADVPYLKF DEGSKELEYM RAHRQALGGY LPQRRQKAQS LPVPALDAFE PLLKGTGEGR EISTTMAFVR ILNILLKDKA LGKRVVPIVP DESRTFGMEG LFRQIGIWNQ EGQKYVPEDS DQLMFYKESE TGQILQEGIN EAGGMCDWIA AATSYSTHGE IMVPFYIFYS MFGFQRIGDL AWAAGDMRSR GFLLGGTAGR TTLNGEGLQH EDGHSLMWAA SVPNCVSYDP TFGYELAVIV QDGLRRMVQE QEDVYYYVTV MNENYEHPAI PQGEHVAADI IKGMYAFRKA DADKKAPRVQ LLGAGTIFNE VIAAADLLKN DWGVAADLWS VPSFTELARE GHDVERWNLL HPAEARRLSH VQTCLKDTQG PVIASTDYVR ALADQIRGQI DRRYVVLGTD GFGRSDTRGA LRHFFEVDRY WVTVAALNAL ADEGTIERKV VADAIAKYNL DPAKPNPMTV
|
| |