Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A3191 |
Symbol | |
ID | 4887904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 3019573 |
End bp | 3021030 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640133127 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_001064182 |
Protein GI | 126445162 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.581266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTGC ACGTCATCAA GATGCCGGAC ATCGGCGAAG GGATCGCGGA GGTCGAGCTC GGGCTGTGGC ACGTGAAGGT CGGCGATCGC GTGAAGGAAG ACCAGGCGAT CGCCGACGTG ATGACCGACA AGGCGTCGGT GGAGATTCCG TCGCCCGTCA CGGGCGTCGT CGTCGCGCTG GGCGGCAAGG AAGGCGACGT GCTCGCGGTG GGCAGCGAGC TCGTGCGGCT CGAAGTCGAA GGCGACGGCA ATCACAAGGC CGAGCCCGAC GGCGGCGCGC GCGCCGCCGC CGCGCAGCCC GAACGCGTCG CCGACACGGC GCACGCGCAT GCGAGCGCCG CTGCGAAATC CGCGCGCGGC GAGCACGGTG CCGGCCACGG GCGCGACGAT GCGCGCGCCG CATCGAGCGG CACATCGAGC GGCGCCTCGC ACGCGCAGCA CGAGCACGCC GAACGCGAGG CGCGCGGGCA TCGCGAATCC AGCGAATGCC GCGAAGACCG AAGCGCGTCG CGCGACGCTT CGCAAACCGA CGTCGAACGC GGGCCCGCGT CGCCGCCGCC CGCGCGCCGG CCGGGCGAGC GCCCGCTCGC GTCGCCTGCG GTGCGCAAGC GCGCGTGGGA TCTCGGCGTC GAGCTGCGCT ACGTGCACGG CACGGGCGAG GCAGGCCGAA TCCTGCACGA GGATCTCGAC GCGTACCTGC AAGGCCGCGG CGCGGCCGCG CAGCGCGCGC GCGGCGGCCA GGCCGCGTAC GTCGAGCGCC ACGACGAAGA GGCGGTGCCC GTGATCGGCC TGCGCCGCAA GATCGCGCAG CGGATGCAGG ACGCGAAGCG GCGCATCCCG CACTTCAGCT ACGTCGAGGA GATCGACGTC ACCGAGCTCG AAGCGCTGCG CGCGGAGCTG AACCGCAAGT ATGGCGACAC GCGCGGCCGG CTGACGGTGT TGCCGCTGCT CGCGCGCGCG ATGGTGATCG CGCTGCGCGA GTTCCCGCAG ATCAACGCGC GCTACGACGA CGAAGCCGGC GTCGTCACGC GTCACGGCGC GGTGCATCTG GGCATCGCGA CGCAGAGCAA GGCGGGCCTG ATGGTGCCCG TCGTGCGCCA CGCGGAGGCG CGCGATCCGT GGTCGATCGC GGCCGAGGTC GCGCGGCTCG CGGATGCGGC GCGCGCGGGC CGCGCGGAGC GCGACGAGCT GTCGGGCTCG ACGATCACGA TCACGAGCCT GGGCGCGCTG GGCGGCATCG CGTCGACGCC CGTCATCAAT TCGCCCGAAG TCGGCATCGT CGGCGTGAAC CGGATCGTCG AGCGGCCGAT GTTCCGCGGC GGCGCGGTGG TCGCGCGCAA GCTGATGAAC CTGTCTTCGT CGTTCGATCA CCGCGTGATC GACGGCATGG ACGCGGCCGA GTTCATCCAG GCCGTGCGCG CGCTGCTCGA GCAGCCCGCC CTTCTTTTCG TGGAATGA
|
Protein sequence | MGVHVIKMPD IGEGIAEVEL GLWHVKVGDR VKEDQAIADV MTDKASVEIP SPVTGVVVAL GGKEGDVLAV GSELVRLEVE GDGNHKAEPD GGARAAAAQP ERVADTAHAH ASAAAKSARG EHGAGHGRDD ARAASSGTSS GASHAQHEHA EREARGHRES SECREDRSAS RDASQTDVER GPASPPPARR PGERPLASPA VRKRAWDLGV ELRYVHGTGE AGRILHEDLD AYLQGRGAAA QRARGGQAAY VERHDEEAVP VIGLRRKIAQ RMQDAKRRIP HFSYVEEIDV TELEALRAEL NRKYGDTRGR LTVLPLLARA MVIALREFPQ INARYDDEAG VVTRHGAVHL GIATQSKAGL MVPVVRHAEA RDPWSIAAEV ARLADAARAG RAERDELSGS TITITSLGAL GGIASTPVIN SPEVGIVGVN RIVERPMFRG GAVVARKLMN LSSSFDHRVI DGMDAAEFIQ AVRALLEQPA LLFVE
|
| |