Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1187 |
Symbol | |
ID | 5055412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1074602 |
End bp | 1075828 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640468735 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_001153408 |
Protein GI | 145591406 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.255358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0273122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGT TCAAGTTCCC CGACTTGGGG GAGGGGCTCG TCGAGGGCGA GATCGTCAAG TGGCACGTGA AGGAGGGGGA TTTCGTGAAG GAGGGCGACC CCCTTGTGGA TGTTATGACG GAGAAGGCCA ACGTGACGTT GCCTGCCCCA GCCACAGGCA AGGTTGTGAA GATATTCGCG AAGGAGGGGG AGATCGTGAA GGTGGGACAG GTCCTCTGCG TCATAGAGGA GGTCGCCGCC CAAGAGGCGT CGCCCAAGGC GCCTGCCGCC GAGGCGTCGA CCTCCCAGAA GGTCGTTGCA ATGCCGGCGG CGAGGAGGCT GGCCAGGGAG CTGGGAATAG ATCTGTCTAA GGTGAAGGGG ACCGGGCCGG GCGGGGTGAT CACGGTCGAG GACGTGAGGC GCGCGGCCGA GGAGCTGGCG AGGCAAGAGA AGGCGCCGCC CGCCCCGCCG CCGGCGGCCG TCCAGCCGCC CCCGGCGATT GCCCAGCCCC AGGCTCCGGC AGCAGCCCAG TTGCCTCAAC CGGTTGCTGA GGAGGAGAGG ATACCGGTGA GGGGGATCAG AAGGGCAGTC GCCGAGAAGA TGGCCAAGTC TGCCTCCGCC ATACCCCACG CCTACCACTT CGAGGAGGTG GACGTCACGG AGCTCGTCTC GCTGAGGGAG AGGCTGAGGC AGGAGGCGGA GAGGCTGGGG GTTAAGCTGA CCTACCTCCC CTTCGTGGCC AAGGCGGTCG CGGTGGCGCT GAGGGAGTTC CCCATGTTGA ACTCCAGCTT CGACGAGGAG AGGGGCGAGA TCGTGGTGAA GAGGAGGATA CACTTGGGCT TCGCCGTGGA CACTGAGCAG GGGCTGATGG TCGTGGTGGT GAGGGATGCC GATAAGAAGA GCGTGTTGGA GATAGCGAGG GAGCTCAACG CCTTGGCGGA GAGGGCGAGG GCCGGCAAGG CCTCCGTGGA CGAGGTCAGG GGATCCACCT TCACCATCAC CAACATAGGC GCCATAGGGG GAGTGGGGGG CTTGCCCATC ATAAACTACC CCGAGGCGGC GATAATGGCC CTGGGCAAGA TCAGGAAGAT CCCCAGGGTA GTAAACGGCG CGGTCGTCCC CAGAGACGTC ATGAACGTGG TGGTGGGGTT CGACCACAGG GTGGTGGACG GGGCATACGT GGCGAGGTTC ACCAACAGAG TCAAGGAGCT GCTGGAGGAC GTGGGCAAGC TCCTCCTGTA CATATGA
|
Protein sequence | MIEFKFPDLG EGLVEGEIVK WHVKEGDFVK EGDPLVDVMT EKANVTLPAP ATGKVVKIFA KEGEIVKVGQ VLCVIEEVAA QEASPKAPAA EASTSQKVVA MPAARRLARE LGIDLSKVKG TGPGGVITVE DVRRAAEELA RQEKAPPAPP PAAVQPPPAI AQPQAPAAAQ LPQPVAEEER IPVRGIRRAV AEKMAKSASA IPHAYHFEEV DVTELVSLRE RLRQEAERLG VKLTYLPFVA KAVAVALREF PMLNSSFDEE RGEIVVKRRI HLGFAVDTEQ GLMVVVVRDA DKKSVLEIAR ELNALAERAR AGKASVDEVR GSTFTITNIG AIGGVGGLPI INYPEAAIMA LGKIRKIPRV VNGAVVPRDV MNVVVGFDHR VVDGAYVARF TNRVKELLED VGKLLLYI
|
| |