Gene Pars_1187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1187 
Symbol 
ID5055412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1074602 
End bp1075828 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content64% 
IMG OID640468735 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_001153408 
Protein GI145591406 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.255358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0273122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGT TCAAGTTCCC CGACTTGGGG GAGGGGCTCG TCGAGGGCGA GATCGTCAAG 
TGGCACGTGA AGGAGGGGGA TTTCGTGAAG GAGGGCGACC CCCTTGTGGA TGTTATGACG
GAGAAGGCCA ACGTGACGTT GCCTGCCCCA GCCACAGGCA AGGTTGTGAA GATATTCGCG
AAGGAGGGGG AGATCGTGAA GGTGGGACAG GTCCTCTGCG TCATAGAGGA GGTCGCCGCC
CAAGAGGCGT CGCCCAAGGC GCCTGCCGCC GAGGCGTCGA CCTCCCAGAA GGTCGTTGCA
ATGCCGGCGG CGAGGAGGCT GGCCAGGGAG CTGGGAATAG ATCTGTCTAA GGTGAAGGGG
ACCGGGCCGG GCGGGGTGAT CACGGTCGAG GACGTGAGGC GCGCGGCCGA GGAGCTGGCG
AGGCAAGAGA AGGCGCCGCC CGCCCCGCCG CCGGCGGCCG TCCAGCCGCC CCCGGCGATT
GCCCAGCCCC AGGCTCCGGC AGCAGCCCAG TTGCCTCAAC CGGTTGCTGA GGAGGAGAGG
ATACCGGTGA GGGGGATCAG AAGGGCAGTC GCCGAGAAGA TGGCCAAGTC TGCCTCCGCC
ATACCCCACG CCTACCACTT CGAGGAGGTG GACGTCACGG AGCTCGTCTC GCTGAGGGAG
AGGCTGAGGC AGGAGGCGGA GAGGCTGGGG GTTAAGCTGA CCTACCTCCC CTTCGTGGCC
AAGGCGGTCG CGGTGGCGCT GAGGGAGTTC CCCATGTTGA ACTCCAGCTT CGACGAGGAG
AGGGGCGAGA TCGTGGTGAA GAGGAGGATA CACTTGGGCT TCGCCGTGGA CACTGAGCAG
GGGCTGATGG TCGTGGTGGT GAGGGATGCC GATAAGAAGA GCGTGTTGGA GATAGCGAGG
GAGCTCAACG CCTTGGCGGA GAGGGCGAGG GCCGGCAAGG CCTCCGTGGA CGAGGTCAGG
GGATCCACCT TCACCATCAC CAACATAGGC GCCATAGGGG GAGTGGGGGG CTTGCCCATC
ATAAACTACC CCGAGGCGGC GATAATGGCC CTGGGCAAGA TCAGGAAGAT CCCCAGGGTA
GTAAACGGCG CGGTCGTCCC CAGAGACGTC ATGAACGTGG TGGTGGGGTT CGACCACAGG
GTGGTGGACG GGGCATACGT GGCGAGGTTC ACCAACAGAG TCAAGGAGCT GCTGGAGGAC
GTGGGCAAGC TCCTCCTGTA CATATGA
 
Protein sequence
MIEFKFPDLG EGLVEGEIVK WHVKEGDFVK EGDPLVDVMT EKANVTLPAP ATGKVVKIFA 
KEGEIVKVGQ VLCVIEEVAA QEASPKAPAA EASTSQKVVA MPAARRLARE LGIDLSKVKG
TGPGGVITVE DVRRAAEELA RQEKAPPAPP PAAVQPPPAI AQPQAPAAAQ LPQPVAEEER
IPVRGIRRAV AEKMAKSASA IPHAYHFEEV DVTELVSLRE RLRQEAERLG VKLTYLPFVA
KAVAVALREF PMLNSSFDEE RGEIVVKRRI HLGFAVDTEQ GLMVVVVRDA DKKSVLEIAR
ELNALAERAR AGKASVDEVR GSTFTITNIG AIGGVGGLPI INYPEAAIMA LGKIRKIPRV
VNGAVVPRDV MNVVVGFDHR VVDGAYVARF TNRVKELLED VGKLLLYI