Gene Pars_1189 details


Gene Information

Locus tag: Pars_1189
Symbol:
ID: 5055825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1076790
End bp: 1077902
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 59%
IMG OID: 640468737
Product: pyruvate dehydrogenase (acetyl-transferring)
Protein accession: YP_001153410
Protein GI: 145591408
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID:
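
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1076790
end_bp = 1077902

# An inclusive span covers end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1113 370
```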


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.605402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0471235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGAAA TAGATTTTAA TCAAAATTAT AAGATATCAG TAAAGGAGCC CCAAGTCCTA 
AGGGTCATAG AGCCTGACGG AACGTTGAGA GAGGAGGCAG AGCTCGGGTA CAAGCCGTCC
GAGGGGGAGC TGGTTAAATT ATACCGCTAC ATGGTAACCG CCCGGGTTCT CGACAGGCAC
GCCTTGCTTC TGCACAGGAT GGGCAAGGTT AAGTCCACTT ATGGTCCTCA CGAGGGTCAT
GAAGCCGCCG ATGCCGGCAC TGTCCACGTA TTGAAGCCGG AGGACTGGAT TGCCCCGTAT
TACCGGATGC TCACGGCTCT CTTGATCCGC GGCGTGCCGT TGCAGACCAT CTGGGCTAAG
TTCTTTGCGA AGCAAGGGGA TCCTGACAAG GGGAGGAACT TGACGGTTGA GTGGGGCGGC
TTCGCCAAGT GGCGCATTTT GTCCGTCGGC GCCCCGATCG GCCACCAGTA CATCTACGCG
GCCGGCTTCG CCTACGCTCT TAGGTACATG AAGAGGGATG AGATAGTGGC GGCCTATATA
GGCGATGGCG GCACCTCCAC TAACGGCTTC CACACGGGCC TCAACTTTGC CGGCGTCTTC
AAACTACCCG TCGTGTTCTA CGTCTACAAC AACCAATACG CCATATCGGT GCCCGTGCGC
AGCCAGACTG CCGTGACGAG GCTGGCCATC AAGGCCGCCG CATACGGCAT AGAGGGGATC
GCTACCGACG GCATGGATCT CCTCGCGGTG CTCAAGGCGG CTCACTACGC GGTATCCAAG
GCGAGGAGGG GCGAGCCGGT GCTGGTGGAG CTGATCACGT ATCGCTTTGG CCCCCACACA
ACCGCCGACG ACCCGGCGAC GCGCTATAGG GATCCAGCCG AGGCCGAGGA ATGGAGGCGC
TACGACCCCA TAGCGAGGCT CGGGGCTTAC TTCAAGAAAT ACGGCATCTT GACCGAGAGG
GAGATAAAGC TGACGTGGGA GGAGGCGGAG GCAGAGGTCA AGGTGGCGGC CAAGGAGGCC
GAGTCGTACC CCGAAATACC GAAGGAGTGG ATCGTCGAGG ATGTATACAG CTTTATCCCG
CCACACTTGA GGGAGGAGCT GGAGGAGCTA TGA
 
Protein sequence
MLEIDFNQNY KISVKEPQVL RVIEPDGTLR EEAELGYKPS EGELVKLYRY MVTARVLDRH 
ALLLHRMGKV KSTYGPHEGH EAADAGTVHV LKPEDWIAPY YRMLTALLIR GVPLQTIWAK
FFAKQGDPDK GRNLTVEWGG FAKWRILSVG APIGHQYIYA AGFAYALRYM KRDEIVAAYI
GDGGTSTNGF HTGLNFAGVF KLPVVFYVYN NQYAISVPVR SQTAVTRLAI KAAAYGIEGI
ATDGMDLLAV LKAAHYAVSK ARRGEPVLVE LITYRFGPHT TADDPATRYR DPAEAEEWRR
YDPIARLGAY FKKYGILTER EIKLTWEEAE AEVKVAAKEA ESYPEIPKEW IVEDVYSFIP
PHLREELEEL
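
The protein sequence can be related back to the gene sequence by translating it codon by codon with translation table 11. The following is a minimal sketch, not the pipeline that produced this record. The function name `translate_cds` is hypothetical, and the code makes two simplifying assumptions: the first codon is always read as methionine (in table 11, GTG can act as a start codon), and translation stops at the first stop codon. The sense-codon assignments in table 11 are the same as in the standard code.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the NCBI tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is an initiator and is read as M.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_block = "GTGCTGGAAA TAGATTTTAA TCAAAATTAT"
print(translate_cds(first_block))  # prints: MLEIDFNQNY
```

The output matches the first ten residues of the protein sequence listed above, including the GTG start codon being read as M.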