Gene Pars_1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1188 
Symbol 
ID5055470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1075825 
End bp1076793 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content64% 
IMG OID640468736 
Producttransketolase, central region 
Protein accessionYP_001153409 
Protein GI145591407 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.489911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0419885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGCCA ACATGGCCAA GGCTCTGAAC ATGGCTCTTC GCGAGGAGAT GGAAAGGGAT 
CCCCGGGTGG TGATCCTCGG CGAGGACGTG GGGAAGAAGG GCGGCGTCTT CCTAATAACC
GAGGGGCTCT ACGAGAAGTT CGGCCCCGAG CGCGTCATAG ACACGCCGCT CAACGAGGGC
GGCATCATCG GCTTCGCCCT GGGGATGGCC CTAGCGGGTC TCAAGCCCGT GGCGGAGATC
CAGTTCGCAG ATTTCTTCTG GCTAGGAGCC GACGAGCTGT TAAACCACGT GGCTAAGATC
AGGTACCGCT CCGGCGGCAA CTTCAAGGCG CCTTTAGTGG TCAGGATGCC CTACGGCGCC
GGCGTCAAGT CGGGCCTTTA CCACAGCCAA AGCCCCGAGG CCTACCTAGT GCACACACCC
GGCCTCGTGG TGGTGGCGCC CTCCACGCCC TACAACGCCA AGGGCCTCCT CAAGGCCGCC
ATAAGGAGCG ACGACCCGGT CGTGTTCCTG GAGCCCAAGG CCCTCTACAG AGCGCCGAGG
GAGGAGGTCC CCGAGGAGGA CTACGTGGTG CCGCTGGGGA AGGCGAGGAT AGCGAGGGAG
GGAGACGACG TAACCTTGGT CACATACGGC GCCATGTTGC CAAGATGTCT GGAGGCCGCC
GAGAAGGCCA AGGCGTCTGT GGAGGTGGTG GACCTCCAGA CCCTCAACCC CATGGACTAC
GAGACGGTGA TCAAGAGCGT GTCGAAGACC GGCAGGCTTG TGGTGGTCCA CGACGCCCCG
AAGACCGGCG GCCTCGGCGC CGAGGTGGCG GCCATCGTCG CCGAGAAGGC CCTCCACGCG
CTGACGGCGC CCGTGGTTCG CGTGGCCGGC CCAGACGTGC CCCAGGCCCC TGTCGTCCAC
GACGACGTAT ACGTCCCGAC GGTCGAGAGG ATACTGAGGG CGATAGACAA GGTGATGGCC
TACTCATGA
 
Protein sequence
MIANMAKALN MALREEMERD PRVVILGEDV GKKGGVFLIT EGLYEKFGPE RVIDTPLNEG 
GIIGFALGMA LAGLKPVAEI QFADFFWLGA DELLNHVAKI RYRSGGNFKA PLVVRMPYGA
GVKSGLYHSQ SPEAYLVHTP GLVVVAPSTP YNAKGLLKAA IRSDDPVVFL EPKALYRAPR
EEVPEEDYVV PLGKARIARE GDDVTLVTYG AMLPRCLEAA EKAKASVEVV DLQTLNPMDY
ETVIKSVSKT GRLVVVHDAP KTGGLGAEVA AIVAEKALHA LTAPVVRVAG PDVPQAPVVH
DDVYVPTVER ILRAIDKVMA YS