Gene Pars_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0472 
Symbol 
ID5054827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp416601 
End bp417929 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content58% 
IMG OID640468037 
Producthydroxypyruvate reductase 
Protein accessionYP_001152722 
Protein GI145590720 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2379] Putative glycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.488589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTATTA AAAATAAGGA GGTACTTCTC CGTATTCCCA AGGCAGACGT CCTACTAGAC 
GCCGTTGAGG CGGCCCTCGA AGCCGCCGAT CCCTACAACG CCGTTTTGTC CAAAGTTAAG
CTCCTTGGCA ATTACGTCGA GGTTGAGGGA AAGAGGTTCG AAATCGGGAG GGCTGTGCAC
GTGGTAGGCT TCGGCAAGGC CTCGGCTAGG ATGGCCGAGG CGTTGGTGGA GATCTTCGGA
GACTTAATAG CCGGCGGCGT CGTGATCACG CCTACGGGAG GCAATCGCGT TGGCCCCGTT
GAGGTGTTGA AAGGCAACCA CCCCCTCCCC GGAGAAGACA CGCTTAAAGC ATCCAAGAGG
CTTCTAGAAT ATCTACAAGA AGTTAGAGAG GGGGATACGG TCTTTGTGGC GATCTCCGGC
GGCGGATCTG CCCTATTCGA GGTGCCGGAG GAGGGGGTAG AGCTGGGCGA AATCGCCAAG
CTTTCGGACG AGTTGATGAA GAGGGGGGCC GACATCGTTG AGCTCAACAC AGTTAGAAAA
AGGCTCTCCG CGGTGAAGGG TGGGAAGCTG TTGAGGAACA TAAAGGCGAG GCGCGTGGTC
TCCCTAATCG TTAGCGACGT GGTGGGCGAC CGCCTCGACA CAATAGCCTC CGGCCCCACG
GCGCCCGACG CGACGGACAA GACCTTCGCA GTTGCAGTGT TGAAGAAATA CGGCCTCTGG
GACTCGTTGC CCGAGAGATT GCGTCGCCTA ATTGAGATTG AAACCCCTAA GGCGGGGGAT
CCGCTTTTCG ATAAGGTCAT AAACGTGCCT GTGGTCAACA ACCTCGGCTC TCTCCAGAAG
GCGGCAGAGC GCCTCGCCTT GCGCGGCTAC AACACGATAA TACTAACGTC CATGCTGGAG
GGGGAGGCCC GTGAGGTCGG GAGGGTTCTC GCCTCTGTTA TTAAAAGCGC GGCGCTCCAC
GGCTTTCCAG CATCTCCGCC CGTCGCAATT CTAGCCGGCG GTGAGACCGT AGTAACTGTG
AGAGGCAGAG GAAGGGGAGG CAGAAACCAG GAGATGTGCC TCTCCCTAGC CATGGCGATA
AGAGGGCTCA ACGCCACGGC TGCCTGCGTG GCCACTGACG GCATCGACGG GAACAGCCCA
GCCGCCGGAG CCCTCATCGA CGGCGGCGTC GTAGAGGAGG CCGAGAGGCT GGGGGTGAAC
CCGGCTGAGT ACCTAGACAA CAACGACAGC TACACCTTCT TCGAGAAGCT CGGCAGGGCC
ATAATCACCG GCTACACAGG GGTAAACGTC AACGACATAT TCCTCGCGGT TGTGGATAAA
GATAAATAG
 
Protein sequence
MIIKNKEVLL RIPKADVLLD AVEAALEAAD PYNAVLSKVK LLGNYVEVEG KRFEIGRAVH 
VVGFGKASAR MAEALVEIFG DLIAGGVVIT PTGGNRVGPV EVLKGNHPLP GEDTLKASKR
LLEYLQEVRE GDTVFVAISG GGSALFEVPE EGVELGEIAK LSDELMKRGA DIVELNTVRK
RLSAVKGGKL LRNIKARRVV SLIVSDVVGD RLDTIASGPT APDATDKTFA VAVLKKYGLW
DSLPERLRRL IEIETPKAGD PLFDKVINVP VVNNLGSLQK AAERLALRGY NTIILTSMLE
GEAREVGRVL ASVIKSAALH GFPASPPVAI LAGGETVVTV RGRGRGGRNQ EMCLSLAMAI
RGLNATAACV ATDGIDGNSP AAGALIDGGV VEEAERLGVN PAEYLDNNDS YTFFEKLGRA
IITGYTGVNV NDIFLAVVDK DK