Gene Pars_1309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1309 
Symbol 
ID5054444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1181788 
End bp1182828 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID640468855 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001153524 
Protein GI145591522 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.19082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAGG TCGTCAAGTT GGCCCACGGG GCAGGCAGTG TGGAGACGTC GCAAATCCTC 
GAGTCATTGA TTTTCTCCAA GATCGAGGAG AGGCTTAAAA AAGTGGAGGG TGGCTTGGGT
ATAGACTTCC CCGACGATGC GGCGGCAATA CCCATGGGCG ATGGGCGCTT TTTGGTCGTG
ACGGTAGACT CCTACACGGT TAACCCGCCA TTTTTCCCCG GAGGCGATAT AGGCGTCTTA
GCGGCCTCAG GCTCTATCAA CGATGTCTTA ATGTTAGGCG GAAAGCCCAT TGCCCTCATG
GACGCCATCA TAGTAGAGGA GGGCTTCCCC CTGGAAGATC TGAGGAGAAT CGTGGATTCA
ATGTTGAGGG TGTTGCGCGA GGAGGGCGTC GCGCTGATAG GCGGCGACTT CAAGGTGATG
CCGAAGGGCC AGATAGACAA GATAGCGATA GCCACAGTGG GCATTGGGAT AGCCGATAGG
CTGATAGTGG ACAGGCCCCA GCCTGGCGAT AAAATAGTCG TGAGCGGATA TCTCGGAGAT
CACGGGGCTG TGATCTTGGC GAGGCAGATC GGCATAATAG ACGAAGGCTC GGGAGGTGGG
CTCGTAAGCG ACGTAAAGCC CTTGACCAGG CTCATGTTAC CTCTAGTCGA GAAGTACGGC
CCCCACATCC ACGCAGCACG CGACCCGACT AGAGGCGGGT TAGCCATGGC GCTCAACGAC
TGGGCCAAGG CCTCCGGCAC TGTCATCATC GTGGAAGAAT CTGCGATACC CATTAGGCCC
CAGGTGGCGT ACTACGCCAA CATGTTGGGC ATAGACCCCC TGGCGCTGGC CAGCGAAGGC
GCGGCCGTGC TATCTGTAAG CCCCGACGTA GCCGAAGAGG TCGTGGAGTT TATGAAGAAG
CTCGGCTTCG ACAATGCTGC AATCATAGGC GAGGTTAGAA AAGCCGAGAG GTACAGAGGG
TACGTCCTGC TCAAGACCGT GGTAGGGGGG CTGAGAATAC TTGAGGCTCC CCGTGGGGAC
CTCGTCCCGA GGATATGCTA A
 
Protein sequence
MGEVVKLAHG AGSVETSQIL ESLIFSKIEE RLKKVEGGLG IDFPDDAAAI PMGDGRFLVV 
TVDSYTVNPP FFPGGDIGVL AASGSINDVL MLGGKPIALM DAIIVEEGFP LEDLRRIVDS
MLRVLREEGV ALIGGDFKVM PKGQIDKIAI ATVGIGIADR LIVDRPQPGD KIVVSGYLGD
HGAVILARQI GIIDEGSGGG LVSDVKPLTR LMLPLVEKYG PHIHAARDPT RGGLAMALND
WAKASGTVII VEESAIPIRP QVAYYANMLG IDPLALASEG AAVLSVSPDV AEEVVEFMKK
LGFDNAAIIG EVRKAERYRG YVLLKTVVGG LRILEAPRGD LVPRIC