Gene Pisl_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0203 
Symbol 
ID4618314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp192610 
End bp193641 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content55% 
IMG OID639783285 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_929728 
Protein GI119871721 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0139093 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0653049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGC TTTCCCACGG ATCTGGAGGA GTTGAGACAG CCGAGATAAT TGAAAAGCTT 
TTCTTAAAGC GTTTACCAGA GAGTCTTAAA AAGGTGGCCG GGGGGCTTGG GCTTGATTTT
CCTGATGATG CGGCAGCTAT ACCTATGGGA GATGGCCGGT ATCTCGTAGT GACTATTGAC
GCATATACAG TCAACCCGCC CTTTTTCCCC GGGGGCGACA TCGGGGTGCT CGCCGCCTCG
GGCTCTATAA ACGACGTGTT AATGCTCGGC GGTAGGCCCG TCGCCATGTT AGATTCGATT
ATAGCAGAAG AGGGACTCCC CTACGAGACG CTCGACAGAG TAGTCAAGTC CTTCCTCTCT
GTCCTAGAGA CGGAGGGCGT GGCCCTTATC GGCGGAGATT TCAAAGTAAT GCCCAAGGGC
CAGCTCGATA AGATTGTGAT CACGACCGTG GGGATAGGGG TCGCGGAGAG AGTCATCGTG
GATAGGCCGA GACACGGTGA CAAGATCGTG GTGAGCGACT TCGTGGGAGA TCACGGCGCT
GTGATCCTTA TGTTGCAGAT GGGGGACGTG GATAAGCCAG AGCAACTCAA ACTAAAGAGT
GACGTGAAGC CGCTTACCAA GCTCATGGTG CCGCTGGTGG AGAAATACGG CGAGTATATC
CATGCGGCTA GAGACCCCAC GAGGGGGGGC CTCGCCATGG TGTTGAACGA CTGGGCTAAG
GCTGGCGGCG GCGTAATAGT GGTAGAGGAG GAGAGTCTCC CTGTGAGACC AGAGGTGGCG
TCATACGCCG GGATGCTCGG CATAGACCCG CTTTATTTAG CAAGCGAGGG TGTTGCAGTC
CTCGCTATTG ATCCCTCTGT CGCAGAGGAA GTGGTGAAGT TCGTGAGGGG GCTGGGCTTT
CAAAACGCGA GAATTGTCGG CGAGTTTAGA GAGGCGAAAC AACACAGAGG GTATGTCTTG
CTTAAAACTC TCGCAGGCGG GCTTAGGATT CTGGAGCCTC CCAGAGGCGA CATAGTGCCG
AGGATATGCT GA
 
Protein sequence
MIKLSHGSGG VETAEIIEKL FLKRLPESLK KVAGGLGLDF PDDAAAIPMG DGRYLVVTID 
AYTVNPPFFP GGDIGVLAAS GSINDVLMLG GRPVAMLDSI IAEEGLPYET LDRVVKSFLS
VLETEGVALI GGDFKVMPKG QLDKIVITTV GIGVAERVIV DRPRHGDKIV VSDFVGDHGA
VILMLQMGDV DKPEQLKLKS DVKPLTKLMV PLVEKYGEYI HAARDPTRGG LAMVLNDWAK
AGGGVIVVEE ESLPVRPEVA SYAGMLGIDP LYLASEGVAV LAIDPSVAEE VVKFVRGLGF
QNARIVGEFR EAKQHRGYVL LKTLAGGLRI LEPPRGDIVP RIC