Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_0203 |
Symbol | |
ID | 4618314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 192610 |
End bp | 193641 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639783285 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_929728 |
Protein GI | 119871721 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0139093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0653049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAGC TTTCCCACGG ATCTGGAGGA GTTGAGACAG CCGAGATAAT TGAAAAGCTT TTCTTAAAGC GTTTACCAGA GAGTCTTAAA AAGGTGGCCG GGGGGCTTGG GCTTGATTTT CCTGATGATG CGGCAGCTAT ACCTATGGGA GATGGCCGGT ATCTCGTAGT GACTATTGAC GCATATACAG TCAACCCGCC CTTTTTCCCC GGGGGCGACA TCGGGGTGCT CGCCGCCTCG GGCTCTATAA ACGACGTGTT AATGCTCGGC GGTAGGCCCG TCGCCATGTT AGATTCGATT ATAGCAGAAG AGGGACTCCC CTACGAGACG CTCGACAGAG TAGTCAAGTC CTTCCTCTCT GTCCTAGAGA CGGAGGGCGT GGCCCTTATC GGCGGAGATT TCAAAGTAAT GCCCAAGGGC CAGCTCGATA AGATTGTGAT CACGACCGTG GGGATAGGGG TCGCGGAGAG AGTCATCGTG GATAGGCCGA GACACGGTGA CAAGATCGTG GTGAGCGACT TCGTGGGAGA TCACGGCGCT GTGATCCTTA TGTTGCAGAT GGGGGACGTG GATAAGCCAG AGCAACTCAA ACTAAAGAGT GACGTGAAGC CGCTTACCAA GCTCATGGTG CCGCTGGTGG AGAAATACGG CGAGTATATC CATGCGGCTA GAGACCCCAC GAGGGGGGGC CTCGCCATGG TGTTGAACGA CTGGGCTAAG GCTGGCGGCG GCGTAATAGT GGTAGAGGAG GAGAGTCTCC CTGTGAGACC AGAGGTGGCG TCATACGCCG GGATGCTCGG CATAGACCCG CTTTATTTAG CAAGCGAGGG TGTTGCAGTC CTCGCTATTG ATCCCTCTGT CGCAGAGGAA GTGGTGAAGT TCGTGAGGGG GCTGGGCTTT CAAAACGCGA GAATTGTCGG CGAGTTTAGA GAGGCGAAAC AACACAGAGG GTATGTCTTG CTTAAAACTC TCGCAGGCGG GCTTAGGATT CTGGAGCCTC CCAGAGGCGA CATAGTGCCG AGGATATGCT GA
|
Protein sequence | MIKLSHGSGG VETAEIIEKL FLKRLPESLK KVAGGLGLDF PDDAAAIPMG DGRYLVVTID AYTVNPPFFP GGDIGVLAAS GSINDVLMLG GRPVAMLDSI IAEEGLPYET LDRVVKSFLS VLETEGVALI GGDFKVMPKG QLDKIVITTV GIGVAERVIV DRPRHGDKIV VSDFVGDHGA VILMLQMGDV DKPEQLKLKS DVKPLTKLMV PLVEKYGEYI HAARDPTRGG LAMVLNDWAK AGGGVIVVEE ESLPVRPEVA SYAGMLGIDP LYLASEGVAV LAIDPSVAEE VVKFVRGLGF QNARIVGEFR EAKQHRGYVL LKTLAGGLRI LEPPRGDIVP RIC
|
| |