Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1613 |
Symbol | |
ID | 4601905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1558551 |
End bp | 1559612 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774386 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_921011 |
Protein GI | 119720516 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.112161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGCGTA TCCCGGGTGG CAAGGGGGTC GTAGAGCTAG CGGACGGCCA GGGGGGCGTG GAGACCCTCA GGCTCCTCGA GGAGCTCTTC TTCAGGAGGG TGAAGGAGCA CCTCAAGAGG GTGGAGGGGG GCGTCGGGAT AGATTTCCCG GACGACGGCG CGCTCATACC CCTGCCCGGC GGGGGCTTCC TCGTCGTCAC GGCGGACTCT TACACCGTGA ACCCTCCGTT CTTCCCGGGC GGGAACATAG GTAGCCTCGC GGTTCACGGA TCGATAAACG ACGTCGTGAT GATGGGCGGG AGGCCTGTCG CCATGCTCGA CACTATAGTG GTCGAGGAGG GCTTCCCGAT GGGCGACCTG GAGGCCATCG TCAACTCTAT GCTCGAGGCG CTCGAAAAGG AAGGTGTGCC GCTGATAGGC GGGGACTTCA AGGTCATGCC GAAGGGCCAG CTCGACAGGA TAACTGTGAC CACGGTGGGG CTGGGGGTTG CGCGTAGCCC CATAGTCGAC AAGCCGAGGG GCGGCGACAA GATAGTCGTC ACGGACTACG TCGGGGACCA CGGCGCGGTC ATCCTGATGC TCCAGATGGG GATGGGGGAC GTCGAGGAGA TAGCGAGCGG TTCTCTGAAG AGCGACTCGA AGCCCCTGAC GCCGCTACTA CCTGTCTTCG AGAAGTTCGC CGGGTACATC CACGCGGCGA GGGACCCGAC CCGCGGGGGG CTCGCGGGTG TTCTGAACGA GTGGGCTAGG AGCGGCGGGC TCCTGGCTGT CATCAGGGAG GAGGAGGTGC CGGTGCGGGA CGCCGTCAGG AGGTTCGCGG AGATGCTCGG CGTAGACCCC CTCTACCTCG CGAGCGAAGG CGCAGCGGTC CTCTCGGTCT CCCCCGAGGT AGCCGAGGAA GTAGTCTCCG AGCTCAGGGG GAGGGGTTTC CCGAACGCGA GGGTTATCGG GGAGTTCCGC GAGAACCCCA AGTACAGGGG CTTTGTCGTC TCCGAGACGG GGGTCGGCGG GCACAGGATT ATCGAGCCAC CTAGAGGGGT CATTGTCCCC AGGATATGCT GA
|
Protein sequence | MRRIPGGKGV VELADGQGGV ETLRLLEELF FRRVKEHLKR VEGGVGIDFP DDGALIPLPG GGFLVVTADS YTVNPPFFPG GNIGSLAVHG SINDVVMMGG RPVAMLDTIV VEEGFPMGDL EAIVNSMLEA LEKEGVPLIG GDFKVMPKGQ LDRITVTTVG LGVARSPIVD KPRGGDKIVV TDYVGDHGAV ILMLQMGMGD VEEIASGSLK SDSKPLTPLL PVFEKFAGYI HAARDPTRGG LAGVLNEWAR SGGLLAVIRE EEVPVRDAVR RFAEMLGVDP LYLASEGAAV LSVSPEVAEE VVSELRGRGF PNARVIGEFR ENPKYRGFVV SETGVGGHRI IEPPRGVIVP RIC
|
| |