Gene Information |
Locus tag | Tpen_0041 |
Symbol | |
ID | 4600982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 29704 |
End bp | 30792 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639772794 |
Product | radical SAM domain-containing protein |
Protein accession | YP_919454 |
Protein GI | 119718959 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1180] Pyruvate-formate lyase-activating enzyme |
TIGRFAM ID | |
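The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 29704
END_BP = 30792
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions the 1089 bp span corresponds to 363 codons, of which 362 encode amino acids.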
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.821324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA AGTTGCTGGG TAAACCGTTC GTCAGGGAGG CGGCATTCTG GGAGCCGGTT CAGGGGAAGC CGGGCTACGT GAAGTGCAAT CTCTGCAACA GGAGGTGCGT GATAGCCCCC GGTAGGTTCG GGGTTTGCGG TGTGAGGAAG AATATCGACG GCAAGCTTTA CACGCTGGTC TACGGCCTCT TGACAGCCGC GAATCTAGAC CCTATCGAGA AGAAGCCTCT CTCCCACTTC TACCCGGGTA GCGCGGTGTT CTCGGTGTCC ACGCCCGGCT GCAACTTTTT CTGCCAGTTC TGCCAGAACT GGGAGATAAG CCAGAGCAGG CTGGAGAGAG GGCTCTACGG GCACTACTAC CCCCCGGAGG ACGTCGTAAG GGAGGCTAAG AGGCTGCAGG CGGACGGGAT CTCGTACACC TACAACGAGC CAACGATATT CTACGAGTTC ATGCTGGACA CTGCGCGCCT AGCGAAGAAG GAGGGCCTCT TCAACACGAT GGTTACGAAC GGCTACATAT CCCCGGAGGC CCTCGACGAG CTGGCGCCCT ACCTGGACGC CGCCACCGTG GACTTCAAGG GAGGAGGCGA CCCGGAGTTT TACAGAAAGT TTATGGGGGT GCCAGACCCA AGCCCCATCT ACGACACGTT GCTCAGAATG AAGGAGAAGG GGATCCACGT AGAGATAACG AACCTTGTGG TTCCAATAGT GGGCGACGAC GAGGAGAAGC TGAGGTCCCT GGCTAGGTGG GTAGCGGAGA ACTTGGGCGA CGAGACGCCC TTCCACCTCC TGAGGTTCTA CCCCCACTAC AAGATGATCG ACTACCCGCC GACGGAGGTC GGGGACCTCG AAAAGCTCGC GGGGGTGGCG AGGGAGGAGG GGCTCAAGTA CGTCTACATA GGGAACGTGT GGGGGCACCC CCTCGAGAAC ACTTACTGCC CGAAGTGCGG CCACAGGGTC ATAGAGAGGA GGGGCTTCTT CATAGTGAAG TGGGATTTAA CGGAGGACAA CAGGTGCCCG GTGTGCGGCG CGAAGATAAA CATAAAGGGG AGCTACAGGA AAAGAAGCTG GGACGTCTTC TTCTACTAG
|
Protein sequence | MSDKLLGKPF VREAAFWEPV QGKPGYVKCN LCNRRCVIAP GRFGVCGVRK NIDGKLYTLV YGLLTAANLD PIEKKPLSHF YPGSAVFSVS TPGCNFFCQF CQNWEISQSR LERGLYGHYY PPEDVVREAK RLQADGISYT YNEPTIFYEF MLDTARLAKK EGLFNTMVTN GYISPEALDE LAPYLDAATV DFKGGGDPEF YRKFMGVPDP SPIYDTLLRM KEKGIHVEIT NLVVPIVGDD EEKLRSLARW VAENLGDETP FHLLRFYPHY KMIDYPPTEV GDLEKLAGVA REEGLKYVYI GNVWGHPLEN TYCPKCGHRV IERRGFFIVK WDLTEDNRCP VCGAKINIKG SYRKRSWDVF FY
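The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: the gene sequence is taken to be listed in the coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and the codon-to-amino-acid assignments used are those of NCBI translation table 11, which are the same as the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    head = "ATGAGCGATA AGTTGCTGGG TAAACCGTTC"
    print(translate(head))  # MSDKLLGKPF
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein and the reported GC content.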