Gene Information |
Locus tag | Tpen_0073 |
Symbol | |
ID | 4600937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 56850 |
End bp | 57920 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639772827 |
Product | radical SAM domain-containing protein |
Protein accession | YP_919486 |
Protein GI | 119718991 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | |
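The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon, both of which are consistent with the values listed. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record: Start bp 56850, End bp 57920.
length = gene_length(56850, 57920)
print(length, protein_length(length))  # prints "1071 356"
```

`gc_percent` can be applied to the Gene sequence field to compare against the listed GC content. It strips the spaces between the 10-base blocks before counting.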
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTATACA TAGTGCACAC CACCGGCCAG TGTAACCTGA AGTGCGATTA CTGTGGAGGA AGCTTTGACC CAAGGAAAGT GCCGTGGAGC GTGAAGTACG ACTACCACCT GCTCAAGAAG TTGGTTGAAC GCGACGAAGA CTCTATCGTC GCGTTCTACG GAGGAGAACC GCTTCTCAAC AAGGACTTCG TGAAGTGGGT TCTGGAAAAC GTGAGGGCAA AGCACTTCGT GATTCAAACC AACGGAACAC TCGTGAAGTC TCTCCCTGAG AGGTACTGGC ACATGTTCGA CGCCGTGTTG CTATCCGTGG ACGGACGGGA GAGCGTAACA GACAAGCATA GAGGTTCAGG AGTGTATAGG CGGGTCGTCG AGTCTGCTAG ATGGCTCAGA GAGATAGGCT TCAAGGGGGA CCTGATAGCC AGGATGACAG TAACGGAGGA CACCGACATC TACGAAGACG TCAAGCACCT CCTCTCCTTG GGTCTCTTCA GCCATGTGCA CTGGCAGCTA GACGTCGTGT GGAGCGATAG GTGGAAGTGC TTCTCGTGCT GGAGGGATGG AAACTACCTA CCCGGAGTGG CGAAGCTCGT CAAGGAGTGG TTAGAGGAAA TGCGCAGAGG AAAGGTTCTG GGAATAGCGC CCTTCAAGGC GCTCATTACG CGAGCATTCC ACAAAAGCTA CTCGGCGCCC CCCTGCGGCG CGGGGGTAAG TTCTTTCTCG ATACTCACCG ACGGCAGGAT CGTCGCATGC CCCATAGCAG TGGAGGAGCG TTGGGCACTC GTCGGGAGAG TGGAGGACGG GGAGATACAC CCAGAAAGAG CCCCTAAAAT ATCTGAGCCG TGCTCCTCGT GCCGTTACTT CGAGTTCTGC GGTGGTCGTT GCCTCTACGC CTACATGGAG AGGCTGTGGG GCGAGGATGG CTTCCGCGAA GTGTGCGTAG CGACACAGAG GTTCATAGAC ATAGTTCTCA GCACGCTCAA CGAGGTGAAG GAGCTACTCT CAGCCGGGGT GGTCTCTATG AGCGATATTT ATTACCCTCC CTTCAACAAC ACCGTGGAAG TCATCCCCTA G
Protein sequence | MLYIVHTTGQ CNLKCDYCGG SFDPRKVPWS VKYDYHLLKK LVERDEDSIV AFYGGEPLLN KDFVKWVLEN VRAKHFVIQT NGTLVKSLPE RYWHMFDAVL LSVDGRESVT DKHRGSGVYR RVVESARWLR EIGFKGDLIA RMTVTEDTDI YEDVKHLLSL GLFSHVHWQL DVVWSDRWKC FSCWRDGNYL PGVAKLVKEW LEEMRRGKVL GIAPFKALIT RAFHKSYSAP PCGAGVSSFS ILTDGRIVAC PIAVEERWAL VGRVEDGEIH PERAPKISEP CSSCRYFEFC GGRCLYAYME RLWGEDGFRE VCVATQRFID IVLSTLNEVK ELLSAGVVSM SDIYYPPFNN TVEVIP