Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1781 |
Symbol | |
ID | 4601925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1723632 |
End bp | 1724858 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639774554 |
Product | hypothetical protein |
Protein accession | YP_921179 |
Protein GI | 119720684 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.420312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAGC GCGCGGCGGC CACGTGCCTG CAGTGCAGGG GGGCTAAGAG GCTCTGCGGT AAGAGTAGCT GCCCTGTGCT CGACTTGTGG CTAGCACTCG AGAGGGTCAG GGTTCCGGAG ACAAGAGAGA TCGACGGTTA CTCTCCGCCC ACCGTTTTCG TCGGGAGGCA CGGGTACCCA GAGGTCAGGT TCAGCGTCGG CGTCCCGTCC ATCGAGGGAG ACCCCGCGCT GTTCGAGGAC CAGGAGCGGT GGCTCTCGAT GCCCCTACGC GACGTGATAG GCATGAGGCT CGGGATAGTA AGGGGCGAGG TACGGGTCAA GGTCGACGGT CGGGGGCTCT CGGACGAGGT TAGGCTTGCC GCTCTCTCCT CTAGGCCCGT CGACGTCGAG ATCCTACTGG AGAAAGCCCC CAGGGCGAGG CCACTGGTAG ACCTCTTCTC CCCTCCCCTG GGGCCCGCCG GCCCGGCTGC GAAGATAAGC GTGCTGGGTA ACCCGAGCGT GCCCAGGAGC CTGGAGAAAG CCTACTACGA CTACGACCTG GGCGCCCGCG AGGCTATATA TGCACTCTAC AGGGAGGGCG TGCCGGTACA CTACATCCAG AGGGCTCTAT CCGTGGGGGC TCTGGGGCTG GGAGGGCGTA GGAGGATCGT CCCGACTAGG TGGGCTATAA CAGCCGTGGA CTCGACGCTG TCGCAGGAGC TGATCGAAGA GGTTAAGAGG CTGGACTACT TCGACGAGTA CCTGTTCTTC GAGAGAAAGT TCTCGGATAA CACGTTCGTG GCGATAATAG CGCCCGGCGC GTGGAGCTAC GAGTGGATAG AGGCGTGGTT CCCGCACACG ACGTGGAACC CCTCGGCGAG GCTCGAGGTC GAGGGGGACT GGGAGGGGTT CAAGGGTAGA ACCACGTACG CCTCGCTGGG AGGCTGCTAC TACGCCGCCA GGCTTGCAAC CGCGGAGTTC ATGCTCCGGG AGAAGAGGCA GGGAACGGCG ATACTCCTCC GCGAGATATA CGAGGGCTTC TTCCTGCCGA TAGGGGTCTG GTTCGTACGG GAAAACGTGA GGGAGCTCTT CAGGTCCAAG CCGGAGAGGT ACGAAAGCCT CGAAGAGGTG CTACGCAGGT TGGAGAAGTC TACGAGGCTA CCCCTGGGCA CGTGGCTCGC CGCGTCAACC CTCCTGAGGA GGCTTTTGAG GCAGAGTAGC ATCGAGGCGT ACATATGGAG GGGGTAG
|
Protein sequence | MGERAAATCL QCRGAKRLCG KSSCPVLDLW LALERVRVPE TREIDGYSPP TVFVGRHGYP EVRFSVGVPS IEGDPALFED QERWLSMPLR DVIGMRLGIV RGEVRVKVDG RGLSDEVRLA ALSSRPVDVE ILLEKAPRAR PLVDLFSPPL GPAGPAAKIS VLGNPSVPRS LEKAYYDYDL GAREAIYALY REGVPVHYIQ RALSVGALGL GGRRRIVPTR WAITAVDSTL SQELIEEVKR LDYFDEYLFF ERKFSDNTFV AIIAPGAWSY EWIEAWFPHT TWNPSARLEV EGDWEGFKGR TTYASLGGCY YAARLATAEF MLREKRQGTA ILLREIYEGF FLPIGVWFVR ENVRELFRSK PERYESLEEV LRRLEKSTRL PLGTWLAAST LLRRLLRQSS IEAYIWRG
|
| |