Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1538 |
Symbol | |
ID | 4600380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1485857 |
End bp | 1486918 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774312 |
Product | hypothetical protein |
Protein accession | YP_920937 |
Protein GI | 119720442 |
COG category | [S] Function unknown |
COG ID | [COG3367] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAGG GAGAAGCGGT AATACTGGCG GAGGGTCTCT ACTCGACGAC TGACGGCAAG ACGGCGCACG GGCTCGTCAG GAAGAGCCTA CGCTACAAGA TAGTCGGGGT GATAGACAGC ACCCTCGCTG GGAGGGATGC AGGCGAAGTG CTCGACGGGA AACCTCGCGG CATAAAGATA TACTCCTCCC TCGAGGAGGC GTTGAAGGAG CACCCCGGGG TAAAGTTCCT CATAATCGGC GTGGCGACTC CTGGTGGCAG GCTCCCGCCG TCCTACAGGG AGGTGGTGAA GGAGGCGTTG AAGAGGGGGA TAAGCGTGGT TTCTGGTTTA CACGAGTTTC TCAGTGACGA CCCGGAGCTC TCTAGGATAG CCAGGGAGAC TGGCGCCGAA ATAATAGACG TGAGGAAGAT ATACTACAAC ACCAGGAAGT TCTACACAGG GAAGATAAAG GAGGTCAAAG CCCTCAAAGT GGTCGTGATA GGGACGGACT CGGCTGTCGG CAAGAGGACG GTTGCCCACA TGGTGACCGA CGAGCTCAAC GCGAGGGGAA TTAAAGCCGT CTTCGTCGGT ACGGGGCAGA CTGCCTGGAT GCAGGGCGCC AAGTACGTCT TCGTGCTTGA CAGCGTGATA AACGACTTCG TCCCCGGAGT ACTGGAGGAC GTCGTGTGGA GGGCGTACAG CGAGGAGAAG CCCAAGGTCA TAGTCGTGCC GGGTCAGGGT AGCCTGCTTC ACCCGGTGTT CCCGGGGAGC TACGAGATAC TGAACCTGTT GAAACCCGAG GTCACCATCC TGCACCACGC GCCAGGTAGG AAGCACTTGG ACGGGTTCCC CGAGTACCCG GTTCCACCCC TGGAGAAGTT CCTGAAGCTC GTCGAGATAA TAACCGATAG GAGGGTCTTC GCGATCACTC TCAGCACCGA GGGGCTCTCG GAGCGGGAGG TGCTCGCCGA GAGGGAGAAG CTTGAGAAGG AGCTCGGGAT CCCCGTCGTG GTCCCGATGA TCGAGGGCGT CGGGAGGATT GTCGACGAGA TAACCAGGAG GTTCCCGGAA GTGGTGGGCT AA
|
Protein sequence | MEEGEAVILA EGLYSTTDGK TAHGLVRKSL RYKIVGVIDS TLAGRDAGEV LDGKPRGIKI YSSLEEALKE HPGVKFLIIG VATPGGRLPP SYREVVKEAL KRGISVVSGL HEFLSDDPEL SRIARETGAE IIDVRKIYYN TRKFYTGKIK EVKALKVVVI GTDSAVGKRT VAHMVTDELN ARGIKAVFVG TGQTAWMQGA KYVFVLDSVI NDFVPGVLED VVWRAYSEEK PKVIVVPGQG SLLHPVFPGS YEILNLLKPE VTILHHAPGR KHLDGFPEYP VPPLEKFLKL VEIITDRRVF AITLSTEGLS EREVLAEREK LEKELGIPVV VPMIEGVGRI VDEITRRFPE VVG
|
| |