Gene Information |
Locus tag | Tpen_1682 |
Symbol | |
ID | 4600570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1629399 |
End bp | 1631045 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774455 |
Product | HEPN domain-containing protein |
Protein accession | YP_921080 |
Protein GI | 119720585 |
COG category | [S] Function unknown |
COG ID | [COG2250] Uncharacterized conserved protein related to C-terminal domain of eukaryotic chaperone, SACSIN |
TIGRFAM ID | |
|
|
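The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return abs(end_bp - start_bp) + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Tpen_1682.
length = gene_length_bp(1629399, 1631045)
protein = expected_protein_length(length)
print(length, protein)  # 1647 548
```

Both results agree with the Gene Length (1647 bp) and Protein Length (548 aa) fields, which is consistent with the assumed coordinate convention.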
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTAA ACGATTCGAG CGAGGTACCC ACGCTTGCAG CGAGGGATGC CTGCAGGAAT GTCAGCACGA TTCTTTTCGC CTGGCTCTAC GCCTCGGCGG TAATGCTGAT ACTCTACGAG TACAGCGGCT GGCAGCCCCT CGGGGAAGCC GCGAAAAGGT TCGGGGTGTA CGCGCTGACA GGCGGGGACG TAGCCTTCCT GCCGACCCTG CTAGCCTTCC TGGCAGCCCT CCTCGGAGCG GAGTGCGCCC AGCCGGACGA GAGTAGCGGG GTGTACAGGC TGAAGGTCTT CGCGTACTTG TCTACCGTCG CGTACGGGCT TGAAAACCTA GCCGCGAGGG AGGCGGCGTA CCTCCTTCCC ATCCTCGCGT TGCCAGCGAT GTACCCGGTC TTCAAGGTAA CCGCGGGGAG CGGTGGTGAA GCCTGGTTCC TGGGCGTAAC GCTCTTCTTC GCGGGGCTCC TAGCCCTCGC GGTCAAGCCG CCCGTGCTCG CATACGTGGG GTTCGCCTAC GCCTTCGTAG CCGTGGCTAG CCAGAGGTGG CGCGAGCTAC TCCGTAGCGA CGCCTCGCTG CTCGGCAAAG GGTCGGCCCT ACTGCTCCTA GCGGTACCCG CCTTCGCCGC GTCAGCCTCT GCCTACGAGG CTGCCCTCGG CTGGAACACC TACCTCGACG TACCGTTACT CGTGCTCGTC GCCTCGCTCA CGGTCTTCGC GGCTCTCCGC GTGCTTCTCG AGGGGAGTAG CGCGGGCGTA GTGGCGCCGG TGGTATCCTC GGCCGCGGCC TACGCTGCCC TCGAAGTGTT CAGGGGCACC GCCGTATCTG TGGTGGCGTT GCCCGTCACA GTCCTCCTCG CGGCCACGTA CGTGTACAGG CTTCTACCCC GCCGGGACGA CTACGCGGCT ACGGGTATAG CCTTGATGAC GGCTTACATG TCGCTGGTAG AGCCCCTGAG CTACGTGTTC GGGAAAAGCC CCTGCTACGC ATGCACGGCG GTAGCCGCGC CAGCAATAGC CGCGGGCCCG CTAACCACCG TGGGCGTTTT AAGGGTGCTC CTACTCCCAC CCGGGAAGGC GGAGCCAGAG CGGAAGCAGC CCCTCCTCGA AGCCTGGAGT CCTGCCGCGG TCGCCGTGCA GCGCGGGGCG AAGAGAGGGG CTGTTACCCC GGTGCGCGGT GCCGGCTACG CTAGGCCTAG GCGTAGGCGC CCCGTGGATT TCGACGGCTT AGCCGCCAGC TACTACCGGG AGGCGCAGAA GTACATGGAG CTAGCCTACG AGATGAGAAG GAGGGGGCTT CTCGACCAGT CGATGTTCTA CGCCGAGCAG TCCGTGGAGT TCCTGGTAGA CGCCTTGGCC CTCAAGGTGA AGAGGATAGT GCCGTCCGAG ATAGAGGACT TCAGGAAGCA CAGGCTCTTC TCGATGATGC TCTTCCTCGT GGAGGGCGAC GCCGTGCCGA GAAAGGTGGA GGAGTGCCTG CACTTCCTGT CTAAGAGCTA CACGAGGAGG TACAAGCTGG AGAGCGCCGT GACCTGGGAT GAGGCGGACA GGGCAGTAGA GTGTATGGAG CGCGCCTGGG AATACGCCAT GAGGAAGTTC GCGGAGCCTC TGCGCGAGCT ACGGGAGAAG AGCGCCGGGG AAGGGAAAAC CGGGTGA
|
Protein sequence | MGLNDSSEVP TLAARDACRN VSTILFAWLY ASAVMLILYE YSGWQPLGEA AKRFGVYALT GGDVAFLPTL LAFLAALLGA ECAQPDESSG VYRLKVFAYL STVAYGLENL AAREAAYLLP ILALPAMYPV FKVTAGSGGE AWFLGVTLFF AGLLALAVKP PVLAYVGFAY AFVAVASQRW RELLRSDASL LGKGSALLLL AVPAFAASAS AYEAALGWNT YLDVPLLVLV ASLTVFAALR VLLEGSSAGV VAPVVSSAAA YAALEVFRGT AVSVVALPVT VLLAATYVYR LLPRRDDYAA TGIALMTAYM SLVEPLSYVF GKSPCYACTA VAAPAIAAGP LTTVGVLRVL LLPPGKAEPE RKQPLLEAWS PAAVAVQRGA KRGAVTPVRG AGYARPRRRR PVDFDGLAAS YYREAQKYME LAYEMRRRGL LDQSMFYAEQ SVEFLVDALA LKVKRIVPSE IEDFRKHRLF SMMLFLVEGD AVPRKVEECL HFLSKSYTRR YKLESAVTWD EADRAVECME RAWEYAMRKF AEPLRELREK SAGEGKTG
|
| |
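The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, so the standard table is used here. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGTTTAA ACGATTCGAG CGAGGTACCC"
print(translate(prefix))  # MGLNDSSEVP
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the result to be compared with the GC content and protein sequence fields of the record.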