Gene Tpen_1682 details


Gene Information

Locus tag: Tpen_1682
Symbol:
ID: 4600570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1629399
End bp: 1631045
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 64%
IMG OID: 639774455
Product: HEPN domain-containing protein
Protein accession: YP_921080
Protein GI: 119720585
COG category: [S] Function unknown
COG ID: [COG2250] Uncharacterized conserved protein related to C-terminal domain of eukaryotic chaperone, SACSIN
TIGRFAM ID:
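
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1629399
END_BP = 1631045


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming one terminal stop codon is not counted."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1647
print(protein_length(length))  # 548
```

Under these assumptions both results agree with the Gene Length (1647 bp) and Protein Length (548 aa) fields.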


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTAA ACGATTCGAG CGAGGTACCC ACGCTTGCAG CGAGGGATGC CTGCAGGAAT 
GTCAGCACGA TTCTTTTCGC CTGGCTCTAC GCCTCGGCGG TAATGCTGAT ACTCTACGAG
TACAGCGGCT GGCAGCCCCT CGGGGAAGCC GCGAAAAGGT TCGGGGTGTA CGCGCTGACA
GGCGGGGACG TAGCCTTCCT GCCGACCCTG CTAGCCTTCC TGGCAGCCCT CCTCGGAGCG
GAGTGCGCCC AGCCGGACGA GAGTAGCGGG GTGTACAGGC TGAAGGTCTT CGCGTACTTG
TCTACCGTCG CGTACGGGCT TGAAAACCTA GCCGCGAGGG AGGCGGCGTA CCTCCTTCCC
ATCCTCGCGT TGCCAGCGAT GTACCCGGTC TTCAAGGTAA CCGCGGGGAG CGGTGGTGAA
GCCTGGTTCC TGGGCGTAAC GCTCTTCTTC GCGGGGCTCC TAGCCCTCGC GGTCAAGCCG
CCCGTGCTCG CATACGTGGG GTTCGCCTAC GCCTTCGTAG CCGTGGCTAG CCAGAGGTGG
CGCGAGCTAC TCCGTAGCGA CGCCTCGCTG CTCGGCAAAG GGTCGGCCCT ACTGCTCCTA
GCGGTACCCG CCTTCGCCGC GTCAGCCTCT GCCTACGAGG CTGCCCTCGG CTGGAACACC
TACCTCGACG TACCGTTACT CGTGCTCGTC GCCTCGCTCA CGGTCTTCGC GGCTCTCCGC
GTGCTTCTCG AGGGGAGTAG CGCGGGCGTA GTGGCGCCGG TGGTATCCTC GGCCGCGGCC
TACGCTGCCC TCGAAGTGTT CAGGGGCACC GCCGTATCTG TGGTGGCGTT GCCCGTCACA
GTCCTCCTCG CGGCCACGTA CGTGTACAGG CTTCTACCCC GCCGGGACGA CTACGCGGCT
ACGGGTATAG CCTTGATGAC GGCTTACATG TCGCTGGTAG AGCCCCTGAG CTACGTGTTC
GGGAAAAGCC CCTGCTACGC ATGCACGGCG GTAGCCGCGC CAGCAATAGC CGCGGGCCCG
CTAACCACCG TGGGCGTTTT AAGGGTGCTC CTACTCCCAC CCGGGAAGGC GGAGCCAGAG
CGGAAGCAGC CCCTCCTCGA AGCCTGGAGT CCTGCCGCGG TCGCCGTGCA GCGCGGGGCG
AAGAGAGGGG CTGTTACCCC GGTGCGCGGT GCCGGCTACG CTAGGCCTAG GCGTAGGCGC
CCCGTGGATT TCGACGGCTT AGCCGCCAGC TACTACCGGG AGGCGCAGAA GTACATGGAG
CTAGCCTACG AGATGAGAAG GAGGGGGCTT CTCGACCAGT CGATGTTCTA CGCCGAGCAG
TCCGTGGAGT TCCTGGTAGA CGCCTTGGCC CTCAAGGTGA AGAGGATAGT GCCGTCCGAG
ATAGAGGACT TCAGGAAGCA CAGGCTCTTC TCGATGATGC TCTTCCTCGT GGAGGGCGAC
GCCGTGCCGA GAAAGGTGGA GGAGTGCCTG CACTTCCTGT CTAAGAGCTA CACGAGGAGG
TACAAGCTGG AGAGCGCCGT GACCTGGGAT GAGGCGGACA GGGCAGTAGA GTGTATGGAG
CGCGCCTGGG AATACGCCAT GAGGAAGTTC GCGGAGCCTC TGCGCGAGCT ACGGGAGAAG
AGCGCCGGGG AAGGGAAAAC CGGGTGA
 
Protein sequence
MGLNDSSEVP TLAARDACRN VSTILFAWLY ASAVMLILYE YSGWQPLGEA AKRFGVYALT 
GGDVAFLPTL LAFLAALLGA ECAQPDESSG VYRLKVFAYL STVAYGLENL AAREAAYLLP
ILALPAMYPV FKVTAGSGGE AWFLGVTLFF AGLLALAVKP PVLAYVGFAY AFVAVASQRW
RELLRSDASL LGKGSALLLL AVPAFAASAS AYEAALGWNT YLDVPLLVLV ASLTVFAALR
VLLEGSSAGV VAPVVSSAAA YAALEVFRGT AVSVVALPVT VLLAATYVYR LLPRRDDYAA
TGIALMTAYM SLVEPLSYVF GKSPCYACTA VAAPAIAAGP LTTVGVLRVL LLPPGKAEPE
RKQPLLEAWS PAAVAVQRGA KRGAVTPVRG AGYARPRRRR PVDFDGLAAS YYREAQKYME
LAYEMRRRGL LDQSMFYAEQ SVEFLVDALA LKVKRIVPSE IEDFRKHRLF SMMLFLVEGD
AVPRKVEECL HFLSKSYTRR YKLESAVTWD EADRAVECME RAWEYAMRKF AEPLRELREK
SAGEGKTG
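
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any comparison. The following is a minimal sketch, not part of the original record, of how the GC content could be recomputed and the gene translated. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGTTTAA ACGATTCGAG CGAGGTACCC"
print(translate(first_blocks))  # MGLNDSSEVP
```

The first thirty bases translate to MGLNDSSEVP, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` should give values that can be compared with the GC content and Protein Length fields of the record.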