Gene Tpen_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1781 
Symbol 
ID4601925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1723632 
End bp1724858 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID639774554 
Producthypothetical protein 
Protein accessionYP_921179 
Protein GI119720684 
COG category[S] Function unknown 
COG ID[COG1602] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.420312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGC GCGCGGCGGC CACGTGCCTG CAGTGCAGGG GGGCTAAGAG GCTCTGCGGT 
AAGAGTAGCT GCCCTGTGCT CGACTTGTGG CTAGCACTCG AGAGGGTCAG GGTTCCGGAG
ACAAGAGAGA TCGACGGTTA CTCTCCGCCC ACCGTTTTCG TCGGGAGGCA CGGGTACCCA
GAGGTCAGGT TCAGCGTCGG CGTCCCGTCC ATCGAGGGAG ACCCCGCGCT GTTCGAGGAC
CAGGAGCGGT GGCTCTCGAT GCCCCTACGC GACGTGATAG GCATGAGGCT CGGGATAGTA
AGGGGCGAGG TACGGGTCAA GGTCGACGGT CGGGGGCTCT CGGACGAGGT TAGGCTTGCC
GCTCTCTCCT CTAGGCCCGT CGACGTCGAG ATCCTACTGG AGAAAGCCCC CAGGGCGAGG
CCACTGGTAG ACCTCTTCTC CCCTCCCCTG GGGCCCGCCG GCCCGGCTGC GAAGATAAGC
GTGCTGGGTA ACCCGAGCGT GCCCAGGAGC CTGGAGAAAG CCTACTACGA CTACGACCTG
GGCGCCCGCG AGGCTATATA TGCACTCTAC AGGGAGGGCG TGCCGGTACA CTACATCCAG
AGGGCTCTAT CCGTGGGGGC TCTGGGGCTG GGAGGGCGTA GGAGGATCGT CCCGACTAGG
TGGGCTATAA CAGCCGTGGA CTCGACGCTG TCGCAGGAGC TGATCGAAGA GGTTAAGAGG
CTGGACTACT TCGACGAGTA CCTGTTCTTC GAGAGAAAGT TCTCGGATAA CACGTTCGTG
GCGATAATAG CGCCCGGCGC GTGGAGCTAC GAGTGGATAG AGGCGTGGTT CCCGCACACG
ACGTGGAACC CCTCGGCGAG GCTCGAGGTC GAGGGGGACT GGGAGGGGTT CAAGGGTAGA
ACCACGTACG CCTCGCTGGG AGGCTGCTAC TACGCCGCCA GGCTTGCAAC CGCGGAGTTC
ATGCTCCGGG AGAAGAGGCA GGGAACGGCG ATACTCCTCC GCGAGATATA CGAGGGCTTC
TTCCTGCCGA TAGGGGTCTG GTTCGTACGG GAAAACGTGA GGGAGCTCTT CAGGTCCAAG
CCGGAGAGGT ACGAAAGCCT CGAAGAGGTG CTACGCAGGT TGGAGAAGTC TACGAGGCTA
CCCCTGGGCA CGTGGCTCGC CGCGTCAACC CTCCTGAGGA GGCTTTTGAG GCAGAGTAGC
ATCGAGGCGT ACATATGGAG GGGGTAG
 
Protein sequence
MGERAAATCL QCRGAKRLCG KSSCPVLDLW LALERVRVPE TREIDGYSPP TVFVGRHGYP 
EVRFSVGVPS IEGDPALFED QERWLSMPLR DVIGMRLGIV RGEVRVKVDG RGLSDEVRLA
ALSSRPVDVE ILLEKAPRAR PLVDLFSPPL GPAGPAAKIS VLGNPSVPRS LEKAYYDYDL
GAREAIYALY REGVPVHYIQ RALSVGALGL GGRRRIVPTR WAITAVDSTL SQELIEEVKR
LDYFDEYLFF ERKFSDNTFV AIIAPGAWSY EWIEAWFPHT TWNPSARLEV EGDWEGFKGR
TTYASLGGCY YAARLATAEF MLREKRQGTA ILLREIYEGF FLPIGVWFVR ENVRELFRSK
PERYESLEEV LRRLEKSTRL PLGTWLAAST LLRRLLRQSS IEAYIWRG