Gene Tpen_0026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0026 
Symbol 
ID4601063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp19513 
End bp20814 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content55% 
IMG OID639772779 
Producthypothetical protein 
Protein accessionYP_919439 
Protein GI119718944 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1651] Protein-disulfide isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.727163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGA AAGTAGTATA CGCACTACTA GCAGTCGCTG TGGCCGTCGC CGCCGGGCTG 
AGCCTCCTCT TGCTGAAACA GGCTGCGCCG CCGGGAAGCG TCCAGCAGCC GCAGCCGTCC
TGCCAGCCTA CAAGCCTAGT CTACGTGTAC CTTGACGAGG CGCAGAAAAA CCTGGGTGAC
ACTATAACCT CTAACTTCAA GGTAGTGCTA CAGCAGTACG GGATCAACAT AGTGAATGTC
CCCGTGTGCT ACCTGCCGGC ATCGAGCCTC CCGGAGAAGC TCAGAGTGTA CCCGGCGCTC
CTTCTCAAAG GGAATATCTC CGCGCTAGAA CAACTGGTCG TAGGCGAGGT TGGGGGCTAC
AAAGTTCTAA ACCCAGGTGT TTCAGCCTAC ATGGCTTCCT CCGCGGGGGC TTCTCCCGTT
TTCACGTACC AGTCGGAAGC CATAATAGTG AACTCCACTG CTCCGTTTAC AGGCATAAGG
GCGGGCGAGG ACGACATGAG GCGTATACTG AGCCAGCTCT CCCTCTCCAA TGTCTCCAGG
ATAACCTACG TCACGCCGAG TAGTGTGAGT TTCACGTTGA CAAGGTTACC GGCGATAGTC
TTCAAGTCGG ACTACAACCT CTCGAAGGGC TACGCCTTCA TAAAGCCTTT AGGCAACGGA
TACTACACCT TCAGAGAGGA CGTCTCGGGC AGAGTACTGG AATATTTCGG AGTTCAAGTC
TACGAAATCA GAACACCTCC TCCCTCTTAC CTGGCAAGGG AAGGTGTCCC TGTAGGCTCC
TCTAGCTTAA CCCTTTACAT ACTCGAGGAC TACCACTGCC CGTTCTGCGC GAAGCTCATG
GCGAGCCTTG GCGACACCTT TACGAGGCTA GCCAAGTCGG GGTCACTTAA AGTGGTACTT
GTAGACCTAA TAGTGCACCC CGAGGTTGCA GAGATGCACG CTCTAGCTAA GTGCGTTTAC
AACAAGACGG GGGACGGGTA CCTCTACTTC AACTTGTCGA GGAAGCTTTA CGACAAGCTG
AACCAAGGCG TGAGCACTAC ACTCGAAGAC TTATCGAGCA TCGCGTCGAC GTACACTGGT
AAGGCTTTGA TAGACGAGTG CCTCAAGCAG GTTAACGCCG GCGCGGAGCA TGTGAGGTCT
CTCTCCCAGA AGCTGATAAG CGATGGCTAC ACGGGTACGC CGACGCTTAT ATTCTGGAAC
CCCGAGAAGG GGAAAGGCCT CGTCGTACCT GGTTGCCTCG ATATTAACGC GTGCATGACC
CAGGGGCAGC TCGACGAAGT ATTGTCCTGG CTGAAAAGCT AG
 
Protein sequence
MDKKVVYALL AVAVAVAAGL SLLLLKQAAP PGSVQQPQPS CQPTSLVYVY LDEAQKNLGD 
TITSNFKVVL QQYGINIVNV PVCYLPASSL PEKLRVYPAL LLKGNISALE QLVVGEVGGY
KVLNPGVSAY MASSAGASPV FTYQSEAIIV NSTAPFTGIR AGEDDMRRIL SQLSLSNVSR
ITYVTPSSVS FTLTRLPAIV FKSDYNLSKG YAFIKPLGNG YYTFREDVSG RVLEYFGVQV
YEIRTPPPSY LAREGVPVGS SSLTLYILED YHCPFCAKLM ASLGDTFTRL AKSGSLKVVL
VDLIVHPEVA EMHALAKCVY NKTGDGYLYF NLSRKLYDKL NQGVSTTLED LSSIASTYTG
KALIDECLKQ VNAGAEHVRS LSQKLISDGY TGTPTLIFWN PEKGKGLVVP GCLDINACMT
QGQLDEVLSW LKS