Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0026 |
Symbol | |
ID | 4601063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 19513 |
End bp | 20814 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639772779 |
Product | hypothetical protein |
Protein accession | YP_919439 |
Protein GI | 119718944 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1651] Protein-disulfide isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.727163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGA AAGTAGTATA CGCACTACTA GCAGTCGCTG TGGCCGTCGC CGCCGGGCTG AGCCTCCTCT TGCTGAAACA GGCTGCGCCG CCGGGAAGCG TCCAGCAGCC GCAGCCGTCC TGCCAGCCTA CAAGCCTAGT CTACGTGTAC CTTGACGAGG CGCAGAAAAA CCTGGGTGAC ACTATAACCT CTAACTTCAA GGTAGTGCTA CAGCAGTACG GGATCAACAT AGTGAATGTC CCCGTGTGCT ACCTGCCGGC ATCGAGCCTC CCGGAGAAGC TCAGAGTGTA CCCGGCGCTC CTTCTCAAAG GGAATATCTC CGCGCTAGAA CAACTGGTCG TAGGCGAGGT TGGGGGCTAC AAAGTTCTAA ACCCAGGTGT TTCAGCCTAC ATGGCTTCCT CCGCGGGGGC TTCTCCCGTT TTCACGTACC AGTCGGAAGC CATAATAGTG AACTCCACTG CTCCGTTTAC AGGCATAAGG GCGGGCGAGG ACGACATGAG GCGTATACTG AGCCAGCTCT CCCTCTCCAA TGTCTCCAGG ATAACCTACG TCACGCCGAG TAGTGTGAGT TTCACGTTGA CAAGGTTACC GGCGATAGTC TTCAAGTCGG ACTACAACCT CTCGAAGGGC TACGCCTTCA TAAAGCCTTT AGGCAACGGA TACTACACCT TCAGAGAGGA CGTCTCGGGC AGAGTACTGG AATATTTCGG AGTTCAAGTC TACGAAATCA GAACACCTCC TCCCTCTTAC CTGGCAAGGG AAGGTGTCCC TGTAGGCTCC TCTAGCTTAA CCCTTTACAT ACTCGAGGAC TACCACTGCC CGTTCTGCGC GAAGCTCATG GCGAGCCTTG GCGACACCTT TACGAGGCTA GCCAAGTCGG GGTCACTTAA AGTGGTACTT GTAGACCTAA TAGTGCACCC CGAGGTTGCA GAGATGCACG CTCTAGCTAA GTGCGTTTAC AACAAGACGG GGGACGGGTA CCTCTACTTC AACTTGTCGA GGAAGCTTTA CGACAAGCTG AACCAAGGCG TGAGCACTAC ACTCGAAGAC TTATCGAGCA TCGCGTCGAC GTACACTGGT AAGGCTTTGA TAGACGAGTG CCTCAAGCAG GTTAACGCCG GCGCGGAGCA TGTGAGGTCT CTCTCCCAGA AGCTGATAAG CGATGGCTAC ACGGGTACGC CGACGCTTAT ATTCTGGAAC CCCGAGAAGG GGAAAGGCCT CGTCGTACCT GGTTGCCTCG ATATTAACGC GTGCATGACC CAGGGGCAGC TCGACGAAGT ATTGTCCTGG CTGAAAAGCT AG
|
Protein sequence | MDKKVVYALL AVAVAVAAGL SLLLLKQAAP PGSVQQPQPS CQPTSLVYVY LDEAQKNLGD TITSNFKVVL QQYGINIVNV PVCYLPASSL PEKLRVYPAL LLKGNISALE QLVVGEVGGY KVLNPGVSAY MASSAGASPV FTYQSEAIIV NSTAPFTGIR AGEDDMRRIL SQLSLSNVSR ITYVTPSSVS FTLTRLPAIV FKSDYNLSKG YAFIKPLGNG YYTFREDVSG RVLEYFGVQV YEIRTPPPSY LAREGVPVGS SSLTLYILED YHCPFCAKLM ASLGDTFTRL AKSGSLKVVL VDLIVHPEVA EMHALAKCVY NKTGDGYLYF NLSRKLYDKL NQGVSTTLED LSSIASTYTG KALIDECLKQ VNAGAEHVRS LSQKLISDGY TGTPTLIFWN PEKGKGLVVP GCLDINACMT QGQLDEVLSW LKS
|
| |