Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0291 |
Symbol | |
ID | 4601300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 258656 |
End bp | 260293 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639773049 |
Product | thermosome |
Protein accession | YP_919704 |
Protein GI | 119719209 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal [TIGR02340] T-complex protein 1, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00322261 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGG CTCAGATCCC TGTATTAATA CTTAAAGAGG GTACTCAGAG AACCACCGGG AGAGATGCCA GGAAATCCAA CATCTACGCC GCAAAGGTCA TAGCGGAGGC CATGGCGAGC TCTCTAGGTC CTAGGGGCAT GGACAAGCTC CTAGTTGATT CTTTTGGAAA CGCGACGATC ACCGGTGACG GCGCGACCAT ACTCAAGGAG ATGGAAGTAC AGCACCCGGC CGCTAAAATG CTCGTTGAGG TCGCGAAGGC TCAGGATGAC GAGGTTGGAG ACGGTACAAC CACTGTAGTC GTCCTAGCAG GGCAGCTGCT CGCTGCCTCC GAGGAGCTTC TCGACGAGGA CATTCACCCC ACGACAATAG TGGAGGGTTT CGAAAAGGCG CTCGTTGAGG CTACCAGAAT AATTGACGAG ATTTCCGAGA CCGTAGACCC ACTCGACAGG ACTGTCCTCG AGAACGTTGC GAAGACTGCC CTTTCAAGCA AGGTTGTAGC GGACTACAAG GACTTCTTGG CTAAGCTCGT CGTAGACGCT GCTCTGACGG TCGTTGAAAA GAAGGACGGA AAGTACAACC TAAGCCTGGA CGATATCAAG GTGGAGAAGA AGAGAGGAGA GAGCATAACG GAAACAATGC TGGTAAAGGG CATAGTGCTC GACAAGGAGG TTGTGCACCC GGGTATGCCT AAGAGGGTTA CGAACGCGAA GATAGCCCTC CTAGACGCCC CGCTAGAGAT AGAGAAGCCT GAGTGGACTG CTAAGATAAA CGTTACAACC CCGGAGCAGC TGAAGATGTT CCTAGACCAG GAGGCGGAGA TCCTGAGAAA GAAGGTCGAG AAGATTAAGG AGAGTGGTGC TAATGTTGTT TTCTGTCAGA AGGGTATTGA TGATGTTGCT CAGTACTACT TGGCTAAGGC TGGTATTCTT GCTGTTAGGC GTGTGAAGAA GAGTGATATG GAGAAGCTTG CTAGGGCTAC TGGTGCTAGG ATTCTCACTA GGGTGGAGGA TATTACGCCT GAGGCTCTCG GTAGGGCTGA GCTTGTGGAG GAGAGGAAGG TTGCAGACGA GAAGATGGTA TTCGTCGAGG GATGCCCCAA CCCCAAGAGC GTAACAATAC TAGTAAGAGG AGGGGCTGAC CACGTAGTCG ACGAGGCCGA GAGGGCCATA CACGACGCTC TAAGCGTCGT GAGGAACGTG ATCAGAGAGC CTAAGATCGT TGCCGGTGGA GGAGCTGTCG AAATAGAGCT CGCTATGAGG CTCCGAGACT TTGCCAGAAC TCTGCCCAGC AGGGAACAGC TAGCTGTGCA GAAGTACGCC GAGGCGCTTG AAAGCATCGT AGGCATCCTT GCCCAGAACG CCGGAATGGA GCCTATCGAC GTACTAGCAG AACTCAAGAC ACGCCATGCG AAAGGCGAGA AGTGGGCAGG TGTAAATGCC TACACGGCGA AAGTAGAGGA CATGAAGAAG GCAGGCGTCT TGGAGCCCGC GCTCGTAAAG AAACAGGTAC TTAAATCGGC GACAGAGGCC GCTGTAATGA TACTGAGGAT CGACGATATC ATTGCTGCTC AGCCGCCGAA GTCCAAGGAG AAGAAAGGAG AAGAGGAGAA GGAGAAGGAA AAGACGGAGT TTGACTAG
|
Protein sequence | MAQAQIPVLI LKEGTQRTTG RDARKSNIYA AKVIAEAMAS SLGPRGMDKL LVDSFGNATI TGDGATILKE MEVQHPAAKM LVEVAKAQDD EVGDGTTTVV VLAGQLLAAS EELLDEDIHP TTIVEGFEKA LVEATRIIDE ISETVDPLDR TVLENVAKTA LSSKVVADYK DFLAKLVVDA ALTVVEKKDG KYNLSLDDIK VEKKRGESIT ETMLVKGIVL DKEVVHPGMP KRVTNAKIAL LDAPLEIEKP EWTAKINVTT PEQLKMFLDQ EAEILRKKVE KIKESGANVV FCQKGIDDVA QYYLAKAGIL AVRRVKKSDM EKLARATGAR ILTRVEDITP EALGRAELVE ERKVADEKMV FVEGCPNPKS VTILVRGGAD HVVDEAERAI HDALSVVRNV IREPKIVAGG GAVEIELAMR LRDFARTLPS REQLAVQKYA EALESIVGIL AQNAGMEPID VLAELKTRHA KGEKWAGVNA YTAKVEDMKK AGVLEPALVK KQVLKSATEA AVMILRIDDI IAAQPPKSKE KKGEEEKEKE KTEFD
|
| |