Gene Tpen_0291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0291 
Symbol 
ID4601300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp258656 
End bp260293 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content53% 
IMG OID639773049 
Productthermosome 
Protein accessionYP_919704 
Protein GI119719209 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal
[TIGR02340] T-complex protein 1, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00322261 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGG CTCAGATCCC TGTATTAATA CTTAAAGAGG GTACTCAGAG AACCACCGGG 
AGAGATGCCA GGAAATCCAA CATCTACGCC GCAAAGGTCA TAGCGGAGGC CATGGCGAGC
TCTCTAGGTC CTAGGGGCAT GGACAAGCTC CTAGTTGATT CTTTTGGAAA CGCGACGATC
ACCGGTGACG GCGCGACCAT ACTCAAGGAG ATGGAAGTAC AGCACCCGGC CGCTAAAATG
CTCGTTGAGG TCGCGAAGGC TCAGGATGAC GAGGTTGGAG ACGGTACAAC CACTGTAGTC
GTCCTAGCAG GGCAGCTGCT CGCTGCCTCC GAGGAGCTTC TCGACGAGGA CATTCACCCC
ACGACAATAG TGGAGGGTTT CGAAAAGGCG CTCGTTGAGG CTACCAGAAT AATTGACGAG
ATTTCCGAGA CCGTAGACCC ACTCGACAGG ACTGTCCTCG AGAACGTTGC GAAGACTGCC
CTTTCAAGCA AGGTTGTAGC GGACTACAAG GACTTCTTGG CTAAGCTCGT CGTAGACGCT
GCTCTGACGG TCGTTGAAAA GAAGGACGGA AAGTACAACC TAAGCCTGGA CGATATCAAG
GTGGAGAAGA AGAGAGGAGA GAGCATAACG GAAACAATGC TGGTAAAGGG CATAGTGCTC
GACAAGGAGG TTGTGCACCC GGGTATGCCT AAGAGGGTTA CGAACGCGAA GATAGCCCTC
CTAGACGCCC CGCTAGAGAT AGAGAAGCCT GAGTGGACTG CTAAGATAAA CGTTACAACC
CCGGAGCAGC TGAAGATGTT CCTAGACCAG GAGGCGGAGA TCCTGAGAAA GAAGGTCGAG
AAGATTAAGG AGAGTGGTGC TAATGTTGTT TTCTGTCAGA AGGGTATTGA TGATGTTGCT
CAGTACTACT TGGCTAAGGC TGGTATTCTT GCTGTTAGGC GTGTGAAGAA GAGTGATATG
GAGAAGCTTG CTAGGGCTAC TGGTGCTAGG ATTCTCACTA GGGTGGAGGA TATTACGCCT
GAGGCTCTCG GTAGGGCTGA GCTTGTGGAG GAGAGGAAGG TTGCAGACGA GAAGATGGTA
TTCGTCGAGG GATGCCCCAA CCCCAAGAGC GTAACAATAC TAGTAAGAGG AGGGGCTGAC
CACGTAGTCG ACGAGGCCGA GAGGGCCATA CACGACGCTC TAAGCGTCGT GAGGAACGTG
ATCAGAGAGC CTAAGATCGT TGCCGGTGGA GGAGCTGTCG AAATAGAGCT CGCTATGAGG
CTCCGAGACT TTGCCAGAAC TCTGCCCAGC AGGGAACAGC TAGCTGTGCA GAAGTACGCC
GAGGCGCTTG AAAGCATCGT AGGCATCCTT GCCCAGAACG CCGGAATGGA GCCTATCGAC
GTACTAGCAG AACTCAAGAC ACGCCATGCG AAAGGCGAGA AGTGGGCAGG TGTAAATGCC
TACACGGCGA AAGTAGAGGA CATGAAGAAG GCAGGCGTCT TGGAGCCCGC GCTCGTAAAG
AAACAGGTAC TTAAATCGGC GACAGAGGCC GCTGTAATGA TACTGAGGAT CGACGATATC
ATTGCTGCTC AGCCGCCGAA GTCCAAGGAG AAGAAAGGAG AAGAGGAGAA GGAGAAGGAA
AAGACGGAGT TTGACTAG
 
Protein sequence
MAQAQIPVLI LKEGTQRTTG RDARKSNIYA AKVIAEAMAS SLGPRGMDKL LVDSFGNATI 
TGDGATILKE MEVQHPAAKM LVEVAKAQDD EVGDGTTTVV VLAGQLLAAS EELLDEDIHP
TTIVEGFEKA LVEATRIIDE ISETVDPLDR TVLENVAKTA LSSKVVADYK DFLAKLVVDA
ALTVVEKKDG KYNLSLDDIK VEKKRGESIT ETMLVKGIVL DKEVVHPGMP KRVTNAKIAL
LDAPLEIEKP EWTAKINVTT PEQLKMFLDQ EAEILRKKVE KIKESGANVV FCQKGIDDVA
QYYLAKAGIL AVRRVKKSDM EKLARATGAR ILTRVEDITP EALGRAELVE ERKVADEKMV
FVEGCPNPKS VTILVRGGAD HVVDEAERAI HDALSVVRNV IREPKIVAGG GAVEIELAMR
LRDFARTLPS REQLAVQKYA EALESIVGIL AQNAGMEPID VLAELKTRHA KGEKWAGVNA
YTAKVEDMKK AGVLEPALVK KQVLKSATEA AVMILRIDDI IAAQPPKSKE KKGEEEKEKE
KTEFD