Gene Tpen_0693 details

Gene Information

Locus tag: Tpen_0693
Symbol:
ID: 4602008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 644108
End bp: 645769
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 53%
IMG OID: 639773467
Product: thermosome
Protein accession: YP_920098
Protein GI: 119719603
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
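
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 644108
end_bp = 645769

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1662 bp) and Protein Length (553 aa).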


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.540733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCGG TGGCACAACT AGGAGGAGTT CCAGTGTTGA TCCTGAAAGA GGGTACTTCG 
CGCCAGGCCG GGCGCGAAGC GCTTCACCTC AACATAATGA TCGCGAAAGC AGTCGCGGAG
ACCGTCAAGA CTACGCTAGG TCCTAAAGGC ATGGATAAGA TGCTTATCGA CACTCTCGGA
GACATAACGG TCTCCAACGA CGGTGCAACA ATCCTAGACG AGATGGACGT ACAGCACCCG
ATCGCTAAGC TGATGGTCGA GGTAGCAAAA GCACAGGACA AAGAAGTCGG AGACGGCACT
ACGACCGCTG TTGTGCTGAC AGGAGAACTT CTAAAGGAGG CCGAGAAGCT CCTCGAAAAG
AACATACACC CAACAATAAT CGTCAGCGGC TACAAGAAGG CTGCGGAGAA GGCCCGCGAA
ATTCTGGCCT CCAAGGCGAT AAAGGTAGAC CTTAACGACA CGGAGACCTT GAAGAAAGTA
GCAGCGACGT CCATGCGGAG TAAGGCGGTA GCCGCCCTAA GGGACTACTT CGCAGACATA
GCGGTTAAAG CCGTTAAACA GGTAGCCGAG GTAGTTAACG GCAAGTATGT CGTTGATATC
GACAACATTC AGATAATCAA GAAGAAGGGA GGAGCATTCC TGGATACACA GCTCATATAC
GGCATAGTCG TCGACAAGGA GGTTGTGCAC CCGGGTATGC CTAAGAGGGT TACGAACGCG
AAGATAGCCC TTCTAGACGC CCCGCTGGAA GTAGAGAAGA CGGAGATAGA CGCGGAGATT
AGGATCTCCT CGCCGGACCA GATGCACCAG TTCCTCGAGG AGGAGGAGAA GATACTCAGA
GACATGGTCG AGAAGATTAA GGAGAGTGGT GCTAATGTTG TTTTCTGTCA GAAGGGTATT
GATGATGTTG CTCAGTACTA CTTGGCTAAG GCTGGTATTC TTGCTGTTAG GCGTGTGAAG
AAGAGTGATA TGGAGAAGCT TGCTAGGGCT ACTGGTGCTA GGATTCTTAC TAGGGTTGAG
GATATTACGC CTGAGGCTCT CGGTAGGGCT GAGCTTGTGG AGGAGAGGAA GGTTGCAGAC
GAGAAGATGG TATTCGTCGA GGGATGCCCC AACCCCAAGA GCGTAACAAT ACTAGTAAGA
GGAGGCTTTG AGAGGGCTGT CGACGAGGCC GAGAGATCCA TAAAGGATGC GCTCTACGCG
GTGGCAGACG TGCTGAAGCA TCCCTACATA GTGCCTGGAG GCGGAGCGAT CGAGGCTGAG
CTCGCGAGGG AGCTTCGAAA GTACGCTCCG GAGGTCGGAG GAAAGGAGCA GCTCGCGATA
GAAGCATTTG CGAACGCCCT GGAAAGCATA CCAAGGACTC TCGCCGAGAA CTCCGGCCTA
GACCCCATAG ACATAATTGC GGACCTGAGA GCGGCGCACG AGGATCCCTC GAAGTGGAGC
TACGGTGTTG ACGTCGTAAA CGGTGGAGTA ACCGACATGA TCGCACTCGG AGTCTTCGAG
CCGGCAACCG TCAAGGACCA TGCAATAAAG GTAGCGACGG AGGCTGCAGC AATGATACTG
AGGATCGACG ACATAATCTC CGCGTCTAAA CTGGAAGAGA AGAAGGGCGA AAAGGAAAAG
AAGGAGGAGA AGGAGGAAGA AAAGTCCTCT GAGTTCGACT AA
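
A GC percentage like the one listed for this gene can be recomputed from the sequence text. The helper below is a minimal sketch: the function name `gc_percent` is hypothetical, it ignores the spaces and line breaks used to group the bases, and how the record rounds its own GC content value is not stated here.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and newlines that separate the 10-base groups.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "ATGGCATCGG TGGCACAACT AGGAGGAGTT CCAGTGTTGA TCCTGAAAGA GGGTACTTCG"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of `first_row` gives a figure to compare against the GC content field.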
 
Protein sequence
MASVAQLGGV PVLILKEGTS RQAGREALHL NIMIAKAVAE TVKTTLGPKG MDKMLIDTLG 
DITVSNDGAT ILDEMDVQHP IAKLMVEVAK AQDKEVGDGT TTAVVLTGEL LKEAEKLLEK
NIHPTIIVSG YKKAAEKARE ILASKAIKVD LNDTETLKKV AATSMRSKAV AALRDYFADI
AVKAVKQVAE VVNGKYVVDI DNIQIIKKKG GAFLDTQLIY GIVVDKEVVH PGMPKRVTNA
KIALLDAPLE VEKTEIDAEI RISSPDQMHQ FLEEEEKILR DMVEKIKESG ANVVFCQKGI
DDVAQYYLAK AGILAVRRVK KSDMEKLARA TGARILTRVE DITPEALGRA ELVEERKVAD
EKMVFVEGCP NPKSVTILVR GGFERAVDEA ERSIKDALYA VADVLKHPYI VPGGGAIEAE
LARELRKYAP EVGGKEQLAI EAFANALESI PRTLAENSGL DPIDIIADLR AAHEDPSKWS
YGVDVVNGGV TDMIALGVFE PATVKDHAIK VATEAAAMIL RIDDIISASK LEEKKGEKEK
KEEKEEEKSS EFD
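
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in the conventional T, C, A, G ordering, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove grouping spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listed in this record.
first_row = "ATGGCATCGG TGGCACAACT AGGAGGAGTT CCAGTGTTGA TCCTGAAAGA GGGTACTTCG"
last_row = "AAGGAGGAGA AGGAGGAAGA AAAGTCCTCT GAGTTCGACT AA"

print(translate(first_row))
print(translate(last_row))
```

The two rows translate to the first 20 and last 13 residues of the protein sequence shown above. Running `translate` on the full gene sequence allows the same comparison for the whole protein.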