Gene Pcal_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_1052 
Symbol 
ID4909321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp992418 
End bp994067 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content56% 
IMG OID640124804 
Productthermosome subunit 
Protein accessionYP_001055943 
Protein GI126459665 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0241449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG CAGTGTTGAC TCAGATAGGT GGAGTTCCAG TGCTGGTGCT CAAGGAGGGC 
ACACAACGGG CGTTTGGGAA AGAGGCGCTT AGGCTCAATA TAATGATTGC GAGGGCAATT
GCCGAGGTCA TGAGGACGAC GCTTGGGCCA AAGGGGATGG ACAAGATGCT CATAGACTCG
CTTGGCGATA TAACTATCAC AAACGACGGC GCCACGATCC TAGACGAGAT GGATGTACAA
CACCCCATCG CCAAGCTCCT TGTTGAGATT TCTAAGTCGC AGGAGGAGGA GGCGGGAGAC
GGCACTACGA CCGCGGTGGT GCTCGCCGGC GCGCTTCTTG AGGAGGCTGA GAAGCTTCTA
GAGAAGAACA TCCACCCGAC GGTAATTGTA AGCGGCTTCA AGAAGGCGCT TGACGTAGCC
ACTGAGCATT TGAGAAAAGT GGCCGTGCCG GTGAATAGGA GCGACGTAGA TACGCTTAAG
AAGATCGCCA TGACTTCCAT GGGCGGCAAG ATAAGCGAGA CTGTGAAGGA GTATTTCGCT
GACTTGGCCG TGAGGGCCGT ACTGCAAGTT GCCGAGGAGA GGAATGGGAA GTGGTACGTG
GACTTGGACA ACATCCAAAT TGTGAAAAAG CACGGGGCCT CCCTCCTTGA CACACAGCTA
GTGTACGGCA TTGTGATAGA CAAGGAGGTT GTACACGCCG CTATGCCGAA GCGCGTGGTA
AACGCCAAGA TAGCCCTCTT AGATGCGCCT CTTGAGGTGG AGAAGCCCGA GATAGATGCG
GAGATCAGAA TCAACGATCC GACGCAGATG AGGGCCTTCT TGGAGGAGGA GGAGAAGATA
CTGAAGGGCT ATGTCGACAA GCTGAAGTCC CTCGGCGTAA CTGCCCTGTT TACCACCAAG
GGAATTGACG ACATAGCGCA GTACTACTTG GCCAAGGCCG GGATCTTGGC CGTGAGGAGA
GTGAAGCGTA GCGACATTGA GAAACTGGTG AGGGCCACCG GCGCCCGCCT TGTCACAAGC
CTCGAAGACC TCACAGAGGC AGACCTAGGC TTCGCCGGCT TGGTGGAAGA GCGCCGCGTG
GGAGATGAGA AGATGGTGTT CGTGGAGCAG TGTAAGAACC CGCGCGCGGT GTCCATATTG
GTGCGCGGCG GCTTTGAGAG GCTCGTGGAC GAGGCTGAGA GAAATCTCGA CGACGCCCTA
TCTGTAGTTG CCGACGTCGT AGAAGAGCCG TACATACTGC CGGCAGGAGG CGCAGCGGAG
ATCGAGGCCG CCAAGGCTGT TAGAGCGTTT GCCCCCAAGG TAGGCGGCAG AGAGCAGTAC
GCAGTTGAGG CCTTCGCAAG AGCCCTAGAG GCAATACCCA AGGCACTTGC AGAAAACGCC
GGCCTCGACC CCATCGACAT ATTGACAGAG CTGACTCACA AGCACGAGCA GCCAGACGGA
TGGAGATACG GCCTAGACGT CTACCAAGGC AAAGTCGTGG ACATGATGAG CCTTGGCCTA
ATCGAGCCGC TTACGGTAAA GATAAACGCG CTTAAAGTGG CCGTCGAGGC CGCCAGCATG
ATCCTGAGAA TAGACGAGAT AATCGCGGCC TCTAAGCTGG AGAAAGAAGA GAAAGAAAAG
AAGGAGGAGA AGAAGGAGGA ATTCGACTAA
 
Protein sequence
MSQAVLTQIG GVPVLVLKEG TQRAFGKEAL RLNIMIARAI AEVMRTTLGP KGMDKMLIDS 
LGDITITNDG ATILDEMDVQ HPIAKLLVEI SKSQEEEAGD GTTTAVVLAG ALLEEAEKLL
EKNIHPTVIV SGFKKALDVA TEHLRKVAVP VNRSDVDTLK KIAMTSMGGK ISETVKEYFA
DLAVRAVLQV AEERNGKWYV DLDNIQIVKK HGASLLDTQL VYGIVIDKEV VHAAMPKRVV
NAKIALLDAP LEVEKPEIDA EIRINDPTQM RAFLEEEEKI LKGYVDKLKS LGVTALFTTK
GIDDIAQYYL AKAGILAVRR VKRSDIEKLV RATGARLVTS LEDLTEADLG FAGLVEERRV
GDEKMVFVEQ CKNPRAVSIL VRGGFERLVD EAERNLDDAL SVVADVVEEP YILPAGGAAE
IEAAKAVRAF APKVGGREQY AVEAFARALE AIPKALAENA GLDPIDILTE LTHKHEQPDG
WRYGLDVYQG KVVDMMSLGL IEPLTVKINA LKVAVEAASM ILRIDEIIAA SKLEKEEKEK
KEEKKEEFD