Gene Information |
Locus tag | Pcal_1052 |
Symbol | |
ID | 4909321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 992418 |
End bp | 994067 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640124804 |
Product | thermosome subunit |
Protein accession | YP_001055943 |
Protein GI | 126459665 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
|
|
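The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 992418
end_bp = 994067
reported_gene_length = 1650   # bp
reported_protein_length = 549  # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.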
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0241449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG CAGTGTTGAC TCAGATAGGT GGAGTTCCAG TGCTGGTGCT CAAGGAGGGC ACACAACGGG CGTTTGGGAA AGAGGCGCTT AGGCTCAATA TAATGATTGC GAGGGCAATT GCCGAGGTCA TGAGGACGAC GCTTGGGCCA AAGGGGATGG ACAAGATGCT CATAGACTCG CTTGGCGATA TAACTATCAC AAACGACGGC GCCACGATCC TAGACGAGAT GGATGTACAA CACCCCATCG CCAAGCTCCT TGTTGAGATT TCTAAGTCGC AGGAGGAGGA GGCGGGAGAC GGCACTACGA CCGCGGTGGT GCTCGCCGGC GCGCTTCTTG AGGAGGCTGA GAAGCTTCTA GAGAAGAACA TCCACCCGAC GGTAATTGTA AGCGGCTTCA AGAAGGCGCT TGACGTAGCC ACTGAGCATT TGAGAAAAGT GGCCGTGCCG GTGAATAGGA GCGACGTAGA TACGCTTAAG AAGATCGCCA TGACTTCCAT GGGCGGCAAG ATAAGCGAGA CTGTGAAGGA GTATTTCGCT GACTTGGCCG TGAGGGCCGT ACTGCAAGTT GCCGAGGAGA GGAATGGGAA GTGGTACGTG GACTTGGACA ACATCCAAAT TGTGAAAAAG CACGGGGCCT CCCTCCTTGA CACACAGCTA GTGTACGGCA TTGTGATAGA CAAGGAGGTT GTACACGCCG CTATGCCGAA GCGCGTGGTA AACGCCAAGA TAGCCCTCTT AGATGCGCCT CTTGAGGTGG AGAAGCCCGA GATAGATGCG GAGATCAGAA TCAACGATCC GACGCAGATG AGGGCCTTCT TGGAGGAGGA GGAGAAGATA CTGAAGGGCT ATGTCGACAA GCTGAAGTCC CTCGGCGTAA CTGCCCTGTT TACCACCAAG GGAATTGACG ACATAGCGCA GTACTACTTG GCCAAGGCCG GGATCTTGGC CGTGAGGAGA GTGAAGCGTA GCGACATTGA GAAACTGGTG AGGGCCACCG GCGCCCGCCT TGTCACAAGC CTCGAAGACC TCACAGAGGC AGACCTAGGC TTCGCCGGCT TGGTGGAAGA GCGCCGCGTG GGAGATGAGA AGATGGTGTT CGTGGAGCAG TGTAAGAACC CGCGCGCGGT GTCCATATTG GTGCGCGGCG GCTTTGAGAG GCTCGTGGAC GAGGCTGAGA GAAATCTCGA CGACGCCCTA TCTGTAGTTG CCGACGTCGT AGAAGAGCCG TACATACTGC CGGCAGGAGG CGCAGCGGAG ATCGAGGCCG CCAAGGCTGT TAGAGCGTTT GCCCCCAAGG TAGGCGGCAG AGAGCAGTAC GCAGTTGAGG CCTTCGCAAG AGCCCTAGAG GCAATACCCA AGGCACTTGC AGAAAACGCC GGCCTCGACC CCATCGACAT ATTGACAGAG CTGACTCACA AGCACGAGCA GCCAGACGGA TGGAGATACG GCCTAGACGT CTACCAAGGC AAAGTCGTGG ACATGATGAG CCTTGGCCTA ATCGAGCCGC TTACGGTAAA GATAAACGCG CTTAAAGTGG CCGTCGAGGC CGCCAGCATG ATCCTGAGAA TAGACGAGAT AATCGCGGCC TCTAAGCTGG AGAAAGAAGA GAAAGAAAAG AAGGAGGAGA AGAAGGAGGA ATTCGACTAA
|
Protein sequence | MSQAVLTQIG GVPVLVLKEG TQRAFGKEAL RLNIMIARAI AEVMRTTLGP KGMDKMLIDS LGDITITNDG ATILDEMDVQ HPIAKLLVEI SKSQEEEAGD GTTTAVVLAG ALLEEAEKLL EKNIHPTVIV SGFKKALDVA TEHLRKVAVP VNRSDVDTLK KIAMTSMGGK ISETVKEYFA DLAVRAVLQV AEERNGKWYV DLDNIQIVKK HGASLLDTQL VYGIVIDKEV VHAAMPKRVV NAKIALLDAP LEVEKPEIDA EIRINDPTQM RAFLEEEEKI LKGYVDKLKS LGVTALFTTK GIDDIAQYYL AKAGILAVRR VKRSDIEKLV RATGARLVTS LEDLTEADLG FAGLVEERRV GDEKMVFVEQ CKNPRAVSIL VRGGFERLVD EAERNLDDAL SVVADVVEEP YILPAGGAAE IEAAKAVRAF APKVGGREQY AVEAFARALE AIPKALAENA GLDPIDILTE LTHKHEQPDG WRYGLDVYQG KVVDMMSLGL IEPLTVKINA LKVAVEAASM ILRIDEIIAA SKLEKEEKEK KEEKKEEFD
|
| |
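The gene and protein sequences above are displayed in space-separated blocks of ten residues. The sketch below shows one way to strip that grouping, compute GC content, and translate the nucleotide sequence. It is a plain-Python illustration, not the database's own pipeline: the function names are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares for sense codons), and only the first 30 bases of the gene sequence are used here for brevity.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence shown above.
gene_prefix = "ATGAGCCAGG CAGTGTTGAC TCAGATAGGT"
protein_prefix = translate(gene_prefix)
print(protein_prefix)           # MSQAVLTQIG
print(gc_percent(gene_prefix))  # 50.0
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.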