Gene Information |
Locus tag | Mthe_0916 |
Symbol | |
ID | 4462380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 997221 |
End bp | 998903 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699935 |
Product | thermosome |
Protein accession | YP_843344 |
Protein GI | 116754226 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
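The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(997221, 998903)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both results agree with the Gene Length and Protein Length rows of the record.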
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGCGT TAGGAGGAGT ACCGGTAATC ATACTCAAGG AGGGCACCCA GAGGGAATCC GGCAGGGAGG CGATCGAGAA CAACATAATG GCTGCACGTG CAGTCGCAAA TGCCGTTAAG ACGACTCTGG GGCCTAAAGG CATGGACAAG CTTCTTGTCG ACGCGCTCGG GGACGTCACG ATAACCAATG ATGGTGTCAC AATACTCAGA GAGATGGAGG TGCAGCATCC AGCTGCCAAG ATGGTTGTGG AGGCAGCGAA GACCCAGGAC AAGGAGGTAG GAGATGGCAC CACCACAGTG GCCATACTGA TAGGCGAGCT GCTGAAGCAT GCGAGAGAGC TCATGGAGAA GGGTCTGCAT CCCACGGTGA TTGCGCGAGG CTATAGCATG GCAGCAGAAA AGGCTGTGGA GTATCTTAAC AGCATAGCGC GAGACGTCTC AGAGAAGGAC AGAGCGCTTC TGGAGAAGGT TGCGATAACC GCGATGACTG GAAAGCTCGC TGAGACTCCG AGCCACAAAG TGGCCAGGTA TGCTGTCGAT CTTGTGCTCT CAACAGTTGA CAAGTTTGAT GGGAAAACGG TCGTCGATCT GGACAATGTT ATGGTTGAGA AGAGGGTTGG CGGCGGGATC GAGGACTCTG AGCTAATCCG CGGAGTGATC ATAGACAAGG AGAGGGTCCA CCAGAACATG CCAAGAAGGG TGGAGAACGC GAGGATCGCG CTGCTGAACG TCCCGATCGA GAGGAGAGAC ACGGAGACGA AGGCGGAGAT ATCGATCACA TCCGGAGACC AGTTCCAGCT CTTCATGGAC CATGAGAAGG AGGAGATCAA AAAGGTAGTG GACAAGGTCA TAAGAAGCGG CGCCAATGTT GTCTTCTGCC AGAAGGGCAT CGATGATCTC GCTCAGCACT TCTTGGCTAA AGCGGGGATC ATGGCGTACC GCAGGATAAG AAAGAGCGAT CTCGAGAAGC TCTCCCGCGC CACTGGAGGC AGGCTCATAA CAAACCTGGA TGAGATGAAG CCAGAGGATC TCGGCGAGGC CGCTCTCGTA GAGGAGAGGA TTGTGGGCGC TGGACCCATG ACATTCGTCA CAGGGTGCAA GAATCCGGGG TATCTCTCGC TGATACTCCG CGGCGGCACA CAGCAGGTTG TGGACAGCCT GGAGAGGGCG CTGGATGATG CACTCCACGC GGTCGCAACA GCGATTGAGA GCGGCAGGCT TCTCGCAGGC GGTGGCGCAC CAGAGACTGC TGTGGGCATA AAGCTGAGAG AGTATGCTGC CTCTCTCAAG GGGAGGGAGC AGCTTGCGGT CGAGAAGTTC GCCGAGGCGA TAGAGGTTGT GCCAAAGACG CTAGCAGAGA ACGCTGGCTT CAATCCTATC GACAAGATGG TCGCTCTGAG GAGCAAGCAC GAGAAGTTTG GCAGCACTTA CGGTCTCAAC GCATACACAG GCGAGATCGT GGACATGTGG GATATCGGTG TCGTCGAGCC TCTCAGGGTC AAGGTCCAGG CTATCTACTC CGCGACAGAT GCTGCCTCCC TCATACTGAG AATAGATGAT GTGATCGCTG CCAAGAAGAA GGAAGAAGGC GAGGAGAAGG AGGGCCAGAT GTCGGGAATG GGTGGGATGG GTGGAATGGG AGGAATGGGT GGAATGGGCG GATTTCCACC AGGCATGATG TGA
Protein sequence | MAALGGVPVI ILKEGTQRES GREAIENNIM AARAVANAVK TTLGPKGMDK LLVDALGDVT ITNDGVTILR EMEVQHPAAK MVVEAAKTQD KEVGDGTTTV AILIGELLKH ARELMEKGLH PTVIARGYSM AAEKAVEYLN SIARDVSEKD RALLEKVAIT AMTGKLAETP SHKVARYAVD LVLSTVDKFD GKTVVDLDNV MVEKRVGGGI EDSELIRGVI IDKERVHQNM PRRVENARIA LLNVPIERRD TETKAEISIT SGDQFQLFMD HEKEEIKKVV DKVIRSGANV VFCQKGIDDL AQHFLAKAGI MAYRRIRKSD LEKLSRATGG RLITNLDEMK PEDLGEAALV EERIVGAGPM TFVTGCKNPG YLSLILRGGT QQVVDSLERA LDDALHAVAT AIESGRLLAG GGAPETAVGI KLREYAASLK GREQLAVEKF AEAIEVVPKT LAENAGFNPI DKMVALRSKH EKFGSTYGLN AYTGEIVDMW DIGVVEPLRV KVQAIYSATD AASLILRIDD VIAAKKKEEG EEKEGQMSGM GGMGGMGGMG GMGGFPPGMM
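The gene sequence is listed in blocks of ten bases, starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, translate the sequence, and compute GC content. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meaning). All names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers written for this illustration.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last four blocks of the gene sequence in this record.
first_blocks = "ATGGCTGCGT TAGGAGGAGT ACCGGTAATC"
last_blocks = "GGAATGGGCG GATTTCCACC AGGCATGATG TGA"

print(translate(first_blocks))  # MAALGGVPVI
print(translate(last_blocks))   # GMGGFPPGMM
```

The two outputs match the first and last ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content row.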