Gene Information |
Locus tag | Msed_1710 |
Symbol | |
ID | 5105073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1648129 |
End bp | 1649817 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507604 |
Product | thermosome |
Protein accession | YP_001191789 |
Protein GI | 146304473 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
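The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers written for this illustration and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1648129, 1649817)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) rows.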
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTGGAG TTCCAGTTCT TCTATTCAAG GAAGGCACTT CAAGGTCAAC CGGCAGGGAT GCCCTTAGGA ATAACATACT TGCTGCAAGA ACTTTGGCTG AAATGCTCAG ATCGAGTTTG GGTCCAAAGG GATTGGACAA GATGCTGATT GACAGCTTCA ACGACGTGAC CATTACCAAC GACGGAGCTA CAATTGTCAA GGAAATGGAG ATTCAGCATC CAGCTGCCAA GCTTCTAGTT GAGGCTGCAA AGGCGCAGGA CGCTGAAGTG GGTGACGGGA CAACCAGTGC AGTGGTTCTC GCAGGCCTTC TCTTGGAGAA GGCAGAGGCC CTTCTAGACC AGAACGTTCA CCCTACCATA ATTATTGAAG GGTACAAGAA GGCCTTCAAT AAGGCCCTTG AGCTCCTGAC TCAGATTTCC ACCAAGATAG ATGTTAAGAA CCTTCAGGAT CCTGCAGTTA AGGCCAACCT CAAGAAGATA GTTTACACCA CAATGGCAAG CAAGTTCATT GCAGAATCTG AGGCCGAAAT GAACAAGATC ATGGACATAA TCATTGATGC AGTTTCCAAG GTAGCTGAGC CCCTACCCAA CGGTGGTTAC AATGTGAGCC TTGACCTAGT TAAGATAGAC AAGAAGAAGG GAGGAACAAT AGAGGACAGT ATCCTAGTCC ACGGTCTAGT TCTTGACAAG GAAGTTGTTC ACCCTGGCAT GCCCAGGAGA GTAGAGAAGG CCAAGATAGC CGTTCTAGAC GCAGCTCTAG AGGTAGAGAA GCCGGAGATT TCAGCTAAGA TAAGCATCAC TAGCCCGGAG CAGATTAAGT CCTTCTTAGA CGAGGAGACA AAGTACCTCA AGGAGATGGT TGACAAGCTG GCCAGCATCG GGGCCAATGT GGTGGTGTGC CAGAAGGGGA TTGATGATAT AGCTCAGCAC TTCCTTGCCA AGAAGGGAAT CCTGGCTGTT AGAAGGGTCA AGAGGAGCGA TATTGAGAAG CTAGAGAAGG CGTTGGGAGC TAGGATAATC AGTAGTATCA AGGATGCAAC TCCCGAGGAC CTAGGCTACG CTGAATTGGT TGAGGAGAGA AGAATTGGAA ATGACAAGAT GGTCTTCATT GAAGGCGCCA AGAACCCAAG AGCTGTGAAC ATCCTATTGA GAGGATCCAA TGACATGGCC CTCGATGAGG CCGAGAGAAG TATAAATGAC GCGCTTCACG CGCTTAGAAA CATCCTATTG GAGCCCATGA TAGTGCCAGG CGGAGGAGCA ATAGAGGTGG AACTTGCAAT GAAACTGAGG GAATATGCTA GGACAGTTGG AGGAAAGGAG CAGCTTGCCA TAGAGGCTTA CGCTGATGCC CTAGAGGAGA TCCCAAGCAT ATTGGCTGAA ACTGCCGGAA TGGAGCCCAT ATCCACCCTA ATGGACCTGA GGGCTAGACA CGTTAAGGGA ATTGCCAATG CTGGTGTGGA TGTGATAAAC GGAAAGATAG TCGACGACAT GTTCTCCATC AATGTACTAG AGCCGGTTAG GGTGAAGAGG CAAGTTCTCA AGAGCTCAAC AGAGGCAGCT ACCTCAGTGC TGAAGATTGA TGATCTAATT GCCGCATCTC AGTTGAAGTC CGAGGGTGGC AAGGGTAAGA CTCCTGGCGG AGAAGAAGGA GAAGGAGCTG GAATGGGAGG AGCTCCTTCC TTCGGCTAA
Protein sequence | MAGVPVLLFK EGTSRSTGRD ALRNNILAAR TLAEMLRSSL GPKGLDKMLI DSFNDVTITN DGATIVKEME IQHPAAKLLV EAAKAQDAEV GDGTTSAVVL AGLLLEKAEA LLDQNVHPTI IIEGYKKAFN KALELLTQIS TKIDVKNLQD PAVKANLKKI VYTTMASKFI AESEAEMNKI MDIIIDAVSK VAEPLPNGGY NVSLDLVKID KKKGGTIEDS ILVHGLVLDK EVVHPGMPRR VEKAKIAVLD AALEVEKPEI SAKISITSPE QIKSFLDEET KYLKEMVDKL ASIGANVVVC QKGIDDIAQH FLAKKGILAV RRVKRSDIEK LEKALGARII SSIKDATPED LGYAELVEER RIGNDKMVFI EGAKNPRAVN ILLRGSNDMA LDEAERSIND ALHALRNILL EPMIVPGGGA IEVELAMKLR EYARTVGGKE QLAIEAYADA LEEIPSILAE TAGMEPISTL MDLRARHVKG IANAGVDVIN GKIVDDMFSI NVLEPVRVKR QVLKSSTEAA TSVLKIDDLI AASQLKSEGG KGKTPGGEEG EGAGMGGAPS FG