Gene Information |
Locus tag | Msed_2264 |
Symbol | |
ID | 5104216 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2165944 |
End bp | 2167605 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640508161 |
Product | thermosome |
Protein accession | YP_001192326 |
Protein GI | 146305010 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
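
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2165944
end_bp = 2167605

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1662 bp) and Protein Length (553 aa).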
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.802406 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAC AAGCTACAGT TGCTACAACT CCAGAAGGTA TTCCTGTAAT TATTTTAAAG GAAGGTTCAA GCAGAGCCTT TGGAAAGGAA GCCCTTAGGG CCAACATTGC TGCTGTCAAG GCAGTAGAAG AGGCGTTGAG GACGACGTAT GGACCTAGAG GTATGGATAA GATGTTAGTA GACAGTCTTG GAGACATCAC AATCACAAAC GACGGAGCTA CTTTGCTGGA CAAGATGGAT CTACAGCATC CTGCCGCTAA ACTCTTAGTT CAGATAGCTA AGGGTCAAGA CGAAGAGACA GCTGACGGAA CTAAGACTGC TGTGATTCTA TCAGGAGAAC TAGTCAGGAA AGCAGAGGAT CTACTATACA AGGAGGTACA CCCAACCATT ATCATTAGTG GTTATAAGAA GGCTGAGGAA GTAGCTCTTC AGACTATCCA AGAGATAGCG CAACCAATTA GCATTAATGA TGTTGAGTTG ATGAAGAAGG TTGCTATGAC CTCATTGAGT AGCAAGGCAG TTGCTGGGTC AAGGGAGTAT CTAAGCGATG TGGTAGTTAA GGCCGTATCC CAAGTAGCGG AACTTAGAGG GGATAAGTGG TATGTGGATC TGGACAACAT TCAGATAGTC AAGAAGGCAG GAGGCAGTAT AAACGATACT CAACTAATAT ATGGAATAAT AGTGGATAAA GAAGTAGTAC ACCCTGGAAT GCCTAAGAGG GTAGAGAATG CTAAGATAGC TTTGATCGAC GCACCATTAG AGGTAGAAAA GCCAGAGCTA GACGCTGAGA TTAGGATCAA CGATCCCACC CAGATGGAGA GATTCCTACA GGAAGAAGAG AACATTATCA AGGAAAAAGT TGATATGATA GCCAAAACAG GGGCAAACGT AATAATTTGC CAGAAGGGTA TTGATGAGGT AGCCCAGTCT TACTTGGCTA AGAAAGGAAT TCTAGCGGTA AGGAGAGCAA AGAAGAGCGA TCTAGAGAAG TTAGCAAGAG CCACAGGAGG TAGGGTAGTA TCCAATATTG AAGAAATATC AGAACAAGAT CTAGGACATG CAGCACTTGT AGAGGAGAGA AAGATAGGAG AAGATAAGAT GGTCTTTGTG GAGGGAGCCA AGAATCCTAA GGCCATCAGT ATACTTATCA GAGGAGGTCT AGAAAGAGTG GTAGACGAGA CTGAGAGGGC CTTGAGGGAT GCCTTAGGAA CAGTCGCTGA CGTCATCAAG GATGGCAGAG CAGTAGCAGG CGGAGGAGCG GTGGAAATCG AGATTGCAAA GAGGTTGAGG AAGAAGGCTC CACAAGTTGG AGGTAAGGAA CAGCTAGCAA TAGAGGCTTA CGCTAATGCT CTGGAGAGCC TTGTGATGAT ATTGGTAGAG AATGCTGGTT TCGATCCTAT AGATCAATTG ATGAAACTAA GGTCCCTCCA TGAGAATGAG GCAAATAAGT GGTACGGCGT GGATCTCAAT ACAGGTCAAC CCACTGATAA CTGGGCAAGG GGCGTAATAG AGCCAGCTCT AGTAAAGATG AATGCCATCA AGGCAGCCAC AGAGGCTACC ACACTGATAC TCAGAATAGA TGATCTAGTA GCTGCGGGTA AGAAGTCTGG TGGAACTGGA GGAAAGGATA ACAAGTCAGA GAAGCCTTCT GAAGAGGATT AA
|
Protein sequence | MASQATVATT PEGIPVIILK EGSSRAFGKE ALRANIAAVK AVEEALRTTY GPRGMDKMLV DSLGDITITN DGATLLDKMD LQHPAAKLLV QIAKGQDEET ADGTKTAVIL SGELVRKAED LLYKEVHPTI IISGYKKAEE VALQTIQEIA QPISINDVEL MKKVAMTSLS SKAVAGSREY LSDVVVKAVS QVAELRGDKW YVDLDNIQIV KKAGGSINDT QLIYGIIVDK EVVHPGMPKR VENAKIALID APLEVEKPEL DAEIRINDPT QMERFLQEEE NIIKEKVDMI AKTGANVIIC QKGIDEVAQS YLAKKGILAV RRAKKSDLEK LARATGGRVV SNIEEISEQD LGHAALVEER KIGEDKMVFV EGAKNPKAIS ILIRGGLERV VDETERALRD ALGTVADVIK DGRAVAGGGA VEIEIAKRLR KKAPQVGGKE QLAIEAYANA LESLVMILVE NAGFDPIDQL MKLRSLHENE ANKWYGVDLN TGQPTDNWAR GVIEPALVKM NAIKAATEAT TLILRIDDLV AAGKKSGGTG GKDNKSEKPS EED
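
As a quick consistency check between the two sequences, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch: the `translate` helper is hypothetical, the codon-to-amino-acid assignments are those of the standard genetic code (which translation table 11 shares for elongation codons), and only short excerpts copied from the record are used.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


# First 30 and last 12 bases of the gene sequence, copied from the record.
head = "ATGGCTTCAC AAGCTACAGT TGCTACAACT"
tail = "GAAGAGGATT AA"

print(translate(head))
print(translate(tail))
```

The translated excerpts correspond to the start (MASQATVATT) and end (EED, followed by a stop codon) of the listed protein sequence, which suggests that the gene sequence is given in coding orientation even though the gene lies on the minus strand.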