Gene Msed_1710 details


Gene Information

Locus tag: Msed_1710
Symbol:
ID: 5105073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1648129
End bp: 1649817
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 49%
IMG OID: 640507604
Product: thermosome
Protein accession: YP_001191789
Protein GI: 146304473
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
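
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1689 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1648129
END_BP = 1649817


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1689 562
```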


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGAG TTCCAGTTCT TCTATTCAAG GAAGGCACTT CAAGGTCAAC CGGCAGGGAT 
GCCCTTAGGA ATAACATACT TGCTGCAAGA ACTTTGGCTG AAATGCTCAG ATCGAGTTTG
GGTCCAAAGG GATTGGACAA GATGCTGATT GACAGCTTCA ACGACGTGAC CATTACCAAC
GACGGAGCTA CAATTGTCAA GGAAATGGAG ATTCAGCATC CAGCTGCCAA GCTTCTAGTT
GAGGCTGCAA AGGCGCAGGA CGCTGAAGTG GGTGACGGGA CAACCAGTGC AGTGGTTCTC
GCAGGCCTTC TCTTGGAGAA GGCAGAGGCC CTTCTAGACC AGAACGTTCA CCCTACCATA
ATTATTGAAG GGTACAAGAA GGCCTTCAAT AAGGCCCTTG AGCTCCTGAC TCAGATTTCC
ACCAAGATAG ATGTTAAGAA CCTTCAGGAT CCTGCAGTTA AGGCCAACCT CAAGAAGATA
GTTTACACCA CAATGGCAAG CAAGTTCATT GCAGAATCTG AGGCCGAAAT GAACAAGATC
ATGGACATAA TCATTGATGC AGTTTCCAAG GTAGCTGAGC CCCTACCCAA CGGTGGTTAC
AATGTGAGCC TTGACCTAGT TAAGATAGAC AAGAAGAAGG GAGGAACAAT AGAGGACAGT
ATCCTAGTCC ACGGTCTAGT TCTTGACAAG GAAGTTGTTC ACCCTGGCAT GCCCAGGAGA
GTAGAGAAGG CCAAGATAGC CGTTCTAGAC GCAGCTCTAG AGGTAGAGAA GCCGGAGATT
TCAGCTAAGA TAAGCATCAC TAGCCCGGAG CAGATTAAGT CCTTCTTAGA CGAGGAGACA
AAGTACCTCA AGGAGATGGT TGACAAGCTG GCCAGCATCG GGGCCAATGT GGTGGTGTGC
CAGAAGGGGA TTGATGATAT AGCTCAGCAC TTCCTTGCCA AGAAGGGAAT CCTGGCTGTT
AGAAGGGTCA AGAGGAGCGA TATTGAGAAG CTAGAGAAGG CGTTGGGAGC TAGGATAATC
AGTAGTATCA AGGATGCAAC TCCCGAGGAC CTAGGCTACG CTGAATTGGT TGAGGAGAGA
AGAATTGGAA ATGACAAGAT GGTCTTCATT GAAGGCGCCA AGAACCCAAG AGCTGTGAAC
ATCCTATTGA GAGGATCCAA TGACATGGCC CTCGATGAGG CCGAGAGAAG TATAAATGAC
GCGCTTCACG CGCTTAGAAA CATCCTATTG GAGCCCATGA TAGTGCCAGG CGGAGGAGCA
ATAGAGGTGG AACTTGCAAT GAAACTGAGG GAATATGCTA GGACAGTTGG AGGAAAGGAG
CAGCTTGCCA TAGAGGCTTA CGCTGATGCC CTAGAGGAGA TCCCAAGCAT ATTGGCTGAA
ACTGCCGGAA TGGAGCCCAT ATCCACCCTA ATGGACCTGA GGGCTAGACA CGTTAAGGGA
ATTGCCAATG CTGGTGTGGA TGTGATAAAC GGAAAGATAG TCGACGACAT GTTCTCCATC
AATGTACTAG AGCCGGTTAG GGTGAAGAGG CAAGTTCTCA AGAGCTCAAC AGAGGCAGCT
ACCTCAGTGC TGAAGATTGA TGATCTAATT GCCGCATCTC AGTTGAAGTC CGAGGGTGGC
AAGGGTAAGA CTCCTGGCGG AGAAGAAGGA GAAGGAGCTG GAATGGGAGG AGCTCCTTCC
TTCGGCTAA
 
Protein sequence
MAGVPVLLFK EGTSRSTGRD ALRNNILAAR TLAEMLRSSL GPKGLDKMLI DSFNDVTITN 
DGATIVKEME IQHPAAKLLV EAAKAQDAEV GDGTTSAVVL AGLLLEKAEA LLDQNVHPTI
IIEGYKKAFN KALELLTQIS TKIDVKNLQD PAVKANLKKI VYTTMASKFI AESEAEMNKI
MDIIIDAVSK VAEPLPNGGY NVSLDLVKID KKKGGTIEDS ILVHGLVLDK EVVHPGMPRR
VEKAKIAVLD AALEVEKPEI SAKISITSPE QIKSFLDEET KYLKEMVDKL ASIGANVVVC
QKGIDDIAQH FLAKKGILAV RRVKRSDIEK LEKALGARII SSIKDATPED LGYAELVEER
RIGNDKMVFI EGAKNPRAVN ILLRGSNDMA LDEAERSIND ALHALRNILL EPMIVPGGGA
IEVELAMKLR EYARTVGGKE QLAIEAYADA LEEIPSILAE TAGMEPISTL MDLRARHVKG
IANAGVDVIN GKIVDDMFSI NVLEPVRVKR QVLKSSTEAA TSVLKIDDLI AASQLKSEGG
KGKTPGGEEG EGAGMGGAPS FG
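
As a rough illustration of how the two sequences relate, the sketch below translates a nucleotide string into protein and computes its GC fraction. It is a minimal example, not the method used to produce this record. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCTGGAG TTCCAGTTCT TCTATTCAAG"))  # prints: MAGVPVLLFK
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in this record, and `gc_content` on the same input can be compared with the GC content field.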