Gene Msed_2264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2264 
Symbol 
ID5104216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2165944 
End bp2167605 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content45% 
IMG OID640508161 
Productthermosome 
Protein accessionYP_001192326 
Protein GI146305010 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.802406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAC AAGCTACAGT TGCTACAACT CCAGAAGGTA TTCCTGTAAT TATTTTAAAG 
GAAGGTTCAA GCAGAGCCTT TGGAAAGGAA GCCCTTAGGG CCAACATTGC TGCTGTCAAG
GCAGTAGAAG AGGCGTTGAG GACGACGTAT GGACCTAGAG GTATGGATAA GATGTTAGTA
GACAGTCTTG GAGACATCAC AATCACAAAC GACGGAGCTA CTTTGCTGGA CAAGATGGAT
CTACAGCATC CTGCCGCTAA ACTCTTAGTT CAGATAGCTA AGGGTCAAGA CGAAGAGACA
GCTGACGGAA CTAAGACTGC TGTGATTCTA TCAGGAGAAC TAGTCAGGAA AGCAGAGGAT
CTACTATACA AGGAGGTACA CCCAACCATT ATCATTAGTG GTTATAAGAA GGCTGAGGAA
GTAGCTCTTC AGACTATCCA AGAGATAGCG CAACCAATTA GCATTAATGA TGTTGAGTTG
ATGAAGAAGG TTGCTATGAC CTCATTGAGT AGCAAGGCAG TTGCTGGGTC AAGGGAGTAT
CTAAGCGATG TGGTAGTTAA GGCCGTATCC CAAGTAGCGG AACTTAGAGG GGATAAGTGG
TATGTGGATC TGGACAACAT TCAGATAGTC AAGAAGGCAG GAGGCAGTAT AAACGATACT
CAACTAATAT ATGGAATAAT AGTGGATAAA GAAGTAGTAC ACCCTGGAAT GCCTAAGAGG
GTAGAGAATG CTAAGATAGC TTTGATCGAC GCACCATTAG AGGTAGAAAA GCCAGAGCTA
GACGCTGAGA TTAGGATCAA CGATCCCACC CAGATGGAGA GATTCCTACA GGAAGAAGAG
AACATTATCA AGGAAAAAGT TGATATGATA GCCAAAACAG GGGCAAACGT AATAATTTGC
CAGAAGGGTA TTGATGAGGT AGCCCAGTCT TACTTGGCTA AGAAAGGAAT TCTAGCGGTA
AGGAGAGCAA AGAAGAGCGA TCTAGAGAAG TTAGCAAGAG CCACAGGAGG TAGGGTAGTA
TCCAATATTG AAGAAATATC AGAACAAGAT CTAGGACATG CAGCACTTGT AGAGGAGAGA
AAGATAGGAG AAGATAAGAT GGTCTTTGTG GAGGGAGCCA AGAATCCTAA GGCCATCAGT
ATACTTATCA GAGGAGGTCT AGAAAGAGTG GTAGACGAGA CTGAGAGGGC CTTGAGGGAT
GCCTTAGGAA CAGTCGCTGA CGTCATCAAG GATGGCAGAG CAGTAGCAGG CGGAGGAGCG
GTGGAAATCG AGATTGCAAA GAGGTTGAGG AAGAAGGCTC CACAAGTTGG AGGTAAGGAA
CAGCTAGCAA TAGAGGCTTA CGCTAATGCT CTGGAGAGCC TTGTGATGAT ATTGGTAGAG
AATGCTGGTT TCGATCCTAT AGATCAATTG ATGAAACTAA GGTCCCTCCA TGAGAATGAG
GCAAATAAGT GGTACGGCGT GGATCTCAAT ACAGGTCAAC CCACTGATAA CTGGGCAAGG
GGCGTAATAG AGCCAGCTCT AGTAAAGATG AATGCCATCA AGGCAGCCAC AGAGGCTACC
ACACTGATAC TCAGAATAGA TGATCTAGTA GCTGCGGGTA AGAAGTCTGG TGGAACTGGA
GGAAAGGATA ACAAGTCAGA GAAGCCTTCT GAAGAGGATT AA
 
Protein sequence
MASQATVATT PEGIPVIILK EGSSRAFGKE ALRANIAAVK AVEEALRTTY GPRGMDKMLV 
DSLGDITITN DGATLLDKMD LQHPAAKLLV QIAKGQDEET ADGTKTAVIL SGELVRKAED
LLYKEVHPTI IISGYKKAEE VALQTIQEIA QPISINDVEL MKKVAMTSLS SKAVAGSREY
LSDVVVKAVS QVAELRGDKW YVDLDNIQIV KKAGGSINDT QLIYGIIVDK EVVHPGMPKR
VENAKIALID APLEVEKPEL DAEIRINDPT QMERFLQEEE NIIKEKVDMI AKTGANVIIC
QKGIDEVAQS YLAKKGILAV RRAKKSDLEK LARATGGRVV SNIEEISEQD LGHAALVEER
KIGEDKMVFV EGAKNPKAIS ILIRGGLERV VDETERALRD ALGTVADVIK DGRAVAGGGA
VEIEIAKRLR KKAPQVGGKE QLAIEAYANA LESLVMILVE NAGFDPIDQL MKLRSLHENE
ANKWYGVDLN TGQPTDNWAR GVIEPALVKM NAIKAATEAT TLILRIDDLV AAGKKSGGTG
GKDNKSEKPS EED