Gene Mpal_0032 details


Gene Information

Locus tag: Mpal_0032
Symbol:
ID: 7270144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 30275
End bp: 31930
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 59%
IMG OID: 643568691
Product: thermosome
Protein accession: YP_002465151
Protein GI: 219850719
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
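
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
start_bp = 30275
end_bp = 31930
gene_length_bp = 1656
protein_length_aa = 551


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under those assumptions, 31930 - 30275 + 1 gives 1656 bp, and 1656 / 3 - 1 gives 551 aa, which agrees with the listed values.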


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.810485
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCTCAAC AGCTTGCAGG ACAGCCAATC TTTATTCTTA AGGAAGGGAG TTCACGGACT 
CGTGGACGCG ACGCGCAGGG GAACAACATC AATGCAGCCA AGGCTGTTGC GAATGCAGTC
AGGACCACGC TCGGACCAAA GGGCATGGAC AAGATGCTCG TCGACACCAT CGGTGATGTC
GTAATCACCA ATGACGGTGT CACAATTCTC AAGGAGATGG ACATCGAGCA CCCGGCCGCA
AAGATGATGG TCGAGGTCGC TAAGACCCAG GACGATGAAG TCGGTGACGG AACCACGACC
GCTGTCGTGA TCGCCGGCGA ACTCTTAAAG CGTGCAGAAG ACCTTCTTGA CCAGGACGTT
CACCCAACCG TGATCGCTCA CGGATACCGG ATGGCAGCAG AGAAGGCTCA GGAGATCCTC
GCCGAGATTG CGATCCCGGT GAAGGCCACT GACCTCGCAA TGCTGAAGAA GATCTCAGAG
ACCGCGATGA CCGGCAAGGG TGCAGAGGCT GCCAAGGACA AGCTCTGCGA CCTGGTCGTC
AGGGCAGTCA CGATGGTCGC CGAAGAGGAT GGCACTGTCG ACAAGGACAA CATCAAGGTG
GAGAAGAAGG TCGGCGGTTC GATCCAGGAC TCCGAGATCA TCGAGGGGAT GCTGATCGAC
AAGGAACGCG TCCACCCAGG GATGCCAAAG AAGGTCGTCG GCGCGAAGAT TCTGCTCTTA
AATGCAGCGG TCGAGTTCAA GAAGACCGAA GTCGATGCTG AGATCAACAT CACGAGCCCA
GACCAGCTCC AGTCATTCCT CGACGAGGAA GAGCGGATGA TCCGGACCAT CGTCGAGAAG
ATCATCGCCA GCGGCGCGAA CGTCCTCTTC TGTCAGAAGG GTATCGACGA CATTGCCCAG
CACTACCTTG CGAAGGCAAA GATCTTCGGG GTCCGCCGTG TAAAGAAGAG CGACATGGAG
AAGCTGGCCC GTGCGACCGG TGCCACCATG GTCTCTTCGA TCGACGCGAT CAGCAAGGAC
GAGCTCGGCA CTGCAGGGCT CATCGAGGAG AAGAAGGTCT CCGGCGAAGA GATGATCTTC
GTCACCGAGT GCTCCAACCC CAAGGCGGTC TCGATCATCG TCCGCGGTGG GACCGAGCAC
GTCGTCGACG AGCTCGAGCG TGCGATGGAG GATGCTATCA GGGTCGTCTC CGTCGTCATC
GAGGACAAGA AGCTGGTCGC CGGCGGCGGT TCACCAGAGA CCGAGCTCTC CCAGCGCCTG
AAGATCTATG CGTCCAGCGT CGGTGGCCGC GCACAGCTCG CCATCGAAGC CTTCGCCAGC
GCCCTTGAGA TCATCCCGAG GACCCTTGCG GAGAATGCAG GGCTCGACCC CATCGATATG
CTCGTCGAGC TCCGTGCAGC CCATGAGAAG GGACAGAAGA CCGCAGGTCT CGATGTCTAC
GAAGGCAAGG CAGGGGACAT GCTGGCAGCA GGGGTCATCG AGCCGCTGCG GGTCAAGACC
CAGGCCATCT CCAGCGCTGC AGAGGCAGCT GTGATGATCC TCAGAATCGA CGATGTCATC
GCATCGTCCA AGTCAGCAGC CCCAGAAGGC ATGCCACCAG GTGGAATGGG CGGCATGCCA
CCGGGTATGG GCGGTATGGG TGGCATGGAC TACTGA
 
Protein sequence
MSQQLAGQPI FILKEGSSRT RGRDAQGNNI NAAKAVANAV RTTLGPKGMD KMLVDTIGDV 
VITNDGVTIL KEMDIEHPAA KMMVEVAKTQ DDEVGDGTTT AVVIAGELLK RAEDLLDQDV
HPTVIAHGYR MAAEKAQEIL AEIAIPVKAT DLAMLKKISE TAMTGKGAEA AKDKLCDLVV
RAVTMVAEED GTVDKDNIKV EKKVGGSIQD SEIIEGMLID KERVHPGMPK KVVGAKILLL
NAAVEFKKTE VDAEINITSP DQLQSFLDEE ERMIRTIVEK IIASGANVLF CQKGIDDIAQ
HYLAKAKIFG VRRVKKSDME KLARATGATM VSSIDAISKD ELGTAGLIEE KKVSGEEMIF
VTECSNPKAV SIIVRGGTEH VVDELERAME DAIRVVSVVI EDKKLVAGGG SPETELSQRL
KIYASSVGGR AQLAIEAFAS ALEIIPRTLA ENAGLDPIDM LVELRAAHEK GQKTAGLDVY
EGKAGDMLAA GVIEPLRVKT QAISSAAEAA VMILRIDDVI ASSKSAAPEG MPPGGMGGMP
PGMGGMGGMD Y
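
The gene sequence starts with TTG, yet the protein sequence starts with M. This is consistent with the record's translation table 11, in which TTG can act as an initiator codon and is then read as methionine. The sketch below is a minimal illustration, not the method used to produce this record: `translate_cds` is a hypothetical helper, the codon table is the standard amino acid assignment (which table 11 shares for elongation codons), and `START_CODONS` holds only a commonly used subset of the initiators NCBI lists for table 11.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and
    stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGTCTCAAC AGCTTGCAGG ACAGCCAATC"))  # prints: MSQQLAGQPI
```

The output matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate_cds` is one way to compare the two sequence blocks residue by residue.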