Gene Mext_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2009 
SymbolhslO 
ID5831204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2240345 
End bp2241352 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content71% 
IMG OID641367809 
ProductHsp33-like chaperonin 
Protein accessionYP_001639478 
Protein GI163851435 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1281] Disulfide bond chaperones of the HSP33 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.729287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCG GACACGCCCC TTCTTTCACA CCCTCGCTCG AAGGCGACGA CGACGCCGTT 
CTGCCGTTCG CCGTCGAAGC ACTGGATCTG CGCGGCCGCG CGGTGCGGCT CGGGCCCTCG
ATCGACACCA TCCTGCGCCG CCACGGCTAT CCCGACGCGG TCGCCCGGCT GATCGGCGAG
GCGGCGGCGC TCACCGTGCT GCTCGGTGCC TCCCTGAAGC TCGAAGGCCG CTTCCAGCTC
CAGACCAAGA CCGACGGACC GGTGAACATG CTGGTGGTCG ATTTCGAGGC GCCCGACCGG
GTGCGAGCCA CCGCCCGCTT CGATGCGGAG CCGGTGGCTG CCCTCGGCCC GAAGGCGCGC
GCCGCTGACC TGATGGGTCG TGGGCACCTG GCCATGACCA TCGACCAGGG GCCATCCCAG
AGCCGCTACC AGGGCGTCGT CGCGCTCGAG GGCCAAAGCC TCGAAGAGGC CGCGCACCAG
TATTTCCGCC AATCCGAGCA GATCCCGACG CTGGTCCGCC TCGCCGTCGC CGAGCAGATG
GAGGGCGGGG AGAGCCGCTG GAGAGCCGGC GGCCTGCTGG TGCAGTTCCT GCCGACCTCG
CCCGACCGGA TGCGCCAGGC CGACCTTCCG CCCGGCGACG CGCCGGAGGG CCACGAGATC
CTCACCGGTG GCACCCGCGA CGACGATGCC TGGACCGAGG CGCGCAGCCT CGTGAACACG
GTGGAGGACC ACGAGATCGT CGATCCGGCG GTGTCGAGCG AGCGGTTGCT CTACCGCCTG
TTCCACGAGC GCGGCGTGCG CGTGTTCGAT GCGCAGAGCG TGATCGAGCG CTGCCGCTGC
TCGGAAGAGC GGGTGCTGGG GATGATCCGC TCGTTCTCCG CCGAGGAGCG CCGGGACATG
GTCGCGGATG ACGGCACCGT GTCCATCACC TGCGAGTTCT GCTCGCGCCG CTACGTGCTC
GATCCGGCCG AGGTCGAGCG GGATATCGCG ACCGCGCCGG GGGCGTGA
 
Protein sequence
MSSGHAPSFT PSLEGDDDAV LPFAVEALDL RGRAVRLGPS IDTILRRHGY PDAVARLIGE 
AAALTVLLGA SLKLEGRFQL QTKTDGPVNM LVVDFEAPDR VRATARFDAE PVAALGPKAR
AADLMGRGHL AMTIDQGPSQ SRYQGVVALE GQSLEEAAHQ YFRQSEQIPT LVRLAVAEQM
EGGESRWRAG GLLVQFLPTS PDRMRQADLP PGDAPEGHEI LTGGTRDDDA WTEARSLVNT
VEDHEIVDPA VSSERLLYRL FHERGVRVFD AQSVIERCRC SEERVLGMIR SFSAEERRDM
VADDGTVSIT CEFCSRRYVL DPAEVERDIA TAPGA