Gene Msil_0830 details


Gene Information

Locus tag: Msil_0830
Symbol: hslO
ID: 7092688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 914287
End bp: 915264
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 69%
IMG OID: 643464167
Product: Hsp33-like chaperonin
Protein accession: YP_002361162
Protein GI: 217977015
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1281] Disulfide bond chaperones of the HSP33 family
TIGRFAM ID:
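
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive (an assumption, though the numbers agree with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 914287
end_bp = 915264
gene_length_bp = 978
protein_length_aa = 325

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
residues = codons - 1

print(span, remainder, residues)  # prints "978 0 325"
```

Under these assumptions the computed span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.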


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0214354
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCT GGCGGCCTGA CAACGCGCCG CAATCGCGGG TCGTTTCGGA TCGCGGGCAG 
GACGACACCG TGCTGCCCTT CGCGGTCGAG CGGCTCGATG TGCGCGGCAG GATCGTGCGG
CTTGGCCCCA GCGTCGACGA GATCTTGAAG CGGCACGCTT ATCCTCCTGC CGTTGCGCGC
ATCCTTGGCG AGGCGCTGGC GCTGACCGTG ATGCTTGGAT CGTCGCTGAA AATCGCCGGC
CGCTTTCAGT TGCAGACGCG CGGCGACGGA CCGGTCGACA TGGTGGTGGT CGATTTCGAC
GCCCCGGACC GGCTGCGCGC CTTTGCGCGG TTCGACGCCG CCCGGCTGAG TCAGGCGCCG
TCCGGCGCCG ATCTGCTTGG CGCCGGCCAT CTCGCCTTCA CGATCGACCA GGGCGCCGAG
GCGGCGCGCT ATCAGGGCGT CGTCGCGCTG ACCGGCCAGG GACTGGAGGA AGCCGCGCAT
CAATATTTCC GCCAGTCCGA GCAGATCCCG ACGCAGGTGC GTCTCGCCGT GGCGCAGCAT
GTGACGACCG AGGGGGTAAG CTGGCGCGCC GGCGGCCTTT TGGTGCAATT CTTGCCGAGC
GCCTCCGAAC GGCGCGGGCC GATCGACCTT CCTCCCGGCG ACGCGCCGGC CAGCGCCGCC
TACGAGCCCC CGCGCGAGGA CGACGCCTGG ACCGAGGCGA AGGCTCTGGT TGGGACCGTC
GAGGACCACG AGCTGGTTGA TCCGACGCTC TCCAGCGAGC GGCTGCTCTA CCGCCTGTTC
CACGAGCCGG GCGTCAAAGT GTTCGAGCCG CAGGGCGTGC GCGACGCCTG CCGCTGCTCG
GACGAGAGCG TCAGAAACAT GCTGCTCGGC TTTTCGCCGC AGGAGCGGGA GGAAATGGTC
GGCGACGACG GTCGGATTGG CGTCACCTGC GAATTCTGCT CGACATTTCG GGCCTTCGAT
CCGTCAGATC TTGGCTGA
 
Protein sequence
MSGWRPDNAP QSRVVSDRGQ DDTVLPFAVE RLDVRGRIVR LGPSVDEILK RHAYPPAVAR 
ILGEALALTV MLGSSLKIAG RFQLQTRGDG PVDMVVVDFD APDRLRAFAR FDAARLSQAP
SGADLLGAGH LAFTIDQGAE AARYQGVVAL TGQGLEEAAH QYFRQSEQIP TQVRLAVAQH
VTTEGVSWRA GGLLVQFLPS ASERRGPIDL PPGDAPASAA YEPPREDDAW TEAKALVGTV
EDHELVDPTL SSERLLYRLF HEPGVKVFEP QGVRDACRCS DESVRNMLLG FSPQEREEMV
GDDGRIGVTC EFCSTFRAFD PSDLG
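
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a hedged sketch rather than the method the database itself used: the names `translate` and `gc_fraction` are hypothetical helpers, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted; this gene begins with ATG, so that difference does not come into play here.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of bases.
# Codon-to-residue assignments are the same in the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as grouped in this record.
first_bases = "ATGAGCGGCT GGCGGCCTGA CAACGCGCCG"
print(translate(first_bases))  # prints "MSGWRPDNAP"

# Last 18 bases of the gene sequence, ending in the TGA stop codon.
last_bases = "CCGTCAGATC TTGGCTGA"
print(translate(last_bases))  # prints "PSDLG"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the same input gives a value that can be compared with the GC content field.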