Gene Mpe_A2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2014 
Symbol 
ID4784234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2159223 
End bp2160188 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content72% 
IMG OID640090584 
Productputative redox regulated molecular chaperone heat-shock-like protein 
Protein accessionYP_001021207 
Protein GI124267203 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1281] Disulfide bond chaperones of the HSP33 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.458338 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC TCCATAAATT CATCTTCGAG GGCCTGCCGG TGCGCGGCAT GCTGGTGCGT 
TTGACCGGCG CGTGGACCGA ACTGCTGGCA CGCCGGGGGA CAGAGCGGGC GCATCCGGCG
CCGGTGCGCA CGCTGCTCGG CGAGATGGCG GCCGCCGGGG TGCTGATGCA GGCCAGCATC
AAGTTCAACG GCGCACTGGT GCTGCAGATC TCGGGCGACG GGCCGGTGAA GCTGGCGGTG
GCCGAGGTGC AGCCCGACCT GGCGCTGCGG GCCACGGCCA CGGTGGTCGG CGACGTGCCG
GCCGGCGCGC GGCTGGAGGC GCTGGTCAAC GTGGGCGGGC GCGGCCATTG CGCGATCACG
CTGGACCCCA AGGACCGCTA CCCGGGCCAG CAGCCCTATC AGGGCGTGGT GCCGCTGCAT
GGTGACCGGC GCGAGCCGCT GCAGCAGCTG TCGGAGGTGC TGGAGCACTA CATGCTGCAG
TCGGAGCAGC TCGACACCAA GCTCGTGCTG GCGGCGAACG ACGACGTGGC CGCCGGCCTG
CTGATCCAGC GCCTGCCGGT CGAGGGCGAA GGCAACCTCG GCGCGCGGAA CGAGGACGAG
ATCGGCCTCA ACGAGGCCTA CAACCGCATC GCCCACCTCA GTGCGACGCT GACGCGCGAG
GAGTTGCTGA CGCTGGACGC CGACACCCTG CTGCGGCGGC TGTTCTGGGA GGAGACCGTG
CGCCGCTTCG AGCCGCTGAC CGGCGAGCAC GGGCCGCGCT TCGCCTGCAG CTGCTCGCGG
GAGCGCGTGG CGCGCATGCT GCGCGGCCTG GGGCGCGAGG AGTTCGACGG CCTGATCGCC
GAGCGCGGGC TGGCCGAGGT GGGCTGCGAG TTCTGTGGCG CCCAGTACCA CTTCGATGCG
GTCGACGGCG GCGAGGTCTT CACGGCGCCC CGCGACCAGC CGCCGGCCTC GCGCGCCGTG
CAGTAG
 
Protein sequence
MSELHKFIFE GLPVRGMLVR LTGAWTELLA RRGTERAHPA PVRTLLGEMA AAGVLMQASI 
KFNGALVLQI SGDGPVKLAV AEVQPDLALR ATATVVGDVP AGARLEALVN VGGRGHCAIT
LDPKDRYPGQ QPYQGVVPLH GDRREPLQQL SEVLEHYMLQ SEQLDTKLVL AANDDVAAGL
LIQRLPVEGE GNLGARNEDE IGLNEAYNRI AHLSATLTRE ELLTLDADTL LRRLFWEETV
RRFEPLTGEH GPRFACSCSR ERVARMLRGL GREEFDGLIA ERGLAEVGCE FCGAQYHFDA
VDGGEVFTAP RDQPPASRAV Q