Gene Mpe_A3390 details

Gene Information

Locus tag: Mpe_A3390
Symbol: hslU
ID: 4786377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3603593
End bp: 3604939
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 70%
IMG OID: 640091966
Product: ATP-dependent protease ATP-binding subunit HslU
Protein accession: YP_001022578
Protein GI: 124268574
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit
TIGRFAM ID: [TIGR00390] ATP-dependent protease HslVU, ATPase subunit
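
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3603593
END_BP = 3604939


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1347 448
```

Both results match the Gene Length and Protein Length fields, which suggests the listed coordinates span the whole CDS including its stop codon.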


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0502353
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCGC AGGAGATCGT GTCCGAGCTG GACCGCCACA TCGTCGGCCA GCAGGACGCC 
AAGCGCGCCG TCGCGATCGC GCTGCGCAAC CGCTGGCGCC GCCAGCAGGT GGACGACAGG
CTGCGCCCGG AGATCACGCC GAAGAACATC CTGATGATCG GGCCCACTGG CGTCGGCAAG
ACCGAGATCG CGCGCCGCCT CGCGAAGCTG GCCGACGCGC CCTTCATCAA GGTGGAGGCC
ACCAAGTTCA CCGAGGTCGG CTATGTCGGC AAGGACGTCG ATTCGATCAT CCGCGACCTG
GTCGACCTGG CCGTCAAGCA GACGCGCGAG GCCGCGATCC GGGCCCATCG CGTCCGCGCC
GAGGATGCCG CCGAGGAACG CATCCTCGAC GTGCTGCTGC CGCCGCCCGC CCATGCCGGC
GCCGGCTTCG GGCTGAGCAC CAGCTCGCCG GCCCCCGCCC CGGCCGACAG CACGGCGCGC
CAGACCTTCC GCAAGCGCCT GCGCGAGGGC ACGCTCGACG ACAAGGAGAT CGAGCTCGAA
CTCGCCGAAC CGCGCGCTGC CGTCGAACTG CTGGGCCCGC CCGGCATGGA AGACATGGCC
GAGCAACTCA AGGGCATGTT CGCCAGCCTG GGCCAGACGC GGCGCAAGAC CCGCAAGCTC
AAGATCGCCG AGGCGTTGAA GCTGCTGGTC GACGAGGAGG CGGCCAAGCT GGTCAACGAG
GACGAGATCA AGACGCAGGC GCTGGCCAGC GCCGAGCAGA ACGGCATCGT CTTCATCGAC
GAGATCGACA AGGTCACGTC GCGCGGCGAC GGCGCCAGCG GCGCCGAGGT CTCGCGCCAG
GGCGTGCAGC GCGACCTGCT GCCGCTGGTG GAGGGCACGA CGGTCAGCAC CAAGCACGGC
ACTGTCAAGA CCGATCACAT CCTGTTCATC GCCTCGGGCG CCTTCCACCT GGCGCGCCCG
AGCGACCTGA TCCCGGAGCT GCAGGGCCGC TTCCCGATCC GGGTGGAGCT GGGCTCGCTG
CGGGTCGAGG ACTTCGAGGC GATCCTGACC CAGACCCACG CCAGCCTGGT ACGCCAGTAC
CAGGCACTGC TGGACACCGA GGGCGTCCGG CTCGACTTCC GGCCCGAGGG TGTGCGCCGG
CTGGCGCAGA TTGCGTTCGA CGTCAACGAG CGCACCGAGA ACATCGGCGC GCGCCGGCTG
TCGACGGTGA TGGAGCGCCT GCTCGACGAG GTGAGCTTCG ATGCGCCGAA CCTGGGCGGC
CAGACGATCG CGATCGACGC CGCCTACGTG GATCGCAAGC TCGGGGCGCT GGCGGTCGAC
GAGGATCTGT CCCGCTTCAT TCTCTGA
 
Protein sequence
MTPQEIVSEL DRHIVGQQDA KRAVAIALRN RWRRQQVDDR LRPEITPKNI LMIGPTGVGK 
TEIARRLAKL ADAPFIKVEA TKFTEVGYVG KDVDSIIRDL VDLAVKQTRE AAIRAHRVRA
EDAAEERILD VLLPPPAHAG AGFGLSTSSP APAPADSTAR QTFRKRLREG TLDDKEIELE
LAEPRAAVEL LGPPGMEDMA EQLKGMFASL GQTRRKTRKL KIAEALKLLV DEEAAKLVNE
DEIKTQALAS AEQNGIVFID EIDKVTSRGD GASGAEVSRQ GVQRDLLPLV EGTTVSTKHG
TVKTDHILFI ASGAFHLARP SDLIPELQGR FPIRVELGSL RVEDFEAILT QTHASLVRQY
QALLDTEGVR LDFRPEGVRR LAQIAFDVNE RTENIGARRL STVMERLLDE VSFDAPNLGG
QTIAIDAAYV DRKLGALAVD EDLSRFIL
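
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they relate, the Python below strips that formatting, computes a GC fraction, and translates DNA codon by codon. It is an illustration rather than the database's own method. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard one, which assigns the same amino acids to sense codons as translation table 11. Table 11 differs in its permitted start codons, and that distinction does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_block = "ATGACCCCGC AGGAGATCGT GTCCGAGCTG GACCGCCACA TCGTCGGCCA GCAGGACGCC"
last_block = "GAGGATCTGT CCCGCTTCAT TCTCTGA"
print(translate(first_block))  # prints: MTPQEIVSELDRHIVGQQDA
print(translate(last_block))   # prints: EDLSRFIL
```

The two printed strings match the first twenty and last eight residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.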