Gene Mpe_A0060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0060 
Symbol 
ID4783817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp55068 
End bp56669 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content72% 
IMG OID640088607 
Productputative signal peptide protein 
Protein accessionYP_001019257 
Protein GI124265253 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0678938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTTT CGATGGCTTC TGACCGACGC CGCCGCTGGC CGCGTGCTCT GCCGCTGGCC 
TTGAGCCTGG CCCTTGCCGC ACAAGGCCTG GCGCCCGCCG CCCTGGCGCA GCAGGGCAGT
CCTGGCTCGC GCAACAACCT GCCGGCCCTG GGCGACAGCG CGTCCGACGA CATCTCGGTA
CCGAACGAGC GCCGCATCGG CGACCGCATC ATGCGCGACA TCCGACGCGA TGCGGATTAT
CTCGACGACC CGATCCTGCT CGAGTACGTG CAGACCATGA TGAGCGCGCT CATCGTGTCG
TCGCGCGAGC GCGGCAACAT CCCGACGGAG CTGGACGAGC GCTTCGCGTG GGAGGCCTTC
CTCGTGCGCG ACCGCACGGT GAACGCCTTC GCGCTGCCGG GTGGCTACAT GGGCGTGCAC
CTGGGCCTGA TGGCCATCAC GGCGACGCCT GCCGAACTGG CCTCGGTGCT GGCGCACGAG
CTGTCGCACG TGACGCAGCG GCACATCGCG CGCAGCGTGG GCGCGAACAA GCTCACGTCT
CTGGCCGGAA TCGCAGGGCT GATCCTCGGC GTGCTCGCGG CCAGTCGCAG CCCGGAGGCC
GCCAATGCGC TGATCACCGG CGGGCAGGCG GTGGCCGTGC AGGGGCAGCT GAACTATTCG
CGCGATGCAG AGCGCGAGGC CGACCGCGTG GGCTTCAACG TACTCACCGG CGCCGGCTAC
TCGCCGGCCG GCATGCCGGC GATGTTCGAG AAGCTGCTGC AGGCCTCGCG CCTGAACGAC
AGCCAGAATT ACCCTTACCT GCGCAGCCAC CCGCTGACCA CCGAACGCAT CGGCGAAGCC
CGCTCGCGCC TCGGTGTGGC CCATACCGCG CCGCCGCCCC GGCCGCTGCT GCACGTGCTG
ATGCAGGCCC GCGCCCGCGT GCTGATGGAC ACGCGCGACA GCGCGCTCCA GCGCGCCCAG
GACTTCGACA CCGAGCGGGC CCTGGCCACC GTCGCCGCGC CCGCCGACCG GCTGGGGGCG
CTGTACGGCA GTGCGCTGGC GTCGATCCTG CGGCGCGATT TCTCGCGCGC CGACGCGGCC
TTGAGCGCTG CACGTCCGCT GGCGGACGGC GACAAGGACG CCATGAATGC CTTGGCGCTG
TTGACGCTGC AAGGTCTGCT GGTGCGCAGC GATGGCATGC GCGCCGACCC CGTGCTCGGC
AGCCTGCTGG CCGCGCCGGG AGGGCAGGAC TCCCGCACGC TGCTGCTTGC GCAGGCGCAA
CGCGCGCTGC TGCCAGGCAG CCCGCCCGAG ATGCAGCGCG CGGCCGTCGA CCGCCTTCAG
ACCTGGGTGG CCGTCCACCG GCAGGATGCG CTGGCCTGGG GCAGCGCTGC GCAACTGTGG
GAACGGCTGG GTCAGCGGCT GCGGGCCGTG CGCGCGGACG CCGAGTCGCG CGCGGCGGTC
GGTGACGTGC AGGGCGCCGT CGAGCGGCTG CGTGCAGCGC AGAAGCTGGC CCGCACGGCG
ACCGGCAGCG ACTTCATCGA GGCCTCGGTG CTCGAATCCC GGCTGCGCGA GCTGGAGCTG
CAGCGCCGGC AGATCATTGC CGAGGAGCGC GGCGAGCTTT GA
 
Protein sequence
MRLSMASDRR RRWPRALPLA LSLALAAQGL APAALAQQGS PGSRNNLPAL GDSASDDISV 
PNERRIGDRI MRDIRRDADY LDDPILLEYV QTMMSALIVS SRERGNIPTE LDERFAWEAF
LVRDRTVNAF ALPGGYMGVH LGLMAITATP AELASVLAHE LSHVTQRHIA RSVGANKLTS
LAGIAGLILG VLAASRSPEA ANALITGGQA VAVQGQLNYS RDAEREADRV GFNVLTGAGY
SPAGMPAMFE KLLQASRLND SQNYPYLRSH PLTTERIGEA RSRLGVAHTA PPPRPLLHVL
MQARARVLMD TRDSALQRAQ DFDTERALAT VAAPADRLGA LYGSALASIL RRDFSRADAA
LSAARPLADG DKDAMNALAL LTLQGLLVRS DGMRADPVLG SLLAAPGGQD SRTLLLAQAQ
RALLPGSPPE MQRAAVDRLQ TWVAVHRQDA LAWGSAAQLW ERLGQRLRAV RADAESRAAV
GDVQGAVERL RAAQKLARTA TGSDFIEASV LESRLRELEL QRRQIIAEER GEL