Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0060 |
Symbol | |
ID | 4783817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 55068 |
End bp | 56669 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640088607 |
Product | putative signal peptide protein |
Protein accession | YP_001019257 |
Protein GI | 124265253 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0678938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTTT CGATGGCTTC TGACCGACGC CGCCGCTGGC CGCGTGCTCT GCCGCTGGCC TTGAGCCTGG CCCTTGCCGC ACAAGGCCTG GCGCCCGCCG CCCTGGCGCA GCAGGGCAGT CCTGGCTCGC GCAACAACCT GCCGGCCCTG GGCGACAGCG CGTCCGACGA CATCTCGGTA CCGAACGAGC GCCGCATCGG CGACCGCATC ATGCGCGACA TCCGACGCGA TGCGGATTAT CTCGACGACC CGATCCTGCT CGAGTACGTG CAGACCATGA TGAGCGCGCT CATCGTGTCG TCGCGCGAGC GCGGCAACAT CCCGACGGAG CTGGACGAGC GCTTCGCGTG GGAGGCCTTC CTCGTGCGCG ACCGCACGGT GAACGCCTTC GCGCTGCCGG GTGGCTACAT GGGCGTGCAC CTGGGCCTGA TGGCCATCAC GGCGACGCCT GCCGAACTGG CCTCGGTGCT GGCGCACGAG CTGTCGCACG TGACGCAGCG GCACATCGCG CGCAGCGTGG GCGCGAACAA GCTCACGTCT CTGGCCGGAA TCGCAGGGCT GATCCTCGGC GTGCTCGCGG CCAGTCGCAG CCCGGAGGCC GCCAATGCGC TGATCACCGG CGGGCAGGCG GTGGCCGTGC AGGGGCAGCT GAACTATTCG CGCGATGCAG AGCGCGAGGC CGACCGCGTG GGCTTCAACG TACTCACCGG CGCCGGCTAC TCGCCGGCCG GCATGCCGGC GATGTTCGAG AAGCTGCTGC AGGCCTCGCG CCTGAACGAC AGCCAGAATT ACCCTTACCT GCGCAGCCAC CCGCTGACCA CCGAACGCAT CGGCGAAGCC CGCTCGCGCC TCGGTGTGGC CCATACCGCG CCGCCGCCCC GGCCGCTGCT GCACGTGCTG ATGCAGGCCC GCGCCCGCGT GCTGATGGAC ACGCGCGACA GCGCGCTCCA GCGCGCCCAG GACTTCGACA CCGAGCGGGC CCTGGCCACC GTCGCCGCGC CCGCCGACCG GCTGGGGGCG CTGTACGGCA GTGCGCTGGC GTCGATCCTG CGGCGCGATT TCTCGCGCGC CGACGCGGCC TTGAGCGCTG CACGTCCGCT GGCGGACGGC GACAAGGACG CCATGAATGC CTTGGCGCTG TTGACGCTGC AAGGTCTGCT GGTGCGCAGC GATGGCATGC GCGCCGACCC CGTGCTCGGC AGCCTGCTGG CCGCGCCGGG AGGGCAGGAC TCCCGCACGC TGCTGCTTGC GCAGGCGCAA CGCGCGCTGC TGCCAGGCAG CCCGCCCGAG ATGCAGCGCG CGGCCGTCGA CCGCCTTCAG ACCTGGGTGG CCGTCCACCG GCAGGATGCG CTGGCCTGGG GCAGCGCTGC GCAACTGTGG GAACGGCTGG GTCAGCGGCT GCGGGCCGTG CGCGCGGACG CCGAGTCGCG CGCGGCGGTC GGTGACGTGC AGGGCGCCGT CGAGCGGCTG CGTGCAGCGC AGAAGCTGGC CCGCACGGCG ACCGGCAGCG ACTTCATCGA GGCCTCGGTG CTCGAATCCC GGCTGCGCGA GCTGGAGCTG CAGCGCCGGC AGATCATTGC CGAGGAGCGC GGCGAGCTTT GA
|
Protein sequence | MRLSMASDRR RRWPRALPLA LSLALAAQGL APAALAQQGS PGSRNNLPAL GDSASDDISV PNERRIGDRI MRDIRRDADY LDDPILLEYV QTMMSALIVS SRERGNIPTE LDERFAWEAF LVRDRTVNAF ALPGGYMGVH LGLMAITATP AELASVLAHE LSHVTQRHIA RSVGANKLTS LAGIAGLILG VLAASRSPEA ANALITGGQA VAVQGQLNYS RDAEREADRV GFNVLTGAGY SPAGMPAMFE KLLQASRLND SQNYPYLRSH PLTTERIGEA RSRLGVAHTA PPPRPLLHVL MQARARVLMD TRDSALQRAQ DFDTERALAT VAAPADRLGA LYGSALASIL RRDFSRADAA LSAARPLADG DKDAMNALAL LTLQGLLVRS DGMRADPVLG SLLAAPGGQD SRTLLLAQAQ RALLPGSPPE MQRAAVDRLQ TWVAVHRQDA LAWGSAAQLW ERLGQRLRAV RADAESRAAV GDVQGAVERL RAAQKLARTA TGSDFIEASV LESRLRELEL QRRQIIAEER GEL
|
| |