Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0645 |
Symbol | |
ID | 4784772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 678467 |
End bp | 679996 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640089204 |
Product | peptidase |
Protein accession | YP_001019842 |
Protein GI | 124265838 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.146477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCAAG AAGCAAGAAC TGAAATGCCG CGCTCGTCGT GGCGCGCGCG TTCCTGGGTC TGGGTGGCGG CGGCGAGCAT CGCCGGCGCC TCGACGCTGG GGGGGCTGCT GATGTCTCCG CATGCCAGCC ATGCGCAGCC GGCCGTGGCA GCGGCACGTG GCCTGCCCGA CTTCACCGAC CTCGTCGAGC AGGTCGGCCC GGCGGTGGTG AACATCCGGA CTACCGAGCG CACCCGCGGT GGCCAACGCG GCGGTGGGGG TGCCGGCCCG GAGATGGACG AGGAAATGCA GGAGTTCTTC CGTCGCTTCT TCGGCGTGCC GCCGGGGCAA CTGCCTGGGC AGCGCCAAGA TCCGCGGCGG CAGGCACCGG ACGAAGAGCA ACAGCGTGGC GTGGGTTCCG GCTTCATTTT CACGACCGAC GGCTACGTGA TGACCAACGC GCACGTGGTC GACGGTGCCG ACGAGGTGTA CGTCACGCTG ACGGACAAGC GCGAGTTCAA GGCCAAGCTG ATCGGTGCTG ACAAGCGCAC CGACGTGGCC GTGGTCAAGA TCGAGGCCGC AGGCCTGCCG TCGGTGAAGA TCGGCGACGT CAGCAAGTTG AAGGTCGGCG AATGGGTCAT GGCGATCGGC TCGCCCTTCG GCCTGGAGAA CACGGTCACG GCGGGCATCG TCAGCGCCAA GGCGCGCGAC ACCGGCGAGT TCGTCCCCTT CATCCAGACC GACGTGGCCA TCAATCCCGG CAACTCCGGC GGTCCGTTGA TCAACCTGCG CGGCGAGGTG GTCGGCATCA ACTCGCAGAT CCTCAGCCGC TCGGGCGGCT TCATGGGCAT CTCCTTTGCC ATTCCGATGG ACGAAGCCAC GCGCGTGGCG GACCAGTTGC GTGCCGGTGG CCGTGTGGTG CGGGGTCGCA TCGGCGTGCA GATCGGCGAG GTGACGAAGG ACGTGGCCGA ATCGCTCGGC CTCGGCAAGG CGGCGGGGGC CCTGGTGCGT TCGGTCGAGG CCGGCGGGCC GGCCGACAAA GCCGGCGTCG AGGCGGGCGA CATCATCACG CGCTTCGATG GCAAGCCGGT CGAGAAATCC AGCGACCTGC CGCGTCTGGT GGGGGGAACC AAGCCGGGGA GCAAGGCAAG CCTGCAGGTC TTCCGCCGCG GCAGCGCGCG AGATCTTGGT GTGACGGTGG CAGAACTCGA GCCCGAACCG GGACGCCGGG CGGCCGCACC GGAGAGCAAG CAGGCGCCGA CGCCGAGCGT GGTGTCCGGC CTGGGCTTGA CGCTGGCCAA CCTGAGCGAG GAGCAGAAGC GCGAACTCAA GCTGCGTGGC GGCGTGCGTG TGGAAGCGAC CGAAGGTGCG GCGGCGCGCG CGGGCTTGCG TGAAGGCGAC GTGATCCTGT CGGTCGGCAA TGTCGAGATC GTCGACGTGA AGCAGTTCGA GGCCGTGATC GCCAAGGTCG ACAAGAGCAA GCCCATCAAC GTGCTGTTCA GGCGAGGAGA GTGGGCGCAG TACGCGCTGA TCCGCACGGC GACCCGCTGA
|
Protein sequence | MMQEARTEMP RSSWRARSWV WVAAASIAGA STLGGLLMSP HASHAQPAVA AARGLPDFTD LVEQVGPAVV NIRTTERTRG GQRGGGGAGP EMDEEMQEFF RRFFGVPPGQ LPGQRQDPRR QAPDEEQQRG VGSGFIFTTD GYVMTNAHVV DGADEVYVTL TDKREFKAKL IGADKRTDVA VVKIEAAGLP SVKIGDVSKL KVGEWVMAIG SPFGLENTVT AGIVSAKARD TGEFVPFIQT DVAINPGNSG GPLINLRGEV VGINSQILSR SGGFMGISFA IPMDEATRVA DQLRAGGRVV RGRIGVQIGE VTKDVAESLG LGKAAGALVR SVEAGGPADK AGVEAGDIIT RFDGKPVEKS SDLPRLVGGT KPGSKASLQV FRRGSARDLG VTVAELEPEP GRRAAAPESK QAPTPSVVSG LGLTLANLSE EQKRELKLRG GVRVEATEGA AARAGLREGD VILSVGNVEI VDVKQFEAVI AKVDKSKPIN VLFRRGEWAQ YALIRTATR
|
| |