Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1129 |
Symbol | |
ID | 4784611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1206746 |
End bp | 1207813 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640089692 |
Product | putative serine protease |
Protein accession | YP_001020325 |
Protein GI | 124266321 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.247199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0306417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGGC CATCCCTCGT GGAGGCCGAG ATGTCACCCT CCCTTCTGCT TCGCCTGTCG TTCGCCCTGG TTCTGGGACT GTCCGCCGGC CTCGCTCACC CCGCACCGGA CCGTCAGGGT AACGGCGAGG CTGCCGAATT CCCGGGCCAG CGCCTGGCGC TGAAACGGGC CAGCGATGCG GTGCTCGGGG TGCAGAGCAC GGCGACCGAG GGCGCCAGCA CGATCGACAC GCTGGGCGAA TACCGCGCCG GCTCCGGCGT CGTGATCGCC AGCGACGGCT TGGTGCTGAC GATCGGCTAC CTGATTCTCG AGGCGGAGGA GGTCGAGCTG GTGCTCGACA GCGGCAAGCG CATGCCGGCC CGCGTGGTGG CCTACGATCT GGCGACCGGC TTCGGCCTGG TGCAAGCGGT GCTGCCGCTG GGCATCGCGC CGGCGCAGCT CGGCCAGGCC CACGCGGTGG CGGAGGGCGA GCCGCTGCTG TTCGTGAGCG GCGGCGACGA CGGGGCGCTG AGCGCCGCGG AGCTGGTGTC GCGGCGCGGC TTCTCCGGCT ACTGGGAGTA CCACATCGAC GGCGCGCTGT TCACCAGCCC GGCCCGCCGC GACCACAGTG GTGCAGGGCT CTTCAACGCG CAGGGGGAGC TGATCGGCAT CGGCTCGCTG CTCGTGCCCA GCGCGCCCGG CGACGGCAGC CGCCGTGCCG GCAACATGTT CGTGCCGGTC GACCTGCTGC CGCCGATCTT CGCGGAGTTG CGCGAACGGG GCGTATCGCG CGCCAGCATG CGTGCCTGGC TGGGAGTGAA CTGCGTCGAG CAGGACGACG GCCTGCGCGT GGTGCGCGTG AGCCGCGACA GCCCGGCCGA GATGGCAGGG CTGCAGCCCG GCGATCTGAT CCGCCGCCTC GATGGCGCGC CGGTCGGTGG CCTGGAGTCC TTCTACAAGA TGCTCTGGAA TGGAGGCAGC GCCGAGCGTG ACCTGACCAT CGAGGTGTTG CGCGAAGGCC GGATGCAGTC GGTGCCGGTG CACAGCATCG ACCGCACGCA GACCTTGCGG CGCGCGCGCG GCATCTGA
|
Protein sequence | MLRPSLVEAE MSPSLLLRLS FALVLGLSAG LAHPAPDRQG NGEAAEFPGQ RLALKRASDA VLGVQSTATE GASTIDTLGE YRAGSGVVIA SDGLVLTIGY LILEAEEVEL VLDSGKRMPA RVVAYDLATG FGLVQAVLPL GIAPAQLGQA HAVAEGEPLL FVSGGDDGAL SAAELVSRRG FSGYWEYHID GALFTSPARR DHSGAGLFNA QGELIGIGSL LVPSAPGDGS RRAGNMFVPV DLLPPIFAEL RERGVSRASM RAWLGVNCVE QDDGLRVVRV SRDSPAEMAG LQPGDLIRRL DGAPVGGLES FYKMLWNGGS AERDLTIEVL REGRMQSVPV HSIDRTQTLR RARGI
|
| |