Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3396 |
Symbol | |
ID | 4786383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3611114 |
End bp | 3612130 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640091972 |
Product | trypsin-like serine protease |
Protein accession | YP_001022584 |
Protein GI | 124268580 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0221978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGGC TTCCCCTCCT CTTCGCAGCC GCCTGGCTGC TGGGTCTGCC GCCTGGCGCG CGGGCCGACA TCGACCGCGC CGCGCTGATC CGGCTGGCGC CGAGCGTGCT GAAGATCGAG GCGGTCAGCG CCGCGGGCGG CCTGCAGCTC GGTTCCGGCG TGATCGTCGG CCCCGGCAGG GTGGTGACCA ACTGCCATGT GACGCGCCAC GCGGTGCGCG TGAACGTGGT GAAGGGCGGT GTGCGCTGGA CGGCCAACCT GCAGGCCGCC GACATGCTCC GCGACCTGTG TCTGCTGCAG GTCCCGAGGC TGGAGGGCGA TGCCGTCCCG ATCGCGCGCG CGGCCTCGCT ACACCCCGGG CAGCAGGTGC TGGCGATGGG CTACACCGGC GGGGTGGGCA TCCAGCTCAG CGAAGGCGAC GTGGTGGCCC TGCACCACTG GTCCGGCAGC CAGATCGTGC AGAGCAGCAA CTGGTTCAGC TCGGGCGCCA GCGGTGGCGG GCTGTTCAAT GCCGACGGCA AGCTGGTCGG CATCCTGACC TTCCGGTTGC GCGGCGGCGC TCGCCACTAC TTCGCCGCAC CCGCCGACTG GGTGCTCGCG CAGCTCAACG ACGAACTGCC CTACAACGCG GTCGCCCCGC TGGCCGGCAA GAGCTTCTGG GAGCAGCCGG ACACCGAGCA ACCCTACTTC CTGCAGGCCG CCGCGCTGGA GCAGGGCCAG CAATGGGCGG CACTCGCCCA ACTGGCCGAC CGCTGGCAAC AGGAGGCCGG CGACGATCCG GAAGCACCCT ACCTGCTGGG CGTCGCCTTC GAGGGCCTGC ACCAGCCAGA GCCCTCCATC CGCGCCTTCC AGCGCAGCGT GGAGATCGAT CCGACCTACA ACCGCAGCTG GGCCCGACTC GCCCAGGTCT ACAAGCGGCA GGGCCAGCTG CGCGAATCAC GCAATGCCGT CGCGCGCCTT GCGGCGCTCG ACCCGAAACA GGCCCGCGAA CTCGCGGCCG AACTGGAGAA ACCATGA
|
Protein sequence | MRRLPLLFAA AWLLGLPPGA RADIDRAALI RLAPSVLKIE AVSAAGGLQL GSGVIVGPGR VVTNCHVTRH AVRVNVVKGG VRWTANLQAA DMLRDLCLLQ VPRLEGDAVP IARAASLHPG QQVLAMGYTG GVGIQLSEGD VVALHHWSGS QIVQSSNWFS SGASGGGLFN ADGKLVGILT FRLRGGARHY FAAPADWVLA QLNDELPYNA VAPLAGKSFW EQPDTEQPYF LQAAALEQGQ QWAALAQLAD RWQQEAGDDP EAPYLLGVAF EGLHQPEPSI RAFQRSVEID PTYNRSWARL AQVYKRQGQL RESRNAVARL AALDPKQARE LAAELEKP
|
| |