Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0422 |
Symbol | |
ID | 4785175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 456637 |
End bp | 458052 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640088980 |
Product | trypsin-like serine protease |
Protein accession | YP_001019619 |
Protein GI | 124265615 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.233942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAC CCCGAGCCTG GCGGCAGCGT GCACTCGGGC TGCTGTTCGC GCTGGCGAGC GGCGCCATCG GTGCGCAGAC CGCCGCCCCC GCCTCCGCTC CCTCCTCCGC CCCGCTGCCC GGGATCGGCG CGCCATCGGT CGCCACCGCG CCGCTGCCCG TGTCGCCGTC CGCGCAGCGG CTCTACGAGC GCGCGCGCGG CCAGCTGCTG CAGGTGCGCA CGCTGCTGAA GGGGCAGGAC AGCCAGGCCT CGGTCGGCTC GGGTTTCTTC GTCAGCGACG ACGGCCTGAT CGTCACCAAC TACCACGTCG TCAGCCAGGT GGCGCTGCAG CCCGATCGCT ACCGGCTCAC CTACACCCGC GTCGACGGCC GCGAGGGCGC GCTGCAGCTG CTGGGCTTCG ACGCGATCCA CGACCTGGCG CTCGTGAAGG CGCTGCCGCC GAACGGCCCG TCTCGCAAGA GCGGCGTCAG CGTGGTCGAC GCAGCGGGCG AGCCGCTCGC CTTCCGCGCC GCGAACGACG CACTGGCCCA GGGCGAACGC ATCTACTCGC TCGGCAACCC GCTCGACGTC GGCTTCGCGG TGCTGGAAGG CAACTACAAC GGGCTCGTCG AGCGCAGCTT CTACCCCAGC ATCTTCTTCG GCGGCGCGCT CAACTCCGGC ATGAGCGGCG GGCCCGCGCT CGACGAGGCC GGCCGCGTGG TCGGCGTCAA CGTCGCCACA CGGCGCGACG GCCAACAGGT GAGCTTCCTG GTGCCGGCGC CGTTCGCGCA GGCCCTGGTG GAGCGGGCCC GCGGCGCGGC GCCGATCACC GCGCCGGTCT ATCCGCAGCT CACCGCGCAG CTGCTCGCGC ACCAGGAGGC CGTGGTGCAA CGCTTCGTCC AGCAGCCCTG GCGCAGCGCC GGCCACCCGC ACTACCTGAT CCCCGTGCCA CAGGAAGACT TCATGCGCTG CTGGGGCCGC AGCACGCCGG CGGACACCAA GGGCCTGGAG TTCGAGCGCT CCGACTGCGA GATGGACACG CAGATCTTCG TCAGCGGCAG CCTGCTCACC GGCTCGCTGG GCGCGCGCCA CGAGGCCTAC GACGGCCGCA AGCTCGGCTG GCTGCGCTTC ACCGAGCGCT ACAGCGCGAG CTTCCGCAAC GAGAGCTTCG GGCGCCGCAA CCCGAAGGAA TTCACCGCGC CGCAGTGCAG CGAGCGCTTC GTCGACCGCG ACGGCCTGCC GCTGCGCGCA GTGCTGTGCC TGTCGGCCTA CAAGCGCCTC GCCGGACTCT ACGACGTCAG CGTGCTGGTC GCCACACTCG ACCAGGCCCG CGTCGGCGCG CAGGGCCGCC TCGACGCCCG CGGCGTCAGT TTCGACAACG CGATGAAACT GGCCTCGCAC TACCTGCAGG GCTACGGCGT GAAGGCGGCG CCATGA
|
Protein sequence | MTRPRAWRQR ALGLLFALAS GAIGAQTAAP ASAPSSAPLP GIGAPSVATA PLPVSPSAQR LYERARGQLL QVRTLLKGQD SQASVGSGFF VSDDGLIVTN YHVVSQVALQ PDRYRLTYTR VDGREGALQL LGFDAIHDLA LVKALPPNGP SRKSGVSVVD AAGEPLAFRA ANDALAQGER IYSLGNPLDV GFAVLEGNYN GLVERSFYPS IFFGGALNSG MSGGPALDEA GRVVGVNVAT RRDGQQVSFL VPAPFAQALV ERARGAAPIT APVYPQLTAQ LLAHQEAVVQ RFVQQPWRSA GHPHYLIPVP QEDFMRCWGR STPADTKGLE FERSDCEMDT QIFVSGSLLT GSLGARHEAY DGRKLGWLRF TERYSASFRN ESFGRRNPKE FTAPQCSERF VDRDGLPLRA VLCLSAYKRL AGLYDVSVLV ATLDQARVGA QGRLDARGVS FDNAMKLASH YLQGYGVKAA P
|
| |