Gene Information |
Locus tag | Mpe_A0844 |
Symbol | |
ID | 4787137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 879642 |
End bp | 880802 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640089405 |
Product | putative HtrA-like serine protease signal peptide protein |
Protein accession | YP_001020041 |
Protein GI | 124266037 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family; [TIGR02038] periplasmic serine peptidase DegS |
|
|
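The coordinate and length fields above agree with one another, and a few lines of Python can confirm the arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(879642, 880802)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results match the Gene Length and Protein Length fields of this record.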
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.748774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA CCTGGTTGAT TTTCTCCCAA GCCGTGACGG TCGCCCTGGC GATGTGGTTC GTCGTGGCCA CGCTCAAGCC CGAATGGGTG CAGAACAGGC CGCTGTCGAC TGCGGTGGGG TCGTCGTCCA ACCCGGCCCC TCAGATCACG CTGGCACCCG CCGCAGCCGG CGGCAGCAGC TACGCCGCCG CGGCCAAGCG CGCGGCGCCG GCGGTGGTGA GCGTCATCAC GAGCAAGACA CCGACCGCGC GCGATCCGCG CGCCGCCGAT CCCTGGTTCC GCTATTTCTT CGGTGATCCC GACTCGCAGG CGCAGAGCGG CGTGGGCTCG GGCGTCATCG TTTCGCCCGA AGGCTATGTG CTGACCAACA ACCACGTGGT CGAGCGCATG GACGACATCG AGGTCGTGCT GTCCGACGGG CGGCGCACCA AGGCCGAGGT GATCGGCACC GACCCCGAGG CCGACCTCGC GGTGCTGCGC ATCAAGCTCG ACAAGCTGCC GTCGGTGAGC TTCGGGGACT CCGACGCACT GCAGGTCGGC GACGTGGTGC TGGCGATCGG CAACCCCTTC GGCGTCGGCC AGACGGTCAC CTCCGGCATC GTGTCGGCGC TCGGCCGCAA CCAGCTCGGC ATCAACACCT TCGAGAACTT CATCCAGACC GACGCGGCCA TCAACCCCGG CAATTCCGGC GGGGCGCTGA TCGACGCCGC CGGCAACCTG ATGGGGATCA ACACAGCGAT CTACTCGCGC TCGGGCGGCA GCCTCGGCAT CGGCTTCGCG ATCCCGGTGT CCACGGCCCG CCAGGTGATG GAGGGCCTGA TCCGGGACGG CCAGGTCACG CGCGGCTGGA TCGGCGTGGA GCCACGCGAC CTCACGCCCG AGATCGCCGA GACCTTCAAC CTCAAGGTGT CGCAGGGCGT GCTGATCACC GGCGTGCTGC AGGGCGGCCC GGCCAGCGAC GGCGGCCTGC GGCCCGGCGA CGTGGTGGTG AAGGTGGCCG ACTCGCCGGT CGGCAACACC TCGCAGCTGC TCAACGTCGT GGCCTCCCTC AAGCCGCGCT CGAAGGCCCG GCTCGTGGTG CAACGGGGCG ACAGCGAAGT GACGCTCGAC GTCATCGTCG CGCAGCGGCC GAAGGCACCG CGGCAGCAGC GCGAACCCTA G
|
Protein sequence | MRKTWLIFSQ AVTVALAMWF VVATLKPEWV QNRPLSTAVG SSSNPAPQIT LAPAAAGGSS YAAAAKRAAP AVVSVITSKT PTARDPRAAD PWFRYFFGDP DSQAQSGVGS GVIVSPEGYV LTNNHVVERM DDIEVVLSDG RRTKAEVIGT DPEADLAVLR IKLDKLPSVS FGDSDALQVG DVVLAIGNPF GVGQTVTSGI VSALGRNQLG INTFENFIQT DAAINPGNSG GALIDAAGNL MGINTAIYSR SGGSLGIGFA IPVSTARQVM EGLIRDGQVT RGWIGVEPRD LTPEIAETFN LKVSQGVLIT GVLQGGPASD GGLRPGDVVV KVADSPVGNT SQLLNVVASL KPRSKARLVV QRGDSEVTLD VIVAQRPKAP RQQREP
|
| |
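The sequence fields are printed in blocks of 10 characters, so they need light cleaning before any computation. The following is a minimal sketch of how the GC content and the protein sequence could be recomputed from the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG), and it relies on the fact that NCBI translation table 11 assigns the same residues to codons as the standard table. Alternative start codons are not special-cased, and the helper names are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
_RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _RESIDUES)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCGCAAGA CCTGGTTGAT TTTCTCCCAA"
print(translate(first_blocks))  # MRKTWLIFSQ
print(f"{gc_fraction(first_blocks):.0%}")
```

Pasting the full Gene sequence value into `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of this record.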