Gene Mpe_A0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0844 
Symbol 
ID4787137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp879642 
End bp880802 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID640089405 
Productputative HtrA-like serine protease signal peptide protein 
Protein accessionYP_001020041 
Protein GI124266037 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.748774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA CCTGGTTGAT TTTCTCCCAA GCCGTGACGG TCGCCCTGGC GATGTGGTTC 
GTCGTGGCCA CGCTCAAGCC CGAATGGGTG CAGAACAGGC CGCTGTCGAC TGCGGTGGGG
TCGTCGTCCA ACCCGGCCCC TCAGATCACG CTGGCACCCG CCGCAGCCGG CGGCAGCAGC
TACGCCGCCG CGGCCAAGCG CGCGGCGCCG GCGGTGGTGA GCGTCATCAC GAGCAAGACA
CCGACCGCGC GCGATCCGCG CGCCGCCGAT CCCTGGTTCC GCTATTTCTT CGGTGATCCC
GACTCGCAGG CGCAGAGCGG CGTGGGCTCG GGCGTCATCG TTTCGCCCGA AGGCTATGTG
CTGACCAACA ACCACGTGGT CGAGCGCATG GACGACATCG AGGTCGTGCT GTCCGACGGG
CGGCGCACCA AGGCCGAGGT GATCGGCACC GACCCCGAGG CCGACCTCGC GGTGCTGCGC
ATCAAGCTCG ACAAGCTGCC GTCGGTGAGC TTCGGGGACT CCGACGCACT GCAGGTCGGC
GACGTGGTGC TGGCGATCGG CAACCCCTTC GGCGTCGGCC AGACGGTCAC CTCCGGCATC
GTGTCGGCGC TCGGCCGCAA CCAGCTCGGC ATCAACACCT TCGAGAACTT CATCCAGACC
GACGCGGCCA TCAACCCCGG CAATTCCGGC GGGGCGCTGA TCGACGCCGC CGGCAACCTG
ATGGGGATCA ACACAGCGAT CTACTCGCGC TCGGGCGGCA GCCTCGGCAT CGGCTTCGCG
ATCCCGGTGT CCACGGCCCG CCAGGTGATG GAGGGCCTGA TCCGGGACGG CCAGGTCACG
CGCGGCTGGA TCGGCGTGGA GCCACGCGAC CTCACGCCCG AGATCGCCGA GACCTTCAAC
CTCAAGGTGT CGCAGGGCGT GCTGATCACC GGCGTGCTGC AGGGCGGCCC GGCCAGCGAC
GGCGGCCTGC GGCCCGGCGA CGTGGTGGTG AAGGTGGCCG ACTCGCCGGT CGGCAACACC
TCGCAGCTGC TCAACGTCGT GGCCTCCCTC AAGCCGCGCT CGAAGGCCCG GCTCGTGGTG
CAACGGGGCG ACAGCGAAGT GACGCTCGAC GTCATCGTCG CGCAGCGGCC GAAGGCACCG
CGGCAGCAGC GCGAACCCTA G
 
Protein sequence
MRKTWLIFSQ AVTVALAMWF VVATLKPEWV QNRPLSTAVG SSSNPAPQIT LAPAAAGGSS 
YAAAAKRAAP AVVSVITSKT PTARDPRAAD PWFRYFFGDP DSQAQSGVGS GVIVSPEGYV
LTNNHVVERM DDIEVVLSDG RRTKAEVIGT DPEADLAVLR IKLDKLPSVS FGDSDALQVG
DVVLAIGNPF GVGQTVTSGI VSALGRNQLG INTFENFIQT DAAINPGNSG GALIDAAGNL
MGINTAIYSR SGGSLGIGFA IPVSTARQVM EGLIRDGQVT RGWIGVEPRD LTPEIAETFN
LKVSQGVLIT GVLQGGPASD GGLRPGDVVV KVADSPVGNT SQLLNVVASL KPRSKARLVV
QRGDSEVTLD VIVAQRPKAP RQQREP