Gene Mpe_A0645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0645 
Symbol 
ID4784772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp678467 
End bp679996 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content68% 
IMG OID640089204 
Productpeptidase 
Protein accessionYP_001019842 
Protein GI124265838 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.146477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAG AAGCAAGAAC TGAAATGCCG CGCTCGTCGT GGCGCGCGCG TTCCTGGGTC 
TGGGTGGCGG CGGCGAGCAT CGCCGGCGCC TCGACGCTGG GGGGGCTGCT GATGTCTCCG
CATGCCAGCC ATGCGCAGCC GGCCGTGGCA GCGGCACGTG GCCTGCCCGA CTTCACCGAC
CTCGTCGAGC AGGTCGGCCC GGCGGTGGTG AACATCCGGA CTACCGAGCG CACCCGCGGT
GGCCAACGCG GCGGTGGGGG TGCCGGCCCG GAGATGGACG AGGAAATGCA GGAGTTCTTC
CGTCGCTTCT TCGGCGTGCC GCCGGGGCAA CTGCCTGGGC AGCGCCAAGA TCCGCGGCGG
CAGGCACCGG ACGAAGAGCA ACAGCGTGGC GTGGGTTCCG GCTTCATTTT CACGACCGAC
GGCTACGTGA TGACCAACGC GCACGTGGTC GACGGTGCCG ACGAGGTGTA CGTCACGCTG
ACGGACAAGC GCGAGTTCAA GGCCAAGCTG ATCGGTGCTG ACAAGCGCAC CGACGTGGCC
GTGGTCAAGA TCGAGGCCGC AGGCCTGCCG TCGGTGAAGA TCGGCGACGT CAGCAAGTTG
AAGGTCGGCG AATGGGTCAT GGCGATCGGC TCGCCCTTCG GCCTGGAGAA CACGGTCACG
GCGGGCATCG TCAGCGCCAA GGCGCGCGAC ACCGGCGAGT TCGTCCCCTT CATCCAGACC
GACGTGGCCA TCAATCCCGG CAACTCCGGC GGTCCGTTGA TCAACCTGCG CGGCGAGGTG
GTCGGCATCA ACTCGCAGAT CCTCAGCCGC TCGGGCGGCT TCATGGGCAT CTCCTTTGCC
ATTCCGATGG ACGAAGCCAC GCGCGTGGCG GACCAGTTGC GTGCCGGTGG CCGTGTGGTG
CGGGGTCGCA TCGGCGTGCA GATCGGCGAG GTGACGAAGG ACGTGGCCGA ATCGCTCGGC
CTCGGCAAGG CGGCGGGGGC CCTGGTGCGT TCGGTCGAGG CCGGCGGGCC GGCCGACAAA
GCCGGCGTCG AGGCGGGCGA CATCATCACG CGCTTCGATG GCAAGCCGGT CGAGAAATCC
AGCGACCTGC CGCGTCTGGT GGGGGGAACC AAGCCGGGGA GCAAGGCAAG CCTGCAGGTC
TTCCGCCGCG GCAGCGCGCG AGATCTTGGT GTGACGGTGG CAGAACTCGA GCCCGAACCG
GGACGCCGGG CGGCCGCACC GGAGAGCAAG CAGGCGCCGA CGCCGAGCGT GGTGTCCGGC
CTGGGCTTGA CGCTGGCCAA CCTGAGCGAG GAGCAGAAGC GCGAACTCAA GCTGCGTGGC
GGCGTGCGTG TGGAAGCGAC CGAAGGTGCG GCGGCGCGCG CGGGCTTGCG TGAAGGCGAC
GTGATCCTGT CGGTCGGCAA TGTCGAGATC GTCGACGTGA AGCAGTTCGA GGCCGTGATC
GCCAAGGTCG ACAAGAGCAA GCCCATCAAC GTGCTGTTCA GGCGAGGAGA GTGGGCGCAG
TACGCGCTGA TCCGCACGGC GACCCGCTGA
 
Protein sequence
MMQEARTEMP RSSWRARSWV WVAAASIAGA STLGGLLMSP HASHAQPAVA AARGLPDFTD 
LVEQVGPAVV NIRTTERTRG GQRGGGGAGP EMDEEMQEFF RRFFGVPPGQ LPGQRQDPRR
QAPDEEQQRG VGSGFIFTTD GYVMTNAHVV DGADEVYVTL TDKREFKAKL IGADKRTDVA
VVKIEAAGLP SVKIGDVSKL KVGEWVMAIG SPFGLENTVT AGIVSAKARD TGEFVPFIQT
DVAINPGNSG GPLINLRGEV VGINSQILSR SGGFMGISFA IPMDEATRVA DQLRAGGRVV
RGRIGVQIGE VTKDVAESLG LGKAAGALVR SVEAGGPADK AGVEAGDIIT RFDGKPVEKS
SDLPRLVGGT KPGSKASLQV FRRGSARDLG VTVAELEPEP GRRAAAPESK QAPTPSVVSG
LGLTLANLSE EQKRELKLRG GVRVEATEGA AARAGLREGD VILSVGNVEI VDVKQFEAVI
AKVDKSKPIN VLFRRGEWAQ YALIRTATR