Gene Mpe_A3396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3396 
Symbol 
ID4786383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3611114 
End bp3612130 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content71% 
IMG OID640091972 
Producttrypsin-like serine protease 
Protein accessionYP_001022584 
Protein GI124268580 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0221978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGGC TTCCCCTCCT CTTCGCAGCC GCCTGGCTGC TGGGTCTGCC GCCTGGCGCG 
CGGGCCGACA TCGACCGCGC CGCGCTGATC CGGCTGGCGC CGAGCGTGCT GAAGATCGAG
GCGGTCAGCG CCGCGGGCGG CCTGCAGCTC GGTTCCGGCG TGATCGTCGG CCCCGGCAGG
GTGGTGACCA ACTGCCATGT GACGCGCCAC GCGGTGCGCG TGAACGTGGT GAAGGGCGGT
GTGCGCTGGA CGGCCAACCT GCAGGCCGCC GACATGCTCC GCGACCTGTG TCTGCTGCAG
GTCCCGAGGC TGGAGGGCGA TGCCGTCCCG ATCGCGCGCG CGGCCTCGCT ACACCCCGGG
CAGCAGGTGC TGGCGATGGG CTACACCGGC GGGGTGGGCA TCCAGCTCAG CGAAGGCGAC
GTGGTGGCCC TGCACCACTG GTCCGGCAGC CAGATCGTGC AGAGCAGCAA CTGGTTCAGC
TCGGGCGCCA GCGGTGGCGG GCTGTTCAAT GCCGACGGCA AGCTGGTCGG CATCCTGACC
TTCCGGTTGC GCGGCGGCGC TCGCCACTAC TTCGCCGCAC CCGCCGACTG GGTGCTCGCG
CAGCTCAACG ACGAACTGCC CTACAACGCG GTCGCCCCGC TGGCCGGCAA GAGCTTCTGG
GAGCAGCCGG ACACCGAGCA ACCCTACTTC CTGCAGGCCG CCGCGCTGGA GCAGGGCCAG
CAATGGGCGG CACTCGCCCA ACTGGCCGAC CGCTGGCAAC AGGAGGCCGG CGACGATCCG
GAAGCACCCT ACCTGCTGGG CGTCGCCTTC GAGGGCCTGC ACCAGCCAGA GCCCTCCATC
CGCGCCTTCC AGCGCAGCGT GGAGATCGAT CCGACCTACA ACCGCAGCTG GGCCCGACTC
GCCCAGGTCT ACAAGCGGCA GGGCCAGCTG CGCGAATCAC GCAATGCCGT CGCGCGCCTT
GCGGCGCTCG ACCCGAAACA GGCCCGCGAA CTCGCGGCCG AACTGGAGAA ACCATGA
 
Protein sequence
MRRLPLLFAA AWLLGLPPGA RADIDRAALI RLAPSVLKIE AVSAAGGLQL GSGVIVGPGR 
VVTNCHVTRH AVRVNVVKGG VRWTANLQAA DMLRDLCLLQ VPRLEGDAVP IARAASLHPG
QQVLAMGYTG GVGIQLSEGD VVALHHWSGS QIVQSSNWFS SGASGGGLFN ADGKLVGILT
FRLRGGARHY FAAPADWVLA QLNDELPYNA VAPLAGKSFW EQPDTEQPYF LQAAALEQGQ
QWAALAQLAD RWQQEAGDDP EAPYLLGVAF EGLHQPEPSI RAFQRSVEID PTYNRSWARL
AQVYKRQGQL RESRNAVARL AALDPKQARE LAAELEKP