Gene Mpe_A0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0422 
Symbol 
ID4785175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp456637 
End bp458052 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content72% 
IMG OID640088980 
Producttrypsin-like serine protease 
Protein accessionYP_001019619 
Protein GI124265615 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.233942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAC CCCGAGCCTG GCGGCAGCGT GCACTCGGGC TGCTGTTCGC GCTGGCGAGC 
GGCGCCATCG GTGCGCAGAC CGCCGCCCCC GCCTCCGCTC CCTCCTCCGC CCCGCTGCCC
GGGATCGGCG CGCCATCGGT CGCCACCGCG CCGCTGCCCG TGTCGCCGTC CGCGCAGCGG
CTCTACGAGC GCGCGCGCGG CCAGCTGCTG CAGGTGCGCA CGCTGCTGAA GGGGCAGGAC
AGCCAGGCCT CGGTCGGCTC GGGTTTCTTC GTCAGCGACG ACGGCCTGAT CGTCACCAAC
TACCACGTCG TCAGCCAGGT GGCGCTGCAG CCCGATCGCT ACCGGCTCAC CTACACCCGC
GTCGACGGCC GCGAGGGCGC GCTGCAGCTG CTGGGCTTCG ACGCGATCCA CGACCTGGCG
CTCGTGAAGG CGCTGCCGCC GAACGGCCCG TCTCGCAAGA GCGGCGTCAG CGTGGTCGAC
GCAGCGGGCG AGCCGCTCGC CTTCCGCGCC GCGAACGACG CACTGGCCCA GGGCGAACGC
ATCTACTCGC TCGGCAACCC GCTCGACGTC GGCTTCGCGG TGCTGGAAGG CAACTACAAC
GGGCTCGTCG AGCGCAGCTT CTACCCCAGC ATCTTCTTCG GCGGCGCGCT CAACTCCGGC
ATGAGCGGCG GGCCCGCGCT CGACGAGGCC GGCCGCGTGG TCGGCGTCAA CGTCGCCACA
CGGCGCGACG GCCAACAGGT GAGCTTCCTG GTGCCGGCGC CGTTCGCGCA GGCCCTGGTG
GAGCGGGCCC GCGGCGCGGC GCCGATCACC GCGCCGGTCT ATCCGCAGCT CACCGCGCAG
CTGCTCGCGC ACCAGGAGGC CGTGGTGCAA CGCTTCGTCC AGCAGCCCTG GCGCAGCGCC
GGCCACCCGC ACTACCTGAT CCCCGTGCCA CAGGAAGACT TCATGCGCTG CTGGGGCCGC
AGCACGCCGG CGGACACCAA GGGCCTGGAG TTCGAGCGCT CCGACTGCGA GATGGACACG
CAGATCTTCG TCAGCGGCAG CCTGCTCACC GGCTCGCTGG GCGCGCGCCA CGAGGCCTAC
GACGGCCGCA AGCTCGGCTG GCTGCGCTTC ACCGAGCGCT ACAGCGCGAG CTTCCGCAAC
GAGAGCTTCG GGCGCCGCAA CCCGAAGGAA TTCACCGCGC CGCAGTGCAG CGAGCGCTTC
GTCGACCGCG ACGGCCTGCC GCTGCGCGCA GTGCTGTGCC TGTCGGCCTA CAAGCGCCTC
GCCGGACTCT ACGACGTCAG CGTGCTGGTC GCCACACTCG ACCAGGCCCG CGTCGGCGCG
CAGGGCCGCC TCGACGCCCG CGGCGTCAGT TTCGACAACG CGATGAAACT GGCCTCGCAC
TACCTGCAGG GCTACGGCGT GAAGGCGGCG CCATGA
 
Protein sequence
MTRPRAWRQR ALGLLFALAS GAIGAQTAAP ASAPSSAPLP GIGAPSVATA PLPVSPSAQR 
LYERARGQLL QVRTLLKGQD SQASVGSGFF VSDDGLIVTN YHVVSQVALQ PDRYRLTYTR
VDGREGALQL LGFDAIHDLA LVKALPPNGP SRKSGVSVVD AAGEPLAFRA ANDALAQGER
IYSLGNPLDV GFAVLEGNYN GLVERSFYPS IFFGGALNSG MSGGPALDEA GRVVGVNVAT
RRDGQQVSFL VPAPFAQALV ERARGAAPIT APVYPQLTAQ LLAHQEAVVQ RFVQQPWRSA
GHPHYLIPVP QEDFMRCWGR STPADTKGLE FERSDCEMDT QIFVSGSLLT GSLGARHEAY
DGRKLGWLRF TERYSASFRN ESFGRRNPKE FTAPQCSERF VDRDGLPLRA VLCLSAYKRL
AGLYDVSVLV ATLDQARVGA QGRLDARGVS FDNAMKLASH YLQGYGVKAA P