Gene Mpe_A2584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2584 
Symbol 
ID4787019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2758290 
End bp2759282 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content78% 
IMG OID640091153 
Productputative prolyl aminopeptidase 
Protein accessionYP_001021772 
Protein GI124267768 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.35326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCA ACGGCGATCC GGGGCAAACC CTCGCCCCCG CGGCGCTGCC GGCCGGGGAC 
TGGCTGGCGC CCGTGGACGG ACACCGCGTC TGGTGGTGCG AGGGGGGCGA CCCGGCCGGG
CTGCCGGTGC TGATCGTGCA CGGCGGCCCG GGCGGCGCCA GCCGCCTGGA GCCGACGCGC
TGGTTCGACG GCCTGCCGCT GCGCTGGATC GCGATCGACC AGCGCGGTTG CGGGCGCAGC
GAGCCGCCCG GTCGGACGGA CGGCAACGAC CTGGGAGCGC TGCTCGACGA CATGGAGCGC
CTGCGCCGCC ACCTGGGCCT GCGGCGCTGG GCCGTGGCCG GCGGCTCGTG GGGTGCGCGC
GTGGCGCTGG CCTATGCCGC GCGCTGGCCG GAGGTGCTGC ACGGGCTGCT GCTGCGCAGC
CCCTTCCTCG GCACGGCCGC CGAGACGCGG CGCTACATCG CGCCGTGGCG GCCCTGGCTG
GGCGCCGAGG GCCAGGCCTG GCTGGGCGAA CCGGCCGCGA CGGCGGTGGC CGCGCTGTAT
CAGGCGGAGC CGGGGTTGCT CCACATTGGC GCAATGCAGG CCGACGAGCG GATCGCCCGC
GCCTGGTCGG CGTTCGACGA TGCGCAATCG GCGCCGGGTG GGGTCGCGGC CAGCGGCGCT
CGCTGCGATC CCGCCGCCTT GCCGGCCGCG ACGCCGCAGC TGATGGCTTC GTGGCGCGTC
CACGCCCACT ACGCCGCCGC GTCCTGGGGG GCGGCAGCCG CCGGTGCGGC CGGTGTGCCG
GCGCTGAACA GCGGTGTGCC GGTCAGCGTG GTCTGGGGGG CGGCCGATGC CACCTGCGAC
CCGGCCGTGG CCCGGGCGCT CGCGGCGGCG CTGCCAGGCG CCTTGTCGAA CGAGGTGCCG
GAGGCCGGTC ACCGCATGAG CGATCCCCGC TTGGCGCCGG CCTTGCGTGC CGCCGCGCGC
GACTGGGCGC TGCGGTGTCG GGGCAGCGGC TGA
 
Protein sequence
MSSNGDPGQT LAPAALPAGD WLAPVDGHRV WWCEGGDPAG LPVLIVHGGP GGASRLEPTR 
WFDGLPLRWI AIDQRGCGRS EPPGRTDGND LGALLDDMER LRRHLGLRRW AVAGGSWGAR
VALAYAARWP EVLHGLLLRS PFLGTAAETR RYIAPWRPWL GAEGQAWLGE PAATAVAALY
QAEPGLLHIG AMQADERIAR AWSAFDDAQS APGGVAASGA RCDPAALPAA TPQLMASWRV
HAHYAAASWG AAAAGAAGVP ALNSGVPVSV VWGAADATCD PAVARALAAA LPGALSNEVP
EAGHRMSDPR LAPALRAAAR DWALRCRGSG