Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2584 |
Symbol | |
ID | 4787019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2758290 |
End bp | 2759282 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 640091153 |
Product | putative prolyl aminopeptidase |
Protein accession | YP_001021772 |
Protein GI | 124267768 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.35326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGCA ACGGCGATCC GGGGCAAACC CTCGCCCCCG CGGCGCTGCC GGCCGGGGAC TGGCTGGCGC CCGTGGACGG ACACCGCGTC TGGTGGTGCG AGGGGGGCGA CCCGGCCGGG CTGCCGGTGC TGATCGTGCA CGGCGGCCCG GGCGGCGCCA GCCGCCTGGA GCCGACGCGC TGGTTCGACG GCCTGCCGCT GCGCTGGATC GCGATCGACC AGCGCGGTTG CGGGCGCAGC GAGCCGCCCG GTCGGACGGA CGGCAACGAC CTGGGAGCGC TGCTCGACGA CATGGAGCGC CTGCGCCGCC ACCTGGGCCT GCGGCGCTGG GCCGTGGCCG GCGGCTCGTG GGGTGCGCGC GTGGCGCTGG CCTATGCCGC GCGCTGGCCG GAGGTGCTGC ACGGGCTGCT GCTGCGCAGC CCCTTCCTCG GCACGGCCGC CGAGACGCGG CGCTACATCG CGCCGTGGCG GCCCTGGCTG GGCGCCGAGG GCCAGGCCTG GCTGGGCGAA CCGGCCGCGA CGGCGGTGGC CGCGCTGTAT CAGGCGGAGC CGGGGTTGCT CCACATTGGC GCAATGCAGG CCGACGAGCG GATCGCCCGC GCCTGGTCGG CGTTCGACGA TGCGCAATCG GCGCCGGGTG GGGTCGCGGC CAGCGGCGCT CGCTGCGATC CCGCCGCCTT GCCGGCCGCG ACGCCGCAGC TGATGGCTTC GTGGCGCGTC CACGCCCACT ACGCCGCCGC GTCCTGGGGG GCGGCAGCCG CCGGTGCGGC CGGTGTGCCG GCGCTGAACA GCGGTGTGCC GGTCAGCGTG GTCTGGGGGG CGGCCGATGC CACCTGCGAC CCGGCCGTGG CCCGGGCGCT CGCGGCGGCG CTGCCAGGCG CCTTGTCGAA CGAGGTGCCG GAGGCCGGTC ACCGCATGAG CGATCCCCGC TTGGCGCCGG CCTTGCGTGC CGCCGCGCGC GACTGGGCGC TGCGGTGTCG GGGCAGCGGC TGA
|
Protein sequence | MSSNGDPGQT LAPAALPAGD WLAPVDGHRV WWCEGGDPAG LPVLIVHGGP GGASRLEPTR WFDGLPLRWI AIDQRGCGRS EPPGRTDGND LGALLDDMER LRRHLGLRRW AVAGGSWGAR VALAYAARWP EVLHGLLLRS PFLGTAAETR RYIAPWRPWL GAEGQAWLGE PAATAVAALY QAEPGLLHIG AMQADERIAR AWSAFDDAQS APGGVAASGA RCDPAALPAA TPQLMASWRV HAHYAAASWG AAAAGAAGVP ALNSGVPVSV VWGAADATCD PAVARALAAA LPGALSNEVP EAGHRMSDPR LAPALRAAAR DWALRCRGSG
|
| |