Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0301 |
Symbol | |
ID | 5832613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 336562 |
End bp | 338442 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641366086 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001637796 |
Protein GI | 163849753 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.111113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATC TGATCCCGCG CGCGCATCTC TTCGGCAACC CGACGCGCTA CGGTCATCAG ATCAGCCCCG ACGGGCGCCG CCTCGGTTGG GTGGCGCCCC ATGAGGGTGT GCTCAACATC TGGTCGGCGC CGATCGACGA CCTCGACGCC GCCGTGCCCG TCACCACCGA CCGGCGCCGC GGCATCGACG CCTACGCCTT CGCCTATGAC GGGCGCCACC TGCTCTACGT GCAGGACGCG GACGGCGACG AGAACCACCA CCTCTACGCC GTCGATCTCA CCACGGGCGA GCGACGCGAC CTGACGCCGA TCCCCGGCAT CGCTGCGGCG ATCGTGGGCC TCAGCCGCAT CGTGCGCGAC CGCGTGCTCG TCGCGATCAA CGACCGCGAC CCGCGCTTCC ACGACCTGCA CAGCATCGAT CTCGCCACCG GCGAGCGCAG CCTCGTGATC GAGAATCCGG GCTTTGCCGG TTTCCTGATC GATGAGCGCT ACGCGGTTCG CTTCGCCTTC CGCAATCTTC CGGACGGTTC GAGCCAGTTG ATCGCCCCGG ACGGCGCGAA CTGGAAGCCG TGGCTCACCT TCCCGCCCGA GGATGCCCGC GTCTCCGGCG CGGAGAATCT CGACGCCGCC GGCACCGCCC TGTTCTGCCG CGACAGCCGC GGGCGCAACA CCGCCGCGCT GACCCGCATC GATCTCGCCA CCGGCGAGAC CCGCGTGCTC GCCGCGCACG AGGAGGCGGA TATCGGCGCG GTGCTGCAGG ATGCCGAGAC GCACGAGCCG GTGGCCTACT CGGTCACCCA TGCCCGCAAA TCCTGGCACG TGCTCGACCC GCGTTTGACC GACGACTTCG CCTTCCTCGA AACGCAGGGG CTCGGCGACT GGTATCCGGC GAGCCGCACC GAGGACGATG CGCTCTGGAT CGTGGTGGCC CGCGCCGACA CCCGCGTCGG CGAGGCCGCG ATCTACGACC GGCGGGCAAA GACGCTGCGC TCGCTCGGCA GCGCCCGGCC GGAACTGGAG GGTGCGCCGC TCGCCCCGAT GAGCCCGGCG ATCATCCGCT CCCGCGATGG GCTCGATCTC GTCTCGTATC TCAGCCGCCC GCTCGATGCG CAGGCCCCCG GCCCGCTGGT GCTGCTCGTC CATGGCGGCC CGTGGGCGCG AGACAGCTTC GGCTTCGACG GCCTCCATCA ATGGCTGGCC AATCGCGGCT ATGCCGCGCT CAGCGTCAAC TTCCGTTCCT CGACCGGCTT CGGGAAAGCC TTCCTCAATG CGGGCGACCG CGAATGGGGT CGGCGGATGG ACGACGACCT CAGCGACGCC GTCGCCTGGG CGGTGGCGCA GGGTGTGGCC GATCCGGCTC GCGTCGCGAT CATGGGCGGC AGCTACGGCG GCTACGCCAC GCTGATGGCG CTGACCCGCA ACCCCGGATC GTACGCCTGC GGCATCGACC TCGTCGGCCC GGCCAACCTC GAAACCCTGG TGCGGACGAT CCCGCCCTAT TGGGAGGCGA TGCGGGCGCA GCTCCACCGC GCCATCGGCG ATCCCGACAC CGAGGAGGGC ATGGCGCTGA TCCGCGAGCG CTCCCCGGTC TACTTCGCCG ACCGAATCAA GGCGCCGCTG CTGATCGTGC AGGGGGCCAA CGATCCGCGG GTGAAACAGG CCGAGTCGGA CCAGATGGTC GCGGCCATGG AGCGCGGCGG CATTCCCGTG ACCTACCTGC TGTTTCCGGA CGAGGGCCAC GGCCTCGTGC GCCCGGCCAA CCGGCTGGCC TTCTTCGCGC GGGCGGAAGA GTTCCTGGCG CGCCATCTCG GCGGGCGCTG CGAGCCGATC CGCGAGGATG AATCCGCCGG GACGTCGATG CAGGTGGTGC GGGAGGGATA G
|
Protein sequence | MVDLIPRAHL FGNPTRYGHQ ISPDGRRLGW VAPHEGVLNI WSAPIDDLDA AVPVTTDRRR GIDAYAFAYD GRHLLYVQDA DGDENHHLYA VDLTTGERRD LTPIPGIAAA IVGLSRIVRD RVLVAINDRD PRFHDLHSID LATGERSLVI ENPGFAGFLI DERYAVRFAF RNLPDGSSQL IAPDGANWKP WLTFPPEDAR VSGAENLDAA GTALFCRDSR GRNTAALTRI DLATGETRVL AAHEEADIGA VLQDAETHEP VAYSVTHARK SWHVLDPRLT DDFAFLETQG LGDWYPASRT EDDALWIVVA RADTRVGEAA IYDRRAKTLR SLGSARPELE GAPLAPMSPA IIRSRDGLDL VSYLSRPLDA QAPGPLVLLV HGGPWARDSF GFDGLHQWLA NRGYAALSVN FRSSTGFGKA FLNAGDREWG RRMDDDLSDA VAWAVAQGVA DPARVAIMGG SYGGYATLMA LTRNPGSYAC GIDLVGPANL ETLVRTIPPY WEAMRAQLHR AIGDPDTEEG MALIRERSPV YFADRIKAPL LIVQGANDPR VKQAESDQMV AAMERGGIPV TYLLFPDEGH GLVRPANRLA FFARAEEFLA RHLGGRCEPI REDESAGTSM QVVREG
|
| |