Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2193 |
Symbol | |
ID | 4784798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2348733 |
End bp | 2350757 |
Gene Length | 2025 bp |
Protein Length | 674 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090761 |
Product | hypothetical protein |
Protein accession | YP_001021384 |
Protein GI | 124267380 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.189847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA CGATCCTCCT CACCGCCTCG CTGCTCACCG GCCTGCTGAT GCTGCAGGCC TGCGGCACGC CTGCGGCGCG CTCGCCGGCC GCCGGCACGC TGCTGCCGAA CGAGAACCTC GTCGTGCAGG GCATTCCGGC CATCCCGGTG TCGCTGGTGC AGCAGGTCGA GCGTTACACC GATTTTCGCG GCCATGCCCT TGCCGACTGG CATCCGGCAC GCGACGAGAT GCTGGTCTCG CACCGCCCGG CCGGCGCCAA CACCGCGCAG CTGTTCCGTG TGCCCGGCCC GTTGGCGGCC CCGGTCGCGC TGACCGACTT CGCCGACCCG GTGACCGAGG CGAGCTACGA GCCAGTGCTG GGCCGCTACC TCGTGTTCTC GCGCAGCCCC GGCGGCAACG AGGCCGAGCA GCTGTATCGG CTCGATGCCG GCAAGCGCCA ACCCACGCTG CTGACCGATC CCGACCAGCG CCACGACGCC ATCGGCTGGC TGCACCAGAG CGCGCAGTTG CTTTACACCG CGGTACCGCT GGACCGCACC GCCGAGGGCG GCAGCCGCGC AACGCCCACC ACCACGCTGT GGCTGGTCGA TCCCCTGCAG CCGTCGTCGC GTCGCCGCAT CGCCGAGCTG CCCGGCACCG GCTGGTTCGG TGGCGAGATC TCGCGTGACG ACAAGCAGTT GGCGATCACG CGCTACCGCT CGGCGACCGA TTCGCAGGTC TGGCTGATCG ACCTCGCCGA CGGCACGCCG ACGCAGTTGC TGCCGGCCGT CGGCGAACCC TTGCAGGCCA CGCACTTCCC GGTCGGCTAC TCGCCGGACG GCGCCTCGCT CTACGTGCTG AGCGACCGCG CCGGCGAGTT CCGCGAGCTG ATGCGCTACG ACTTCGCGAG CCGCGCGCTG ACGCGGCTCA CGGCGGCGAT TCCCTGGGAC GTCGGCGCGG CCCAGCTCAG CGACGACGGC GCGCTGGCGG CGCTGAAGAT GAACGTCGAC GGTCGCGAGG AACTGCGCGT CGTCGACACC CGCACGCTGC GCGAACGAGC GCTCCCCGCG CTGCCGGCCG GCGGCGTGAA TGCGGTGCGC TTCCAGCCGC GCACCCACCG GCTCGCGGTG GCGATCGACA GCGCCCAGGG GCCGGGCCGC CTGTTGGCGC TCGACGTCGA CCGCGGCACG GTGCAGCCGT GGACACGGCC CCACGTGCCG CCGGGCCTGG ACACCACCAG CTTCCCCGAC CAGCGCATCG TGCGCTGGAC CAGCTTCGAC GGCCTGAGCA TGTCGGCCGT GCTCAGTCCG CCGCCGGCGC GCTTCACCGG CAAGCGGCCG GTCGTGGTGC TGGTGCACGG CGGCCCCGAA GCGCAAGCGA CGATGGGCTT CCTGGGCCGC TGGAGCTACC TTGTCAACGA GCTCGGCGTC GCCATCGTCG AGCCCAACGT GCGCGGTTCC TCGGGCTATG GCAAGACCTT CCTCGCGCTC GACAACGGCA TGAAGCGCGA GGACGCCGTC AGGGACCTCG GCACGCTGCT CGACTGGATC GCCACGCAGC CGGACCTCGA CGCGGGCCGC GTGCTGGTGG TCGGCGGCAG CTACGGCGGC TACATGAGCC TGGCCGCGAG CGTGCACTTC GCCGACCGCA TCGCCGGTGC GATCGACATC GTCGGCATCT CCAGCTTCGT GAGCTTCCTG AACAACACCG AGAGCTATCG GCGCGACCTG CGCCGCGTGG AATACGGCGA CGAGCGCGAT CCGGCGATGC GCGACTTCCT CGAACGCATC TCGCCGCTGA ACAACGCACA GAAGATCCGC AAGCCGCTGT TCGTGATCCA GGGTCGCAAT GACCCGAGGG TGCCGTGGAC CGAGGCCGAA CAGATCGTCG AACGCGTCCG GCAGACCGGC ACGCCGGTCT GGTACCTGCT CGCCGAGAAC GAGGGGCACG GCTTTCGCCG CAAGGAGAAC GCCGACTACC AGTTCTACGC GATGCTGCTC TTCATGCAGG AGACGCTGCT GAAGTCGGCC GGGCGCCAAG ACTGA
|
Protein sequence | MPRTILLTAS LLTGLLMLQA CGTPAARSPA AGTLLPNENL VVQGIPAIPV SLVQQVERYT DFRGHALADW HPARDEMLVS HRPAGANTAQ LFRVPGPLAA PVALTDFADP VTEASYEPVL GRYLVFSRSP GGNEAEQLYR LDAGKRQPTL LTDPDQRHDA IGWLHQSAQL LYTAVPLDRT AEGGSRATPT TTLWLVDPLQ PSSRRRIAEL PGTGWFGGEI SRDDKQLAIT RYRSATDSQV WLIDLADGTP TQLLPAVGEP LQATHFPVGY SPDGASLYVL SDRAGEFREL MRYDFASRAL TRLTAAIPWD VGAAQLSDDG ALAALKMNVD GREELRVVDT RTLRERALPA LPAGGVNAVR FQPRTHRLAV AIDSAQGPGR LLALDVDRGT VQPWTRPHVP PGLDTTSFPD QRIVRWTSFD GLSMSAVLSP PPARFTGKRP VVVLVHGGPE AQATMGFLGR WSYLVNELGV AIVEPNVRGS SGYGKTFLAL DNGMKREDAV RDLGTLLDWI ATQPDLDAGR VLVVGGSYGG YMSLAASVHF ADRIAGAIDI VGISSFVSFL NNTESYRRDL RRVEYGDERD PAMRDFLERI SPLNNAQKIR KPLFVIQGRN DPRVPWTEAE QIVERVRQTG TPVWYLLAEN EGHGFRRKEN ADYQFYAMLL FMQETLLKSA GRQD
|
| |