Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0076 |
Symbol | |
ID | 4787679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 66565 |
End bp | 68958 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092485 |
Product | hypothetical protein |
Protein accession | YP_001023090 |
Protein GI | 124262620 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.701551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00833618 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTTGACC CTTCTTCCGA GCCTCCCGGC GACGCCAGCG ACGCCCCGGC GAAGCGCAAG CGCGGCCGCC CGGCCGGCGT GAAGATGCGC ATCGCCGCCG ACCAGCTGCG GCACCACCAC TTCAGCTTTG TCCGGGAGAT CATCGAGCTG CCCGACCAGC CGCTGAAGAA GCCGTGGGAG CGGTATCTCG CCTTCGAGGG CGGGCCGAGC GATGAGCGGC ACTACGCTGC GCGCCTGCGC GAACTCGTCA AGCTGATGCG CTTTGCCGCC ACCGAACGCG GCCTGGCCGC GCGGGCCGAC GTCGCGCTTG CCGGCGTCAT CCCGCCGTCG CCACCGCCCT CCCCTGCCCT GCCGAAGCCG TCGGCGGCGG CACCGGTCCG GACGATCCCC GACCTGGATG AGTGGGTCGG GCAGCGCTGC GAGGCCCTCG GCATCGACGT CGACTTCCAG ACACAGGCCG AGTGGCTGGC CGAGTACGAG GAGGAGTTCG GTTTGAACGC GCCGCCGGTA CCGATGCCGG CCACCGCCCT GCCCTCGCCA TCGCCGGCGC CGGCCTCGAG CGGCCGCGCG CCCTCCCCTG CCCTGCCCAA GCGCCAGGAC CGTCTCGCCG CCCTGAACGA GCTCGCGCAT GCGCTGGCCA AGCCCCCTGC CCTTTCCGAC CCCCTGAGCG CCTGGCTGTC GGAAGACTTG GCGCGCCGTC TCGCCGGCAC GGAGGTCGAT GGCCGGCGCC TGCCTCTGCT CACGCTGGAC AACCTGATCA CCTTCGTCAA CCTGTACCAC TACCGCTGGT GGGAGCACGT GCCGCGGCTT GGTCAGGAGC GCGGTGATCG GCTGACTGCC TGGCTGGCTC CCCTCTCCGA CGCCCTGGGC CGACCTCTCA AGGAGGTGGC TCGAAGGCCA CAGCACGAGA TCCGCCTGGC GCGCGAGCGC GCGCTGCGCG GCAGCAAACG CTACGGCATG GTCCCGCTCG ATCAGCTCGC CGTTCCGCCC GACCTCAGCG GGCGCGATGG CACCTTCAGG TTGCCCGGGG TAAACGTCTG GGGCGTGGAC ACGGACTTGG ACGCGATCTT CAACTGGATG GCTCGATACG CGACTTCGCC GCGCACCTCC GCCTCCTACG GGTCGATCGT CGAGCGCTTC TACCTCTGGG CCGTCCTGGT GAAGCGCAAG CCGATGTCTT CGCTCACCGA GGGCGACATG CGGGACTACC GGGACTTCAT CACCCGCCCG CCCGCGGACT GGGTCCAGGA GCGCTTCGTC ATGCGCGGCG GCCCCGACTG GCGGCCGTTC CGAGGCCCCC TTTCCCCGGC GAGCCGCAAG CGCAACTTCA TCGTCATCTC GATGATGCTT AGCGCGATGG TCGAGAAGGC CGGCTACCTC AAGGGCAATG CTGCGGAGGG CGTGCTGCGC AGCCTCACCG TCCGCGGCCC AGCCATCAAC ATCGACCGCA CCTTCACCGA CGCTCAATGG GCGTACGTGA TGCGCCGGTG GGACGAGGAG TACGCCTCGT GCGGCCCGAG GCACGCCGAC GACGAGGAGC AGCCCTTCTG CCCCGACGAT GGTCACCCAG ACCAGGAGTT CACGCGCGCC GCCCTCCTCC GCCGTACCCG CTTGGTGCTC GAGCTGGGCG CAACGACGGG CCTGCGGCTT TCAGAGTTCC CCACCACGCG CCTCAAGTCG ATCGCTCGCC AAGTGGTTGA TGGCGAGGAA GTCTGGCTCA TGACCGTCCT CGGCAAGGGG AACAAGCTGC GCGAGGTGCT GCTGTTTGAC GACGTGCGGG AGATGGTCGA GCAGCACCAC CGCGACATGG ACATGATGGG CACGGCGTTC GACCCGAGCA ATACGCGCAA GTTGCGCACC CTGCACGAGG ACGACGCGCT CTCGTCCGAA CCAGTGGCAC CGCCCGCCGC GGAACCCGAC CCCTCAGCTC TTCTCCTGCC CGGTGCGCCC CGGGCCGCGG ACGATCCCGA CTGGACGCTA CGTCCGCTCA TTGGGGCCTT ACGCAAGGCC GGCCGCCGCT GGAAGCTGGA TGCCAATGGG GTCAAGGTGA TTGATGAGGA GTCGCCAAGG AACGCCGACC GCTACGGCTC GATCGATCCC TCAGGCCTCT ACAAGTCGCT GAAACGCTTC TTCGAACTCT GCAAGAAGGA CGCAGCCGAA CACGGCCTGC CTACCGAAGA CGCCAAGGCA TTTGCCTCTG CGTCGACGCA CTGGATGCGC CACTTCTTCG CAAACACCGC CCTCGCCGAC GGTGTCGCAG CCGAGGCCGT GAAGGACGCC ATGGGGCACA AGAGCCTGAA CACCACCTCG ATCTACCTAC GCACGGAGCG ACGGCGAATG GTCGAGCAAC TCGGCAAGCT GAAGCGTCGC GGCGCACCTG GTGGAGCCGG GTGA
|
Protein sequence | MLDPSSEPPG DASDAPAKRK RGRPAGVKMR IAADQLRHHH FSFVREIIEL PDQPLKKPWE RYLAFEGGPS DERHYAARLR ELVKLMRFAA TERGLAARAD VALAGVIPPS PPPSPALPKP SAAAPVRTIP DLDEWVGQRC EALGIDVDFQ TQAEWLAEYE EEFGLNAPPV PMPATALPSP SPAPASSGRA PSPALPKRQD RLAALNELAH ALAKPPALSD PLSAWLSEDL ARRLAGTEVD GRRLPLLTLD NLITFVNLYH YRWWEHVPRL GQERGDRLTA WLAPLSDALG RPLKEVARRP QHEIRLARER ALRGSKRYGM VPLDQLAVPP DLSGRDGTFR LPGVNVWGVD TDLDAIFNWM ARYATSPRTS ASYGSIVERF YLWAVLVKRK PMSSLTEGDM RDYRDFITRP PADWVQERFV MRGGPDWRPF RGPLSPASRK RNFIVISMML SAMVEKAGYL KGNAAEGVLR SLTVRGPAIN IDRTFTDAQW AYVMRRWDEE YASCGPRHAD DEEQPFCPDD GHPDQEFTRA ALLRRTRLVL ELGATTGLRL SEFPTTRLKS IARQVVDGEE VWLMTVLGKG NKLREVLLFD DVREMVEQHH RDMDMMGTAF DPSNTRKLRT LHEDDALSSE PVAPPAAEPD PSALLLPGAP RAADDPDWTL RPLIGALRKA GRRWKLDANG VKVIDEESPR NADRYGSIDP SGLYKSLKRF FELCKKDAAE HGLPTEDAKA FASASTHWMR HFFANTALAD GVAAEAVKDA MGHKSLNTTS IYLRTERRRM VEQLGKLKRR GAPGGAG
|
| |