Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0152 |
Symbol | |
ID | 4787755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 129500 |
End bp | 131260 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640092560 |
Product | hypothetical protein |
Protein accession | YP_001023165 |
Protein GI | 124262695 |
COG category | [S] Function unknown |
COG ID | [COG4676] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.621521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000148726 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAAGC GATTTCTGAC CGCGGCGTGC GCCGCTCTGG CCGTCGTCAC GGCCGCCCCG ACCTTCGCTG CCCCCGCGCC CACGCCCTCG GCGTGCGGTG ACCAGGGCCT CTACGTCGGC TTCTACAACG GCGTCTGGAG CACCAGCGCC GATGCCCTGC GCGGCATGGG CGCGCTGCGT GCGCAGCTGG GGGAGACCTA TCGCGGCGAG CACATCCGCT ACGAGCTCAT GTACAACCCG ACGAACGGCT GGGAAGGCAT CGCGCAGGCA TTCGATCAGC GGCTGGCGCA AGAGAACGCC GTGCTGACCA ACCGATGGGA GTCGTTCTGG GACACCCTGC AGGGCCGCGG CCCGGGCGGT GCCGGCTTCC TCGCCACGCT GGGGCAGACC TTCAGTTCGA TGGCCAGCCT GGCCAACGAC CTGCGCGCCA TCGTGACGAC GGCCTCCGTC AACTCGGTGA CTTCGCTGGC GGCCAACCCG CCGACGGCGC GCGACTACGC CGAGCAGAAC ACGCGTCTGG ACACGATGAC GCTGGAAGGC AGGAAGCTGC TGCTCGTCGC GCACTCGCAG GGCGACCTGT TCCTTGGACA GGCCTACCAG CACCTCGACG CCGGCGTCGC CGCCGGACGC GTGACGGCCG GCTCCTATGC CGCGCTGCGC ATTGCCCCGG CTGGCTCCTA TGCTCCGGCG CGCAGCCAGC ACGTGCTGGT CTCGCAGGAT CAAGTGATCA ACGCTCTGCG CCAGGCGGCG AACGTGCCGG CCAACAACGC CACGGTGCCG ACCTACCAGG AGCAGATCGC CCTCGGCGTG TCGGACCCCG ACATCAGCGC TGACCTGCTC GCGGAGACGT ACCTGAACGC CATGTTCACC GGTGCCGGCG CGCCAGCGCC GATGATCAAG TCCGCGGCTT ACCAGCTGAT CGACGGCCTC ACGACGCCTT CGCAACTGGC GCGCTCCGGC TTCTTCACCG CCACGCTCAA CTGGGACCGC AACGGCGACG TGGACCTGCA CGCCTTCGAG CCGAACGGCG CTCATGTCTT CTACGGCATG CCCAACGGCC AGTCGGGGAT GCTGGACTAC GACGACACGA GGACGACCGG CCCCGAGCAC TACACGACCA CCTGCGATGC GAGCCAGCTC GCCGAAGGCA CCTACACGAT CGCCGTCAAC AACTTTGGCG CCCCGGCCGG CACCCAGGCT GTCGTTCAGG TCGCCACATG GGCCGACGGT CCGCTGCTGA CGGCGCCGGT GCTCACGTTG GGCCAGCAGG CAGGCAGCGC CGGCAACGCG AATCCGACGC CGGTGATGAC CGTCACCGTG GCCAAGGATC CGGCGACCGG CCTGTTCAAG GTGACGGCGG CCGCGGCCAA CGGCGATGGC CCCGTGACCG GAACCCCGCA GACGCTCCAA CTGGAAGTGC AGATGGTTCC GCCGGCCGGA AAGGAAGCTC TCTGCGGAAC CGCATCCGCC TACCAGATCC GTGACTTCAC GAGCGACTTG GGCGGGCAGA GCTGGAACTG CACGCCGACG ACCTGGAACA TGCGCTTGAC GAACACGACG TCGTCCACGA TGGCGATCGG TTCCTTCGCT AAGTCGAGCA TTGCCTGGAC GACCGCATCC AACAGTTGCG GCGCCACGCT GGCGGCCGGC GAGAGCTGTG TTGTCGCCAT CAAGTCGCCG ACGTTCACGG CCAGCGAATC TTCCGGCATG AACGTGAGCT GGCCGACCGT CGGCATCCAG AAGTACGTCT ACATCTACTG A
|
Protein sequence | MNKRFLTAAC AALAVVTAAP TFAAPAPTPS ACGDQGLYVG FYNGVWSTSA DALRGMGALR AQLGETYRGE HIRYELMYNP TNGWEGIAQA FDQRLAQENA VLTNRWESFW DTLQGRGPGG AGFLATLGQT FSSMASLAND LRAIVTTASV NSVTSLAANP PTARDYAEQN TRLDTMTLEG RKLLLVAHSQ GDLFLGQAYQ HLDAGVAAGR VTAGSYAALR IAPAGSYAPA RSQHVLVSQD QVINALRQAA NVPANNATVP TYQEQIALGV SDPDISADLL AETYLNAMFT GAGAPAPMIK SAAYQLIDGL TTPSQLARSG FFTATLNWDR NGDVDLHAFE PNGAHVFYGM PNGQSGMLDY DDTRTTGPEH YTTTCDASQL AEGTYTIAVN NFGAPAGTQA VVQVATWADG PLLTAPVLTL GQQAGSAGNA NPTPVMTVTV AKDPATGLFK VTAAAANGDG PVTGTPQTLQ LEVQMVPPAG KEALCGTASA YQIRDFTSDL GGQSWNCTPT TWNMRLTNTT SSTMAIGSFA KSSIAWTTAS NSCGATLAAG ESCVVAIKSP TFTASESSGM NVSWPTVGIQ KYVYIY
|
| |