Gene Information |
Locus tag | Mpe_B0623 |
Symbol | |
ID | 4787466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 580633 |
End bp | 583008 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640093044 |
Product | hypothetical protein |
Protein accession | YP_001023622 |
Protein GI | 124263152 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
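The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is what the listed values imply) and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Mpe_B0623.
length = gene_length_bp(580633, 583008)
print(length)                     # 2376
print(protein_length_aa(length))  # 791
```

With the listed start (580633) and end (583008), this reproduces the reported gene length of 2376 bp and protein length of 791 aa.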
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.103441 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCG CACCGCAAGC GACGCAATCG TCCCCCGCCG CCTCGCAGCC GCAAGGCGTG ACCCGCCACG TCACCGACGA AGGCATGGAG AAGGCGCTCG GCATCGGCCA GGGCGGCAAG CGCCTGAACA CGCACGCGCA CGACCTTCAG GAGCGCATCG ATGACGCCCG CGACGACTTC CTCGAGGTGC GCACCTTCTA TGTCGGCGGC CGTGACCGGC ACGACTTCCT AAAAAACGGG CTGCTGGGCG AAATCGTAAG CCGCACGCCC ATCCTCGTCT ACGACCTGCC CGAGCTCAAG GCGTTCTGCA ACACCGCCTT CGTCGACCGC TCCGGCAAGA TGTACATCGC CGACACGTTC TGCCGCCGGC TCCTCTCAGA GCATGATGCA GGCCTGGACT CGTTGAACTA CGTCTTCCGC CACGAGGCGG ACCACCTGCG TCGCCTTCAC CTGGCGCGCA TGCTGGACTA CCCGCACTCG GTGGCGAATG CCGCGACGGA CACGCGGATC AACATCGACA TCGTGAAGGG CGAGGCGGCC GAGCGCGCGG AGGCCGAGAA GGGCTCCTCG CTCAACGACA ACGAGGTCCG TGAGGCCATC AAGAAGTACC TGGCCGAGCA CGCCCAATCG AGCATCAGCG TCTTCTACGG GATGACCTTG GAGGAGCACG TCAAGTACGA CGGCATGTCC GAGGAGGCCA TCGCGGCACT GATGATGAAG GACTGGAAGG AGCCGCCCCC GCTGCCCAAT CGCGAGGTGT CCTTCGAGCA CATCATGGAG GGCGCCGCCC AGGAGGCGGA CAACGTCAAG GGCATGCTCC TGGCGGGGAA GCCGCTGGCG CCGACCGCCC CTCCCTACGT CATGACGCCC AACGAGCTGT CGGGCCTGGC GCAGGACTTG CGGACCATCG GCAAGGCCAA GGCGAACCCG TCCAAGGTGA GCGACCAGGA CCTGCAGTCG GCCTACGACC GCCTGGCCAA GCTCCGGGAA CACCAAGGCC TGATCGAGCT CGACAACCAG CACATCCGGG CAGCCATGGG TCTCCTCGGC AAGGGAGCCA GCCACTCCTC CGGCAAGACC GGCGACGCCT ACCTCGACAT GCTCAAGCCC TCCGAGCGCG TCGAGATGGC GATGAAGATT CTCGAGAAGA TTCTGCAGCC GCAGAAGTCC AACGGGATGC CGCAGCAGCC GCAGAACGGT GGCCTCACCA TCAAGGACCT CGAGCGCGCC ATGGGCCGCG GCGGCGCGCC GAATCCCGGC AACGGCAACT CGCAATCCGG TGGCCAACCC GGCGACCAGG CGGGCGCCCA GGATGGCAGC GGCACCGAGG ACATGGTGCC GGCGCCGACG GTCACGCACG GCCAGGACCA CGTGATGAGC ACCGAGGACC TCGCCCAGGC GCTGCACGAC GCCGGCGTGT CGTCGGACAC CATGGCCAAG CTCGGCTTCG ACGACCTCAA GAAGATTCCG GAGGAGGTCA AGCACGCGAA GGATGGCGTG GTCTCGGCCA TCAACAAGGC CTCCGAGGAC CAGATGAAGG TGGGCTCGCG CTACCCGGGC GGGCACCTGC TGCACTACGC CAAGGCGCAG ATGCTGGACT TCTTCAAGCC GGTGCTGACC TGGGAGATGG CCCACAAGAA GCTGCTGGAG GCCTGCGGCA AGGGCTCGCG CTACGACCCG ACGGAGCCGT GGACGCTCTA CCACGTGGAC GCGGCCGACA TGGGCTTCAA GCACCAGCGC GACGTGCCCT TCATGGGCAG CCGCATGCCG GGCAAGGAGC AAAAGCCGCT GATGTTCGAC 
ATCATCGACA CCTCGGGCTC CGTCGATGAC GCCATGCTCA AGCGCTTCGT CTCCGAGGCG CTCAACCAGG CCCGCCGGGT CTCTCGCGGT GTCGCGCCGG ACGTTCTCAT CAGCTGGGCC GACACCATCT GCCGCGGGGT GCCTGAGTTC ATCAGCGAGA AGAACTACAA GCAGTTCCTC ACCAAGGGCA TCAACTACGG CGGCCGCGGC GGCACCAACT TCCAGGCGGC CATCGAGAAC GTTCTCGAGA TGGTGAAGCC GGGCTCGAAG TCGGGCTACG CCAAGCGCAA CATCGACGCC ATCTGCTACA TGACCGACTC CGGTGACTCG GTGCCGGACC CGGCGCGGCT GTTGCGCAAG GCCCAGGAGT GCGGGCTGAA GAAGTTGCCC CCGATCCTCT TCCTGGTGCC CAAGTCCTGC TACGACGAGC GCTTCGCCAA GGAGGCCAGC AAGTGGGCGA CCGTCGTCTA CTTCCACGCC GGCCCTGGCG CCAAGCACAC GCAGAAGGTC GACATCAACG CGGCCGCCCG TGAGCAAGAT CAGAAGAACC GCAACCTGAA GGCCCCGCGG CCGTAA
|
Protein sequence | MATAPQATQS SPAASQPQGV TRHVTDEGME KALGIGQGGK RLNTHAHDLQ ERIDDARDDF LEVRTFYVGG RDRHDFLKNG LLGEIVSRTP ILVYDLPELK AFCNTAFVDR SGKMYIADTF CRRLLSEHDA GLDSLNYVFR HEADHLRRLH LARMLDYPHS VANAATDTRI NIDIVKGEAA ERAEAEKGSS LNDNEVREAI KKYLAEHAQS SISVFYGMTL EEHVKYDGMS EEAIAALMMK DWKEPPPLPN REVSFEHIME GAAQEADNVK GMLLAGKPLA PTAPPYVMTP NELSGLAQDL RTIGKAKANP SKVSDQDLQS AYDRLAKLRE HQGLIELDNQ HIRAAMGLLG KGASHSSGKT GDAYLDMLKP SERVEMAMKI LEKILQPQKS NGMPQQPQNG GLTIKDLERA MGRGGAPNPG NGNSQSGGQP GDQAGAQDGS GTEDMVPAPT VTHGQDHVMS TEDLAQALHD AGVSSDTMAK LGFDDLKKIP EEVKHAKDGV VSAINKASED QMKVGSRYPG GHLLHYAKAQ MLDFFKPVLT WEMAHKKLLE ACGKGSRYDP TEPWTLYHVD AADMGFKHQR DVPFMGSRMP GKEQKPLMFD IIDTSGSVDD AMLKRFVSEA LNQARRVSRG VAPDVLISWA DTICRGVPEF ISEKNYKQFL TKGINYGGRG GTNFQAAIEN VLEMVKPGSK SGYAKRNIDA ICYMTDSGDS VPDPARLLRK AQECGLKKLP PILFLVPKSC YDERFAKEAS KWATVVYFHA GPGAKHTQKV DINAAAREQD QKNRNLKAPR P
|
| |
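The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated in a few lines. This is a minimal sketch, not the method the database itself uses: it assumes the sequence is given 5' to 3' on the coding strand, strips the spaces used to group bases in blocks of ten, and translates with the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in the record.
first_30 = "ATGGCCACCG CACCGCAAGC GACGCAATCG"
print(translate(first_30))  # MATAPQATQS
```

The translated prefix matches the first ten residues of the listed protein sequence. Applying `gc_content` to the full gene sequence and rounding to a whole percentage is one way to compare against the reported GC content field.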