Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2334 |
Symbol | |
ID | 4783851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2503888 |
End bp | 2505546 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090903 |
Product | hypothetical protein |
Protein accession | YP_001021525 |
Protein GI | 124267521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC TGACGCCCCA GGACATGGCT GCCAAGTTGT TGACCACCGG CTTCGAGCGC AGCGGCCCTT CGGCCGCGGC CTTGAGCGAC CCCATCGCCG ATACGCCGAT GGTGGTGACG CTGGACCAGT TGCGGCCCTA CGACCACGAC CCGCGCGTGA CGCGCAACCC GGCCTATGCG GAGATCAAGG CGTCCATCCG CGAACGCGGG TTGGACGCGC CCCCTGCGAT CACACGCAGG CCGGGCGAGG CGCACTACAT CATTCGCAAC GGCGGCAACA CGCGGCTGGC GATCCTGCGC GAGTTGTGGA GCGAGACCAA GGAGGAGCGC TTTTTTCGCA TTGCGTGCCT GTTCCGCCCG TGGCCGGCGC GCGGCGAAAT CGTGGCGCTG ACCGGGCATC TGGCCGAGAA CGAGCTGCGC GGCGGGCTGA CCTTCATCGA GCGGGCCTTG GGCGTCGAGA AGGCGCGCGA GTTCTACGAG CAGGAAAGCG GCCAGGCGCT GTCGCAGAGC GAACTCGCGC GGCGGCTGAC TGCCGACGGC TATCCGGTGC CGCAGTCACA CATCAGCCGC ATGAACGATG CGGTGCGCTA TCTGCTGCCG GCGATCCCGA CGCTGCTGTA CGGCGGATTG GGCCGGCATC AGGTGGACCG GCTCGCGGTG CTGCGCAAGG CGTGCGAGCG CACCTGGGAG CGGCGTGCGC TGGGCCGCAC CGTGACCGTG GACTTCGCCA CCTTGTTCCA CGACGTGCTG ACGCAGTTCG ACACACAGCC GGACGACTTC TCGCCGCAGC GGGTGCAGGA CGAGCTGGTG GGCCAGATGG CCGAGCTGCT GGAGGCGGAC TACGACACGC TGGCGCTGGA GATCAACGAC AGCGAAAGCC GCCAGCGTGC GCTGACCAGC GAACCGGCGG CGCCGACGCC ACCGGCAGCG CCTTCCGTGC CTTCTGCTCC TCCCCCGCCG GTCTCCGCGC CTCAGCAGCC ACCCGCCTCG TCTGTGCCGC GCGACACCAC GCCAGCCGCG CCTCCGGCGC CAGCAGCAAC ACCGCCTGCA CCGCCCGAAG CGCCGGAGGA GCAGCACGGG CAGCGCGACG AGCGCCTGCA AGGGCACATC GTGACGCCGG CGCCGACCAC CGAGCGCCTG CAGTCCATCC AACGGATGGT CGCGGACCAG CTCGGCGACA AGCTGCCCGA CTTCGAGGCC GATGCGCTGC GTGCGATCCC CGTGCAGGCG GGCGGGCTCT ATCCCATCTC GGACGTCTGG TACATCGAGC CGAGCTTGGA CGTGCCGGAT CGCCTGCGCG TGCACATCGC GCAATTCGCG CGCGAGATCG CCGGGGACGC AGCGGTAGCC GACCACATCG AGGCCAGCGA CGGCGGCATC GGCTTCGTCT GCGTGGCGCC GGCCGTGGGC CAGGCGAAGG CGCTGCCGGT GTTCGCGCGG GCAGTGCTGA CCCTGCTGCA TGCGCTGAGT GCAGCGCCGC CCGCCGCGAA CGGATTGGAC CGCGCGCGGC TGGCCGACGA GCTGGCGGCG CTGCTCCATG GCCATGGCGG CTCGGCCACA CGCCTGAGCG ATGCTGCGCT GGTGAAGCTG TTCCGTCTGC TGCGCCTGGC GCGCCGGCTG CTGGATCTGG AAGCCGGTGA CCCGGGCCAC GAGTCCTGA
|
Protein sequence | MAELTPQDMA AKLLTTGFER SGPSAAALSD PIADTPMVVT LDQLRPYDHD PRVTRNPAYA EIKASIRERG LDAPPAITRR PGEAHYIIRN GGNTRLAILR ELWSETKEER FFRIACLFRP WPARGEIVAL TGHLAENELR GGLTFIERAL GVEKAREFYE QESGQALSQS ELARRLTADG YPVPQSHISR MNDAVRYLLP AIPTLLYGGL GRHQVDRLAV LRKACERTWE RRALGRTVTV DFATLFHDVL TQFDTQPDDF SPQRVQDELV GQMAELLEAD YDTLALEIND SESRQRALTS EPAAPTPPAA PSVPSAPPPP VSAPQQPPAS SVPRDTTPAA PPAPAATPPA PPEAPEEQHG QRDERLQGHI VTPAPTTERL QSIQRMVADQ LGDKLPDFEA DALRAIPVQA GGLYPISDVW YIEPSLDVPD RLRVHIAQFA REIAGDAAVA DHIEASDGGI GFVCVAPAVG QAKALPVFAR AVLTLLHALS AAPPAANGLD RARLADELAA LLHGHGGSAT RLSDAALVKL FRLLRLARRL LDLEAGDPGH ES
|
| |