Gene Information |
Locus tag | Mpe_A0040 |
Symbol | |
ID | 4783797 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 38997 |
End bp | 40706 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640088587 |
Product | paraquat-inducible protein |
Protein accession | YP_001019237 |
Protein GI | 124265233 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
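The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are conventions and are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 38997
end_bp = 40706

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1710 bp) and Protein Length (569 aa).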
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00684202 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.214843 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCGCA TGAGCGACAC GCCGCCGCCC TCCTCTTCCG CCCCTGCGTC CCTGCCGGCG ACGGCCGAGC TGCCTGAAGC GGTGGCGGTG CCGCGCCGCC GCAGCCACGG CTATGCGGTG TGGCTGATCC CCATCCTTGC GGCATTGATC GGCCTGACGC TGGGCGTGCG CGCCTGGCTC GCGACCGGAC CCACGGTCAC CATCACCTTC AAGACCGCCG AGGACCTGGA GCCGGGCAAG ACCAAGATCA AGTACAAGAA CGTCGACATC GGCGACGTGA AATCGATCGC GCTCGCCCCC GACGGCAAGG GCGTGGTCGT CACCGCCGAG ATGACCCAGG GCGCGCAGAA GCTGCTGCTC GACGACACGC GCTTCTGGGT GGTGCGGGCT CGCGTGGCGG GCAGCGGCGT CACCGGCCTC TCCACACTGC TGTCGGGCGC CTACATCGGC ATCGACATCG GTACCTCCTC CGAGTCGCGG CGCGAGTTCG TCGGCCTCGA CGTGCCGCCC AGCGTCACCG CTGACGAGCC GGGGCGCGAG TTCATCCTGC GAAGCGCGGA TCTCGGCTCG GTCGACATCG GTTCACCGGT CTATTTCCGC CGTGTGCAGG TCGGCAAGGT CACCGCGACG CGGCTCGACG CCGATGGCCG CGGGGTCACG CTGCGGGTGT TCATTTCGGC GCCGCATGAG CAGCACGTGA CCGAGAACAC GCGCTTCTGG CATGCCAGTG GCCTGGACGT CGCGCTCGAC GCCGGCGGCA TCAAGCTGCA GACGCAGTCG CTGATCTCCA TCGTGCTCGG CGGCATTGCC TTCGGCGCGC CGGAGTCCGG TGCGCCGGGC GCGGCGGCCG AGGCAGAGCA CAGCTTCGCG CTGTTCGAGA ACCAGGCGCT GGCCATGAAG CGCACCAGCT CCGAGGTGAC GCCGATGGTG GCCCATTTCA CCGAATCGCT GCGCGGCCTG GCGCCGGGCG CGCAGATCGA CTTCCGCGGC GTGGTGGTCG GTGAGGTCAC GTCGATCGAC GTCAGCTACG ACCGCGACCG CAGGGAGTTC AGCTTCCCGG TCGGCATGGT GCTGTACAGC GACCGGCTCG GCTCCAAGGT GAGCGCGGCG ATCGAACGGG CGCAGGTCGA CCCGACAGAG CTGATGCGCC GCTCCATCGA GCGCGGCCTG CGCGCGCAGC TGCGCACCGC CAACCTGCTG ACCGGCCAGC GCTACGTGGC GCTCGACTTC TTCCCCGGCG CGCGGCCGGT CAAGACGGCG GCCGCCCCGG CCGCCGGCAG TGAAGCCCCG GTGCTCGAGA TCCCCACCAT CTCGGGTAGC CTCGAGGACC TGCAGGCCAC GCTGACCAGC ATCGCCAAGC GCATCGAGAA CGTGCCGTTC GACGAGATCG CCGCCGACCT GCGCACCGCG CTGCAGTCGC TCGACCGCAC GCTGAAGAAC ACCGACGGGC TGATCGGCCG GCTCGACGGC CAGCTCGGCG AGCTGGTGCC CGAGCTGAAG GCGGCGATCG CCGACACGCG TCGCACCATG AAGTCGGCCG ACAACCTGCT CGCCAGCGAC ACGCCGACCC AGCAGGAGCT GCGGGAAGCG CTGCGCGAGG TGTCGCGCGC CGCGCAGTCG ATGCGCGAAC TGACCGACTA CGTGCAACGC CGCCCGGAAT CCCTGCTGCG CGGCAAACCC GGCGAGCCCG AGGAAGGCGG CAAGCGATGA
|
Protein sequence | MGRMSDTPPP SSSAPASLPA TAELPEAVAV PRRRSHGYAV WLIPILAALI GLTLGVRAWL ATGPTVTITF KTAEDLEPGK TKIKYKNVDI GDVKSIALAP DGKGVVVTAE MTQGAQKLLL DDTRFWVVRA RVAGSGVTGL STLLSGAYIG IDIGTSSESR REFVGLDVPP SVTADEPGRE FILRSADLGS VDIGSPVYFR RVQVGKVTAT RLDADGRGVT LRVFISAPHE QHVTENTRFW HASGLDVALD AGGIKLQTQS LISIVLGGIA FGAPESGAPG AAAEAEHSFA LFENQALAMK RTSSEVTPMV AHFTESLRGL APGAQIDFRG VVVGEVTSID VSYDRDRREF SFPVGMVLYS DRLGSKVSAA IERAQVDPTE LMRRSIERGL RAQLRTANLL TGQRYVALDF FPGARPVKTA AAPAAGSEAP VLEIPTISGS LEDLQATLTS IAKRIENVPF DEIAADLRTA LQSLDRTLKN TDGLIGRLDG QLGELVPELK AAIADTRRTM KSADNLLASD TPTQQELREA LREVSRAAQS MRELTDYVQR RPESLLRGKP GEPEEGGKR
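The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which for elongation codons match the standard genetic code. The alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGGGGCGCA TGAGCGACAC GCCGCCGCCC"
print(translate(gene_prefix))
```

Applied to the first 30 bases, this yields the first ten residues of the listed protein sequence (MGRMSDTPPP). Passing the full gene sequence to `translate` would be the natural way to check the remainder of the record.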