Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0057 |
Symbol | |
ID | 4787660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | - |
Start bp | 46539 |
End bp | 49397 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640092466 |
Product | hypothetical protein |
Protein accession | YP_001023071 |
Protein GI | 124262601 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.889998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00629192 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTCCG ACGCCGTCAA CTTCAAGCTC GAGCGCAAGC AGTCTTCGCT CATCTACGAC ACCCGCAAGC TGGGTGACCG GATCAATGAC GCGCTCGCCG ACGAGACCGT GCCGGTCGTC GTCATGGCAA GTCTGGGAGC CGCCTCCTTC TTCGTGCCGG CCCTGGTAGA GGTCATGGTC CTCGTCGCGC TGGGGTTTTA CTGGATGGCG CGCCGGGCGC AGAAGCGAGC AGGCCTGGCG CTCCGGATGC CCAAGAGCAG CGGCGAGATT GACCCGAAGG AAATCAGCCT CAAGGACGGG AAGCCGAGCC GCGCCGAGGG CATCGTCTAC ATGGGCAATG ACCTCGACAC CGGCGAGGAG GTCTGGCTAA CGGACGTGCA GGCTCGCACG CACATGCTCT TCATGGGCAC TACTGGCTCC GGCAAGACGG AGTTCTTGGT CTCGCTCGTC TACAACTCGC TCATCCACGG CTCGGGACTG ATCTACGTCG ATGGCAAGGC CGACTCCTCG CTGTACGGCA AGGTGTACTC GATGGCCCGC GCGATGGGGC GCGAGGACGA CGTGCTGGTC ATCAACTTCC AGACTGGCGC CAAGGACATC TACGGCGCCC AGCCGAACAA GCTCTCCAAC ACGCTGAACC CCTTCGCCGT CGGCTCCTCG GGCATGCTGA GCGAGCTCGT GAAGGGCCTG ATGGCTACCG GCGAGAAGTC AACCTGGACT CAGCAGGCTG AGTCCTTCGT CGAAGCCCTG ATGAAGCCGC TCGTCTACCT GCGGGACAAG CACGGCCTCA ACCTCGACGT GAACGTGGTC CGTGAATACT TCGAGCTCAA CAAGCTCGAG GACCTCGCTT GGCGCGACGG TGACAAGTAC CCTGGACTCA GCGAGTCCGG TGTGCTGGAC GGCCTGCACA ACTACCTGTT GACGAAGCCG GCGTACAAGA AGGAGAAGTA CCACGACCAG AGCGAGACGA CGAACGAGCA GCACGGCTAC ATCACCATGC AGCTCATCCG CACGTTCAAC TCGCTGTCGG ACACCTACGG CTACATCATG AAGACGCAGC TGGCGGAGAT TGACTTCGTC GACGTCTTCC TGAACCGCCG CATCCTTGTC GTGCTGCTGC CGGCGCTCGA GAAGTCGCCG CCCGAACTGA CCAACCTCGG CCGCATCGTC GTAGCCTCCA TCAAGGCGAC GATGGCCAAG GGCCTCGGCT CGGCACTCGA GGGAGACTGG AAGAAGATCA TCGACGCCAA GCCCACCAAC GCGCCGAGCC CGTTCATGTG CGTGCTCGAC GAGTACGGCT ACTACGCCGT CGAGGGCTTC GCTGTGGTGC CGGCGCAGGC GCGCTCATTG GGCTTCTCCG CAATCTTCGC CGGCCAGGAC CTTCCGGCCT TCGAGAAGGC GTCGAAGGAA GAGGCCGCCT CCACGCTTGC GAACACGAAC ACGCGACTGT GCGGCAAGCT CGAGTGCACG AAGACCTACG ACTTCTTCAA GAACATCGCG GGGCAGGGCA TCTACACGAA GACCTCCCGC TACGAACACG ATTCCGGGGC CATGGGCCCG ACGGCGTTCA AGGCCGACGA CGGCATCAGC ATCGACCGCG TGGAGCGTGT CAGCTTCGAG TCGCTCCGCG GCCAGCTTTC CGGGTACTGG CACCTTTTCT TCGCGAACAC CATCGTGCGC GTGAAGTCCT TCTTCGCCAA CCCGGTGCCG GTGGGAAAGC TGCGCGTCAA CCACTTCATC AAGGTCGCTC GCCCTGACCC GGAGGAGGTC GATGCCTACA AGTCCACGAC GCACCTGTTC ACGCGCGCCG TGACGGCCGA GGGCGGAATC GCCTCCTACA TGGACCGCGT CGCTGACCAG CAGGACATCC GCGACATCGG CGAAGGGTTC GAGAAGTTCA AGCTGCAGGA CTCGGCGCTG CACAACGCTG CTCGCGTGCT GGCCTATTGC TCGCAAGCAG AGGCACACCG CGCGAAGGCC TTTGCCTCGG CCATCGACAT CGGACTCTTT GCCACCGCGG GCGGCGACGA CGATGCTACG CCTGACTTCG TGAGCCAGAT GAGTGGCACG GTCGCGCACG CACCGCAGGT CGATGACGAC ATGCTGGCCT TCATCGATCC GGCAGGAGAC GGCGAAGACC AATCGCAGGA GACCGCTCCT GAGGTCACGT CCTACGGCAG CGAGCCCGAG GTGCCCGAGC GGCTCTTCGA CGATACGCCT GCCGAGGTGC GCCCGGCCGA GCAGGAGCAG CGCGGCTTCA CGGAGGATGC AGACTTCGGT CGTCATCCCG TGGCGCTTGT CGACCCTGAC CTGGGAGCCG ACGACCTGTA CGACGACGAT GTGCCTGCCG ACGTGGTCGC GAGCGCTGAG CCCGAAGAGG ACTGGGGTAG CTACGGCGTA GAGCAGCCGG CCGCGCGCGC CGCACTGATT CCGGAAGGGG TCGGAGAGCT CGTCACCCAT GACCACACGA CCCTCTTCGG GGCCGAGGGC ATTGACGTTC CGCCGGCGGT CGCGATGGCC GACAACGATG TGGAGGCCGA AGCCCAGGAC CGGCCCGTGG ACGAGGTCGG ACAAGAGGGC TTCGACGCCT TCGATGCGCC GCTTGCGGAC GAAGGATTGC TGAACCGCGA GGCCACCATC AGCGGCTTGG CGCAGATCGA GCGCATGACG GGCGCCCCTG AGACGGAGTC GCAGATGACT GCTGCTGCGG TGGCCGAGAC GCTGGCGGGC GCGACGACCT ACCCGCTGAC GCCGCCGAAG GAAAAGCCGG AGCTCGCGCA CTTCGAACTG CTGGCCCAGA AGCTGTCGCG GCACCTGAGC GCCGCGGACG AGAGCGACGA CTCATGGTCG GAACTGTGA
|
Protein sequence | MTSDAVNFKL ERKQSSLIYD TRKLGDRIND ALADETVPVV VMASLGAASF FVPALVEVMV LVALGFYWMA RRAQKRAGLA LRMPKSSGEI DPKEISLKDG KPSRAEGIVY MGNDLDTGEE VWLTDVQART HMLFMGTTGS GKTEFLVSLV YNSLIHGSGL IYVDGKADSS LYGKVYSMAR AMGREDDVLV INFQTGAKDI YGAQPNKLSN TLNPFAVGSS GMLSELVKGL MATGEKSTWT QQAESFVEAL MKPLVYLRDK HGLNLDVNVV REYFELNKLE DLAWRDGDKY PGLSESGVLD GLHNYLLTKP AYKKEKYHDQ SETTNEQHGY ITMQLIRTFN SLSDTYGYIM KTQLAEIDFV DVFLNRRILV VLLPALEKSP PELTNLGRIV VASIKATMAK GLGSALEGDW KKIIDAKPTN APSPFMCVLD EYGYYAVEGF AVVPAQARSL GFSAIFAGQD LPAFEKASKE EAASTLANTN TRLCGKLECT KTYDFFKNIA GQGIYTKTSR YEHDSGAMGP TAFKADDGIS IDRVERVSFE SLRGQLSGYW HLFFANTIVR VKSFFANPVP VGKLRVNHFI KVARPDPEEV DAYKSTTHLF TRAVTAEGGI ASYMDRVADQ QDIRDIGEGF EKFKLQDSAL HNAARVLAYC SQAEAHRAKA FASAIDIGLF ATAGGDDDAT PDFVSQMSGT VAHAPQVDDD MLAFIDPAGD GEDQSQETAP EVTSYGSEPE VPERLFDDTP AEVRPAEQEQ RGFTEDADFG RHPVALVDPD LGADDLYDDD VPADVVASAE PEEDWGSYGV EQPAARAALI PEGVGELVTH DHTTLFGAEG IDVPPAVAMA DNDVEAEAQD RPVDEVGQEG FDAFDAPLAD EGLLNREATI SGLAQIERMT GAPETESQMT AAAVAETLAG ATTYPLTPPK EKPELAHFEL LAQKLSRHLS AADESDDSWS EL
|
| |