Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3526 |
Symbol | |
ID | 4786229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3736618 |
End bp | 3738510 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640092107 |
Product | hypothetical protein |
Protein accession | YP_001022714 |
Protein GI | 124268710 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.416109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGACT CCAACTCCTC TGCCAACCCG GCCTTCCAGC AGCTCAGCGA TCCGGTGCGG CGCGTGCTGC TGCGAGGCGG GCTCGCCGCC AGCGTGGGTG GGCTGCTCGC GCCGCTGGCC GGCTGCGGCG CGCCGGCGCG GGCTCCGGGC GGGGCACTGC TGGGCTTTCG CGCAGTGCCG GCGTCGAGTG CCGATGCCGT GGTGGTGCCG CCGGGCTATG TGGCCGAGGT GATCTACGCC TGGGGCGACG CGATCGGCGC ACCGGGCCTC GCGGCCGGTC AGCCGGCTTT CCGCGGCGAC GGCAGCCACA GCGCCGAGGA CCAGCAGCTG CAGGCCGGCA TGCACCACGA CGGCATGCAC TTCTTCCCGG CGGGCCGCGG CGATCGTGGC CTGCTGGTGG TGAACCACGA GTACACCGAC GAAGGCCTGC TGCATGCCGA CGGCATGCAG ACCTGGAGCG CCGAGAAGGT GCGCAAGTCG CAGGCGGCGC TCGGCGTGTC GGTGGTCGAG CTGGCTCGTG CGGCGGACGG GCGCTGGGCG GTGCAGCGCC CGTCGGCCTA TGCGCGGCGC ATCAGCGCCC GCACGCCGAT GACGATCAGC GGGCCGGCCG CCGGCCACGC GCTGATGCGG ACGGAGTTCG ATCCCGGTGG CCGCGAGGTC ATCGGCACCT TCAACAACTG CGCGCACGGC GTCACGCCCT GGGGCACCTA CCTGGCCTGC GAGGAGAACT TCGTCGGCTA CTTCCACGGC CCGGCGCAGC CGGATGCTCA TGCGAAGCGC TGGGGCCTGA CACCGGGCGG CTACGGTCTG CGCTGGCACG AGCACGACGC GCGCTTCAAC GCCACGCTGC ATCCGAACGA GTTCAACCGT TTCGGCTGGG TGGTGGAGAT CGACCCGCTC GACCCGCGCA GCACCCCGGT CAAGCGCACC GCGCTCGGCC GCGCGGCGCA CGAGGGCGCC TTCGTCACGC AGGCGGCCGA CGGGCGTGCG GTGGTCTACA TGGGCGAGGA TGCGCGCTTC GAGTACCTCT ACAAGTTCGT GTCGCGTGAC CCGGTGCGCA CCGGCGGTTA CGCGAACAAC CGCGAAGTGC TGGACCACGG CACGCTGTAC GTGGCCCGTT TCGACTCGGA CGGTACGGGC GAATGGCTGG CGTTGGTTCA CGGCCAGAAC GGACTCGATG CGCAGGCGGG CTTTGCCGAC CAGGGCGAAC TGCTGGTGAA GTCGCGCCAG GCGAGCGACA AGGTCGGCGC GACCAAGATG GACCGCCCCG AGTGGATCGC CGGCGACCCG CGCACCGCGA CGCTGTACTG CTCGCTCACC AACAACAGCC GCCGCGGCAC CGACGGCTAT CCGGCCGTCG ACGCCGCGAA TCCGCGCGCG CGCAACACCA TGGGCGAGGT GATCCGCTGG ACGGAGGTGG GAGGCCACGC CGCGACGCGC TTCCTGTGGG ACCACTTCAT CCTGGCCGGC GATCCGGCCA ACGAGCGCGC CGAGGCGCGC GGCAACGTGA AGGGCGACCC GTTCGGCAGC CCGGACGGCC TGTGGTTCGA CGCGCGCGGC GTGCTGTGGA TCGCGACCGA TGCGTCGCCG ACGGCGCTGG GTCAGGGCGA CTACGCCCGA CTGGGCAACA ACATGCTGCT GGCCGCCGAT CCGCGCAGCG GCGAGGTGCG GCGCTTTCTC ACCGGGCCGG TCGGCTGCGA GATCACCGGC ATGGTCGGCA CGCCGGACCT GCGCACGCTG TTCGTCAACA TCCAGCATCC CGGCGAGGGC GGCAACGACC CCGGCGATCC CGCCGCTGCA AGAAAGCGCT CCACCTGGCC CGGCGGCGGC GGGCGGCCGC GCTCGGCCAC AGTGGCGATC CGGCGCATCG ACGGCGGCGC GATCGGCACC TGA
|
Protein sequence | MEDSNSSANP AFQQLSDPVR RVLLRGGLAA SVGGLLAPLA GCGAPARAPG GALLGFRAVP ASSADAVVVP PGYVAEVIYA WGDAIGAPGL AAGQPAFRGD GSHSAEDQQL QAGMHHDGMH FFPAGRGDRG LLVVNHEYTD EGLLHADGMQ TWSAEKVRKS QAALGVSVVE LARAADGRWA VQRPSAYARR ISARTPMTIS GPAAGHALMR TEFDPGGREV IGTFNNCAHG VTPWGTYLAC EENFVGYFHG PAQPDAHAKR WGLTPGGYGL RWHEHDARFN ATLHPNEFNR FGWVVEIDPL DPRSTPVKRT ALGRAAHEGA FVTQAADGRA VVYMGEDARF EYLYKFVSRD PVRTGGYANN REVLDHGTLY VARFDSDGTG EWLALVHGQN GLDAQAGFAD QGELLVKSRQ ASDKVGATKM DRPEWIAGDP RTATLYCSLT NNSRRGTDGY PAVDAANPRA RNTMGEVIRW TEVGGHAATR FLWDHFILAG DPANERAEAR GNVKGDPFGS PDGLWFDARG VLWIATDASP TALGQGDYAR LGNNMLLAAD PRSGEVRRFL TGPVGCEITG MVGTPDLRTL FVNIQHPGEG GNDPGDPAAA RKRSTWPGGG GRPRSATVAI RRIDGGAIGT
|
| |