Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0049 |
Symbol | |
ID | 4787652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 40374 |
End bp | 41579 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640092458 |
Product | hypothetical protein |
Protein accession | YP_001023063 |
Protein GI | 124262593 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.722506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00143766 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCTGC GGTTCCGGAA GACCTTCAAG CTCGCCCCCG GCGTCCGGTG GACCATTTCC GGCTCCGGCA GCAGCTGGAG CTTCGGTCCC CGAGGCGCCT CCATCAGCGT GGGCAAGCGC GGCGTCTACG CCAACTACAG CATCCCCGGC ACCGGCCTGT CATACCGCGA GCGGCTCGGT GGCCGCAGCG AGACGCCGTC CGCCCAGCTC GGACCCCCGC AGACGACCAA GGTCAGCCTG ACCTGCGGCA TCGACGATGA AGGCGTCCTG AGCTTCACCG ATGGCGCTGG CATGCCACTC AGCGAGGCGG TGGTCGAGGC CGCCAAGAAG CAGAACCGGG ACGCCATCCT GGCGCTCATC CAGCGCAAGT GCGACGAGCT GAACGACCAG GTCGAGGCGC TCGGCCGGCT TCATCACGAC ACCCCGGACT CCAGGGTGAA GCCCAAGTTC GAACCGCAGC GCATCAACCT GGTAGCGCCC GAGCGGCCCA CTCTCCGCGT CCCGACATTT CTTGAAGGAC TCAGGAAGTC CGTCCGCCTG GCCATCCAGG AGGAAAACGA CCGGGCGCTA GCCCGTTTCG AAGGGGACAC GGAAGAGTTC GAGCGTCAGC GGCGTGCGTT CTACGCGGCC GAGACCAAGC GTCGAGTTCT GGTCGAGCAG CTGATCTACC AGGACGTCCA GGCCATGGAG GACTTCCTCG AGGCAAACCT CCAGGACATC GTCTGGCCGC GGGAGACCCA GGTTGCGGTG GACATCGGGG ATGGAGGTCT CACGGTCCAG CTGGATGTCG ACCTGCCTGA AATCGAGAAC ATGCCGACCA AGTCGGCTGC CGTGCCGGCG CGCGGACTGA AGCTCTCCGT GAAGGAGCTG CCGGCTGCCA AGGTCCGCCG GCTCTACGCG GACCACGTGC ACGGCATCGT GTTTCGCCTG GTCGGCGAGA CCTTCGCCGC CCTGCCCGTC GCCCGCACGG TCGTCGTCTC AGGCTACTCC CAGCGCAGCA ACAGTGCCAC CGGCCACCTC GAGGACCAGT ACCTGCTCTC AGTCAAGGTC GCCCGAGAAG CCTGGGAACA GCTTGCCTTC GACCGGCTTG CCGAGCTCAA CGTGGTGGAC TCGCTTGCCC GCCACGAGCT GCGCCGCGAT CTGACCCGCA TCGGAGAACT GCGCCCCATC AGACCATTCC AGGAGGAGGA GACATGCGAG GTTTGA
|
Protein sequence | MALRFRKTFK LAPGVRWTIS GSGSSWSFGP RGASISVGKR GVYANYSIPG TGLSYRERLG GRSETPSAQL GPPQTTKVSL TCGIDDEGVL SFTDGAGMPL SEAVVEAAKK QNRDAILALI QRKCDELNDQ VEALGRLHHD TPDSRVKPKF EPQRINLVAP ERPTLRVPTF LEGLRKSVRL AIQEENDRAL ARFEGDTEEF ERQRRAFYAA ETKRRVLVEQ LIYQDVQAME DFLEANLQDI VWPRETQVAV DIGDGGLTVQ LDVDLPEIEN MPTKSAAVPA RGLKLSVKEL PAAKVRRLYA DHVHGIVFRL VGETFAALPV ARTVVVSGYS QRSNSATGHL EDQYLLSVKV AREAWEQLAF DRLAELNVVD SLARHELRRD LTRIGELRPI RPFQEEETCE V
|
| |