Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0103 |
Symbol | |
ID | 4787706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 94925 |
End bp | 96310 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640092512 |
Product | hypothetical protein |
Protein accession | YP_001023117 |
Protein GI | 124262647 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.887161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000190274 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGT TCGACATCCA CCGAGCATCC GAAGGGTTGG TGTTGGGGCT ACTGCGTGAG TTATATGGTT GGCCGAGGCT GCGCAATCTG AACACGGAAG AGCGAACCAA CTTCCCTGGG ATCGATCTCG CTGACGACGA GGCGCGCGTG GCAGTGCAGG TCACGGGCAC GCCGACGCTG GACAAGATCA AGGGAACCGT CTCCACCTTC CTGACGCACG GCCTAGACAA GCGGTACGAC CGACTGGTGA TCTATGTCCT GACTCGGAAG CAGGGCAGCT ATTCGCAGGA CGCGATCGAC AAGGTGTCTT TGGGACGCGT GAACGTCAGT GCTCGCGACG ACATACTTGA TGTGCGTGAC GTGTGCGCCA AGGCGTCGAC CGTTGATCCA AAGACTTTGG CGAACGCACT TGAGGTCCTT CGCTCCTACA TGCGAGGAGG CGTTGCTGCC GGTCTCGCTG AGGAGGACTT CGACCCTCCC GCATTCCCGG TGGAGCGCGC CATCCTCAAT CTCATTGAGG TCTACTTTCC AGCGCGCATC TACGTTGCGG ACCTGCGCGA CGATGTGGGT TCGAAGGCCG ACAGGCGTCC GCGCAATGAA CGCAAGCTGA TCAGGACTAC GCTGGAAGAG TTGAACTTGC GTGTGCCCTC CGGCTACGAG GTAAGCAGCA GGCAGTTAAT TACCTTTCAC CCACTTGATG ATTCTCAGGG GCCTTTTGCG AGACTAATAG AGCTTGGGAC AGTCACCCCG CTGGTTCCGT CAGAGTTCTA TGGGAGCAAC AACGATCAGG AGCGGATCTT CAAGTCGTTG CTACGCTTCA CGCTGCAGCA GAAGTTGCAC AGGCATCGCG TGCGCTGGTT TCACGACGAT GGGCTCTTTG CGTTTCTGCC GTTTGATGAC AAGGAGTCAC TTCGCGAGGA GACGTGGACA GGTCACAAGA AGACGTCGCG CCGAGTGTTT GAGCGAAAGC AGAACAAGAA CGACCCCAGC AAGACCTTCA TCTGCAAGCA CTTCGCATTC GCCACCGACT TCGTGCTGAA CGACGGTCGT TGGTATATCG CGCTCACCCC CGACTGGTAC TTCAGCTATG GCGACGACTA TCGGCGCTCG CGGTACGCGG ACGAGTCGTT GAAGTGGTTG AAGCGGAAGG AAGTGAATCG AACAGTCACC GACCACTTCC GGTTCTTGAC GTCCTGGCTA GCAGCTCTCG ATCAGGACGA TTTGTTCGCT CTGGCTGCAG GTGGCGCGCC GACGCTAACT TTCGGTGAGG TACTGGCGTT CGACAATCAC CCATCTCTTG ATGATGAGGC TTGGCTGCCG CTGCGCGACG CCACAGGCGA CGACGACGAG GCCGCGACGA TCAAGGGCCT ATTCGACTCA GAATGA
|
Protein sequence | MSQFDIHRAS EGLVLGLLRE LYGWPRLRNL NTEERTNFPG IDLADDEARV AVQVTGTPTL DKIKGTVSTF LTHGLDKRYD RLVIYVLTRK QGSYSQDAID KVSLGRVNVS ARDDILDVRD VCAKASTVDP KTLANALEVL RSYMRGGVAA GLAEEDFDPP AFPVERAILN LIEVYFPARI YVADLRDDVG SKADRRPRNE RKLIRTTLEE LNLRVPSGYE VSSRQLITFH PLDDSQGPFA RLIELGTVTP LVPSEFYGSN NDQERIFKSL LRFTLQQKLH RHRVRWFHDD GLFAFLPFDD KESLREETWT GHKKTSRRVF ERKQNKNDPS KTFICKHFAF ATDFVLNDGR WYIALTPDWY FSYGDDYRRS RYADESLKWL KRKEVNRTVT DHFRFLTSWL AALDQDDLFA LAAGGAPTLT FGEVLAFDNH PSLDDEAWLP LRDATGDDDE AATIKGLFDS E
|
| |