Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0020 |
Symbol | |
ID | 4787623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 18416 |
End bp | 20344 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092431 |
Product | hypothetical protein |
Protein accession | YP_001023036 |
Protein GI | 124262566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00459602 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACCTCA GCGACGCCGG CATCCAGACG CTCGAGCAAG CCATGGCTCA GGCTCGGGCC CAGACCAAGA CGCGCAAGAC GGGCGACACC GCGGCCCTGA AGGCCCTGCG CGACATCTTG AAGGGAGTCG TCGTCCGCCC GGACGACCTC CCCTTCGGAA CCGTTCAGCC CGAGCGCGCT CTGCCTGGCC ACCTGCTGCT GGCTGACGCT TGGGTCACCA GCGCCCATGG CGACAAGCTG TGGGCCCGCC TCAGCGGCAC CGGCCGGCAC TACAACGGCA TGTTCGGCAC GCCGGAGGAG CTGCAGGCGG GTCACGGCGC CCTCACCGAC AAGGACGTCA TCGCCGGACT GCTGCACACC GAGGAGGCCT ACAGGGAGCT GGAGCCGTAC GACATCGGCA CCCGAGTCGC CATCGCGCTT CGTATCCTCG CGGCCCCGTC CGTTGCCCGG GTTATGCCCC AGGCGGCTCG CGCGGTGCAG CGGATGCTGA GCGAGGAGCT GCTCGCGCAC CCGGTGCTGC TTCACCCGGC CATCCTCGTC CCGGGCGCCG GCGGCAAGGA GGGAAGCTAC GTGGCGCTCG GCAAGGCCAT CGCGGCGACC GGCTGCGCCG GGTTGCCGCG GCGATCCGAC TTCCAGGGCG AGACCCCGCC TGGCGGCGCG GGCGGTCCGT TTGGCACTCT AGAGGCACTC TACGATGCCA AACTCGACGT CGCCGGCTCC GTTGTTGCCT CTAGAGTGCC TCTAGAGTGC CTCTACGATG CCAAACGGCC CGAGGGTGGC CAGGGCAGCC TCTTTGGTGA CGAGAAGCGG GCCGAGCCAG TGGCGGCTCG CGACGTGCCC GACGAGTTCG AGCAGCGGCT CCTGGACTTC GCCTTGCACC CGTGCGTGAG GCTCGGGCTC AGCCAGGCTT CGGTCGAGGT GACGGACAAG ACCCTTTATG GCAACTCCAG TGGGGGTATC CGCTTCCGGC GCAGCCGCTT CGTGACCAGC ACCATGAAGC TGGCGACCTC CCAGCCCGCG TTCATCGGTT CCGTGATGGA GCTGGAGACG CTTGTCGTGC GGCATCAGGC CGAGCAGGCC GGCGGCGCGC CGGCGAAGGT GCCCTTCCCC CTTCCCTTCC CGTTCACCGT GCTCGGTGGG CTGAAGCTCG ATGCTGCAGC AATCACCTCT GGGGCCAACT ACAAGTCTCC CGACGTCGAA GACTCCGACG TCGACGAGTC GCACCTGGTG GCGCTGACCG TCCTGGCCTC GCTCTACACC TCCTTCGGGA CGCTCAACCT CTCGAGCTCC GTGGCCGGGA AGAGGCCGAT GAGCGAGCGG GTGAACCCGG CCGATCCGGC CTTCGTGCTC GAGGTCCGTC GGCGCCTCGA GGTGCTGCGG GCGCAGTCAT CGTTCGCGAT GAATCTGGGC CCGGTGCTCA AGCGCCGTGT ATTCATGGGC AGATTCGATG AACTCAGCCC CGAGGACCTT GAATTCAACG CCCGGAGGGT GCTCGATGGG TTCGTGGCCG CCGGCCTGGC GCCCGACGCC CGCGCGGCCG CGGCCTACAT GCTCGAGCAC GCGCTCGGCC GGTCGCGCTC AGACATGGCC CTTCCGAAGA CGTCGGTGGA GGCGGCAGTC GCCTTCGTCA AGGTGCTCGT CTCCACCGGT GTCCTCGGCA AGGTGATGGG CCCGGGCCAG ACGGACCTGG AGGGCTCGTT CGAGCAGCTC ATGGAGAAGG CGATGGTCGA CGTCGTCGAC TTCCCAGGCT ACGAGGTGCC GTTCCGGTCG GTGTGGCTCA AGGCGATGGA GCTGGTCTCG GTGGAGGCGC GCATGCGCGC GGTCATCGAC GCGGTGTCGG AGGTGCCGGA AGCTGCGGTT GCGACCGTGG AGTGCGTCAC GAAGCCGGTG GAGGAAGCGG CGCCGAGGAG GAGAAGGGCC GCGGTGTAG
|
Protein sequence | MHLSDAGIQT LEQAMAQARA QTKTRKTGDT AALKALRDIL KGVVVRPDDL PFGTVQPERA LPGHLLLADA WVTSAHGDKL WARLSGTGRH YNGMFGTPEE LQAGHGALTD KDVIAGLLHT EEAYRELEPY DIGTRVAIAL RILAAPSVAR VMPQAARAVQ RMLSEELLAH PVLLHPAILV PGAGGKEGSY VALGKAIAAT GCAGLPRRSD FQGETPPGGA GGPFGTLEAL YDAKLDVAGS VVASRVPLEC LYDAKRPEGG QGSLFGDEKR AEPVAARDVP DEFEQRLLDF ALHPCVRLGL SQASVEVTDK TLYGNSSGGI RFRRSRFVTS TMKLATSQPA FIGSVMELET LVVRHQAEQA GGAPAKVPFP LPFPFTVLGG LKLDAAAITS GANYKSPDVE DSDVDESHLV ALTVLASLYT SFGTLNLSSS VAGKRPMSER VNPADPAFVL EVRRRLEVLR AQSSFAMNLG PVLKRRVFMG RFDELSPEDL EFNARRVLDG FVAAGLAPDA RAAAAYMLEH ALGRSRSDMA LPKTSVEAAV AFVKVLVSTG VLGKVMGPGQ TDLEGSFEQL MEKAMVDVVD FPGYEVPFRS VWLKAMELVS VEARMRAVID AVSEVPEAAV ATVECVTKPV EEAAPRRRRA AV
|
| |