Gene Mpe_B0020 details


Gene Information

Locus tag: Mpe_B0020
Symbol:
ID: 4787623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 18416
End bp: 20344
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 69%
IMG OID: 640092431
Product: hypothetical protein
Protein accession: YP_001023036
Protein GI: 124262566
COG category:
COG ID:
TIGRFAM ID:
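
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 18416
end_bp = 20344
gene_length_bp = 1929
protein_length_aa = 642


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, the 1929 bp span corresponds to 643 codons, that is 642 residues plus the stop codon.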


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00459602
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACCTCA GCGACGCCGG CATCCAGACG CTCGAGCAAG CCATGGCTCA GGCTCGGGCC 
CAGACCAAGA CGCGCAAGAC GGGCGACACC GCGGCCCTGA AGGCCCTGCG CGACATCTTG
AAGGGAGTCG TCGTCCGCCC GGACGACCTC CCCTTCGGAA CCGTTCAGCC CGAGCGCGCT
CTGCCTGGCC ACCTGCTGCT GGCTGACGCT TGGGTCACCA GCGCCCATGG CGACAAGCTG
TGGGCCCGCC TCAGCGGCAC CGGCCGGCAC TACAACGGCA TGTTCGGCAC GCCGGAGGAG
CTGCAGGCGG GTCACGGCGC CCTCACCGAC AAGGACGTCA TCGCCGGACT GCTGCACACC
GAGGAGGCCT ACAGGGAGCT GGAGCCGTAC GACATCGGCA CCCGAGTCGC CATCGCGCTT
CGTATCCTCG CGGCCCCGTC CGTTGCCCGG GTTATGCCCC AGGCGGCTCG CGCGGTGCAG
CGGATGCTGA GCGAGGAGCT GCTCGCGCAC CCGGTGCTGC TTCACCCGGC CATCCTCGTC
CCGGGCGCCG GCGGCAAGGA GGGAAGCTAC GTGGCGCTCG GCAAGGCCAT CGCGGCGACC
GGCTGCGCCG GGTTGCCGCG GCGATCCGAC TTCCAGGGCG AGACCCCGCC TGGCGGCGCG
GGCGGTCCGT TTGGCACTCT AGAGGCACTC TACGATGCCA AACTCGACGT CGCCGGCTCC
GTTGTTGCCT CTAGAGTGCC TCTAGAGTGC CTCTACGATG CCAAACGGCC CGAGGGTGGC
CAGGGCAGCC TCTTTGGTGA CGAGAAGCGG GCCGAGCCAG TGGCGGCTCG CGACGTGCCC
GACGAGTTCG AGCAGCGGCT CCTGGACTTC GCCTTGCACC CGTGCGTGAG GCTCGGGCTC
AGCCAGGCTT CGGTCGAGGT GACGGACAAG ACCCTTTATG GCAACTCCAG TGGGGGTATC
CGCTTCCGGC GCAGCCGCTT CGTGACCAGC ACCATGAAGC TGGCGACCTC CCAGCCCGCG
TTCATCGGTT CCGTGATGGA GCTGGAGACG CTTGTCGTGC GGCATCAGGC CGAGCAGGCC
GGCGGCGCGC CGGCGAAGGT GCCCTTCCCC CTTCCCTTCC CGTTCACCGT GCTCGGTGGG
CTGAAGCTCG ATGCTGCAGC AATCACCTCT GGGGCCAACT ACAAGTCTCC CGACGTCGAA
GACTCCGACG TCGACGAGTC GCACCTGGTG GCGCTGACCG TCCTGGCCTC GCTCTACACC
TCCTTCGGGA CGCTCAACCT CTCGAGCTCC GTGGCCGGGA AGAGGCCGAT GAGCGAGCGG
GTGAACCCGG CCGATCCGGC CTTCGTGCTC GAGGTCCGTC GGCGCCTCGA GGTGCTGCGG
GCGCAGTCAT CGTTCGCGAT GAATCTGGGC CCGGTGCTCA AGCGCCGTGT ATTCATGGGC
AGATTCGATG AACTCAGCCC CGAGGACCTT GAATTCAACG CCCGGAGGGT GCTCGATGGG
TTCGTGGCCG CCGGCCTGGC GCCCGACGCC CGCGCGGCCG CGGCCTACAT GCTCGAGCAC
GCGCTCGGCC GGTCGCGCTC AGACATGGCC CTTCCGAAGA CGTCGGTGGA GGCGGCAGTC
GCCTTCGTCA AGGTGCTCGT CTCCACCGGT GTCCTCGGCA AGGTGATGGG CCCGGGCCAG
ACGGACCTGG AGGGCTCGTT CGAGCAGCTC ATGGAGAAGG CGATGGTCGA CGTCGTCGAC
TTCCCAGGCT ACGAGGTGCC GTTCCGGTCG GTGTGGCTCA AGGCGATGGA GCTGGTCTCG
GTGGAGGCGC GCATGCGCGC GGTCATCGAC GCGGTGTCGG AGGTGCCGGA AGCTGCGGTT
GCGACCGTGG AGTGCGTCAC GAAGCCGGTG GAGGAAGCGG CGCCGAGGAG GAGAAGGGCC
GCGGTGTAG
 
Protein sequence
MHLSDAGIQT LEQAMAQARA QTKTRKTGDT AALKALRDIL KGVVVRPDDL PFGTVQPERA 
LPGHLLLADA WVTSAHGDKL WARLSGTGRH YNGMFGTPEE LQAGHGALTD KDVIAGLLHT
EEAYRELEPY DIGTRVAIAL RILAAPSVAR VMPQAARAVQ RMLSEELLAH PVLLHPAILV
PGAGGKEGSY VALGKAIAAT GCAGLPRRSD FQGETPPGGA GGPFGTLEAL YDAKLDVAGS
VVASRVPLEC LYDAKRPEGG QGSLFGDEKR AEPVAARDVP DEFEQRLLDF ALHPCVRLGL
SQASVEVTDK TLYGNSSGGI RFRRSRFVTS TMKLATSQPA FIGSVMELET LVVRHQAEQA
GGAPAKVPFP LPFPFTVLGG LKLDAAAITS GANYKSPDVE DSDVDESHLV ALTVLASLYT
SFGTLNLSSS VAGKRPMSER VNPADPAFVL EVRRRLEVLR AQSSFAMNLG PVLKRRVFMG
RFDELSPEDL EFNARRVLDG FVAAGLAPDA RAAAAYMLEH ALGRSRSDMA LPKTSVEAAV
AFVKVLVSTG VLGKVMGPGQ TDLEGSFEQL MEKAMVDVVD FPGYEVPFRS VWLKAMELVS
VEARMRAVID AVSEVPEAAV ATVECVTKPV EEAAPRRRRA AV
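
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence and comparing the result with the listed protein. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted, and this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first 60 bases of the record are included here to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 matches the standard code for codon-to-residue mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record (60 bases).
first_line = "ATGCACCTCA GCGACGCCGG CATCCAGACG CTCGAGCAAG CCATGGCTCA GGCTCGGGCC"
print(translate(first_line))  # MHLSDAGIQTLEQAMAQARA
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and the GC content field of the record.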