Gene Mpe_B0201 details


Gene Information

Locus tag: Mpe_B0201
Symbol:
ID: 4787802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 182162
End bp: 183940
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 59%
IMG OID: 640092607
Product: hypothetical protein
Protein accession: YP_001023212
Protein GI: 124262742
COG category:
COG ID:
TIGRFAM ID:
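The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 182162
END_BP = 183940
GENE_LENGTH_BP = 1779
PROTEIN_LENGTH_AA = 592


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS: codons minus one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1779
print(protein_length(GENE_LENGTH_BP))  # 592
```

Both results agree with the Gene Length and Protein Length fields of this record.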


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCAGTG CGCGTGCCAG ACGCGCCGAG TCGACACCTG AACACACCAG CGTGGCGGAC 
TCCGTTTGGC TGGCGCTTTC CGCAGCGGCG GAGAACGCCG AGTGGCTCGC GGCTGCGTCG
ATACCTGTCG GTCGAGTAGG ACTCACAGAC ATTGCTCGAG CTCTGGCCGC GCTCGAGGAC
TCTATTGCCG AGACCAGCTG GCCAGAAGCT CAGAAGTTCA GTCGTTCAAA TGAGACTAGG
CGCTGGCTGC TAGAGCACAA GCTCTTTCCG CCAAACATCG ACCCAGAACG ACTGAGCCGC
CACTTCGTCA GCCGGTTCGA TCGCGTCGCG CCTCAGAATC GAAAGTTGTT CAGCGAACTA
CCGGACGGCG CGAACACTGT TCATCGCTAC CCTGTGGGTG CACTGCCCCA CGAGTCATTA
GCTGACCTCA AGGCCCAAAT ACGGCAGACG CTCGACACCG ATCTCGCCAG AGTCGTCGAG
GGAGCAGTCA AGGATCTTGA CGCATTCGCC ACTCTGCAGT TGACGATCGC AGACCTTGCG
CGGAGCGAGT GCCACGGAAG CGAGCTCGAG CAACTCAAGT TGTTCATTGA GTCCAACGCG
CACCTGCGGC ATAAGTTCGT GCCTGAGATA GTGCGCGTGG CTAGCCCCCA AACGGTCTTG
ACCGCATACG CCCAGGTCAT CGAAAAGTGG CGGCGAGAGG CACGTGTACC AATACTTCCA
ACCGTGTATG GCGGCGAAGC GATGTGCGCC CTGGCCCGTG ACTACGGCGT GAAGATCGAT
CGCAACAAAG CATATCGACT CCTGACCCCA ACGGTCCTGA CTCAGACGGA AATGCTTGCC
TGCGCGCTCA TTCTTCAATG CGCGTCACGA TGGAACTTCA CGACTGTCGT AGCCCTCACC
ACGAAGGGAA TAGTTCCTAA CGGCAATGGG TTCATCGTGA CTTCACTGAA GGGACGAACG
AATCAAACTG CTCCCGATCT GGTCGTATCA CCTCGAGATC ACGAAGTCCT GCGGGCCTTG
CGCACACTCA AAGAGAATCT TGGCGAAGTC AAAGCGCTCG GTTGGGTCGC GAGCGGTGAG
GACCGCCTCT TCTTCAACAC GCACGTAGCT AGGCGCGGCG TAGTCCGTCC CTATGCCAAC
TGGCACTACG TCCTGTCGGG ATTCATCTCT CGACATGACT TGCCTCAGTT CTCACTGGAC
CAGGTTCGAG TTCAAGCGAT CAATGCCTTT GATCTCGAGA GCGCGAGTAT CGAGGCGACG
CAACGGAAGG CCGGACATGC TACGTCAACC ACAACGGCGC GTTACCTGGA CCAACCCATC
CTTCGGGCCA TTAACTCGTC AATAAATTTG GCCTACCAGC GCGAGCTAGA ACGCTCTGTG
CAGTTCGCCA TCGAAGGTCA ACCATGCCCG ACGGGCAGGC TTTTCTCACC GGTGGGCGAT
GGAAGTTCGT GCGCTGACCC TGCAACACCA CCGAGGCTCG ATATGCTCGT CGACGGGCTT
TGCGAAGCAC ACGAATGCCA CCTTGGCGCC GGGTGCCCCA ACAGAAGAAT CGTCATCGAC
ACCGATGCAC TCAGGGACCT CACGTGCACG CACCGGTTCT ACAGTCGTCA CTGGAAGGCG
CTCCTCGATG AGAACGCCGA AGCATTCGAG AAGCACCACC TTCCTACGAT GCTGTTCACA
TTCGGCCTTC GAGAAATTGT CGCGCAGGGA CCTTATCGAA GGTACCTGGC ACTGGCCGAA
GGGCCTGTCG ATCCACCAGC ATTCCCGCCA CTGAGCTAG
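
The GC content field could be recomputed from the gene sequence above. The sketch below is one way to do that, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and only the first line of the sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence above, used only as sample input.
first_line = "GTGCTCAGTG CGCGTGCCAG ACGCGCCGAG TCGACACCTG AACACACCAG CGTGGCGGAC"
print(f"{gc_content(first_line):.1f}%")
```

Passing the complete gene sequence instead of the first line gives the value to compare against the GC content field.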
 
Protein sequence
MLSARARRAE STPEHTSVAD SVWLALSAAA ENAEWLAAAS IPVGRVGLTD IARALAALED 
SIAETSWPEA QKFSRSNETR RWLLEHKLFP PNIDPERLSR HFVSRFDRVA PQNRKLFSEL
PDGANTVHRY PVGALPHESL ADLKAQIRQT LDTDLARVVE GAVKDLDAFA TLQLTIADLA
RSECHGSELE QLKLFIESNA HLRHKFVPEI VRVASPQTVL TAYAQVIEKW RREARVPILP
TVYGGEAMCA LARDYGVKID RNKAYRLLTP TVLTQTEMLA CALILQCASR WNFTTVVALT
TKGIVPNGNG FIVTSLKGRT NQTAPDLVVS PRDHEVLRAL RTLKENLGEV KALGWVASGE
DRLFFNTHVA RRGVVRPYAN WHYVLSGFIS RHDLPQFSLD QVRVQAINAF DLESASIEAT
QRKAGHATST TTARYLDQPI LRAINSSINL AYQRELERSV QFAIEGQPCP TGRLFSPVGD
GSSCADPATP PRLDMLVDGL CEAHECHLGA GCPNRRIVID TDALRDLTCT HRFYSRHWKA
LLDENAEAFE KHHLPTMLFT FGLREIVAQG PYRRYLALAE GPVDPPAFPP LS
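
The protein sequence corresponds to the gene sequence translated with translation table 11, where the initial GTG is read as methionine. The sketch below illustrates this on the first 30 bases. It builds the codon table by hand rather than using a library; `translate_cds` is a hypothetical helper, and the start-codon set is the table 11 initiation set as I understand it from the NCBI genetic code listing.

```python
# NCBI genetic-code layout: bases in the order T, C, A, G.
BASES = "TCAG"
# Table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11 (assumption: taken from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; the start codon becomes M, translation stops at a stop codon."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTCAGTG CGCGTGCCAG ACGCGCCGAG"))  # MLSARARRAE
```

The output matches the first block of the protein sequence. Applying the function to the full gene sequence gives the string to compare against the full protein sequence.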