Gene Mpe_A2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2334 
Symbol 
ID4783851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2503888 
End bp2505546 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content71% 
IMG OID640090903 
Producthypothetical protein 
Protein accessionYP_001021525 
Protein GI124267521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC TGACGCCCCA GGACATGGCT GCCAAGTTGT TGACCACCGG CTTCGAGCGC 
AGCGGCCCTT CGGCCGCGGC CTTGAGCGAC CCCATCGCCG ATACGCCGAT GGTGGTGACG
CTGGACCAGT TGCGGCCCTA CGACCACGAC CCGCGCGTGA CGCGCAACCC GGCCTATGCG
GAGATCAAGG CGTCCATCCG CGAACGCGGG TTGGACGCGC CCCCTGCGAT CACACGCAGG
CCGGGCGAGG CGCACTACAT CATTCGCAAC GGCGGCAACA CGCGGCTGGC GATCCTGCGC
GAGTTGTGGA GCGAGACCAA GGAGGAGCGC TTTTTTCGCA TTGCGTGCCT GTTCCGCCCG
TGGCCGGCGC GCGGCGAAAT CGTGGCGCTG ACCGGGCATC TGGCCGAGAA CGAGCTGCGC
GGCGGGCTGA CCTTCATCGA GCGGGCCTTG GGCGTCGAGA AGGCGCGCGA GTTCTACGAG
CAGGAAAGCG GCCAGGCGCT GTCGCAGAGC GAACTCGCGC GGCGGCTGAC TGCCGACGGC
TATCCGGTGC CGCAGTCACA CATCAGCCGC ATGAACGATG CGGTGCGCTA TCTGCTGCCG
GCGATCCCGA CGCTGCTGTA CGGCGGATTG GGCCGGCATC AGGTGGACCG GCTCGCGGTG
CTGCGCAAGG CGTGCGAGCG CACCTGGGAG CGGCGTGCGC TGGGCCGCAC CGTGACCGTG
GACTTCGCCA CCTTGTTCCA CGACGTGCTG ACGCAGTTCG ACACACAGCC GGACGACTTC
TCGCCGCAGC GGGTGCAGGA CGAGCTGGTG GGCCAGATGG CCGAGCTGCT GGAGGCGGAC
TACGACACGC TGGCGCTGGA GATCAACGAC AGCGAAAGCC GCCAGCGTGC GCTGACCAGC
GAACCGGCGG CGCCGACGCC ACCGGCAGCG CCTTCCGTGC CTTCTGCTCC TCCCCCGCCG
GTCTCCGCGC CTCAGCAGCC ACCCGCCTCG TCTGTGCCGC GCGACACCAC GCCAGCCGCG
CCTCCGGCGC CAGCAGCAAC ACCGCCTGCA CCGCCCGAAG CGCCGGAGGA GCAGCACGGG
CAGCGCGACG AGCGCCTGCA AGGGCACATC GTGACGCCGG CGCCGACCAC CGAGCGCCTG
CAGTCCATCC AACGGATGGT CGCGGACCAG CTCGGCGACA AGCTGCCCGA CTTCGAGGCC
GATGCGCTGC GTGCGATCCC CGTGCAGGCG GGCGGGCTCT ATCCCATCTC GGACGTCTGG
TACATCGAGC CGAGCTTGGA CGTGCCGGAT CGCCTGCGCG TGCACATCGC GCAATTCGCG
CGCGAGATCG CCGGGGACGC AGCGGTAGCC GACCACATCG AGGCCAGCGA CGGCGGCATC
GGCTTCGTCT GCGTGGCGCC GGCCGTGGGC CAGGCGAAGG CGCTGCCGGT GTTCGCGCGG
GCAGTGCTGA CCCTGCTGCA TGCGCTGAGT GCAGCGCCGC CCGCCGCGAA CGGATTGGAC
CGCGCGCGGC TGGCCGACGA GCTGGCGGCG CTGCTCCATG GCCATGGCGG CTCGGCCACA
CGCCTGAGCG ATGCTGCGCT GGTGAAGCTG TTCCGTCTGC TGCGCCTGGC GCGCCGGCTG
CTGGATCTGG AAGCCGGTGA CCCGGGCCAC GAGTCCTGA
 
Protein sequence
MAELTPQDMA AKLLTTGFER SGPSAAALSD PIADTPMVVT LDQLRPYDHD PRVTRNPAYA 
EIKASIRERG LDAPPAITRR PGEAHYIIRN GGNTRLAILR ELWSETKEER FFRIACLFRP
WPARGEIVAL TGHLAENELR GGLTFIERAL GVEKAREFYE QESGQALSQS ELARRLTADG
YPVPQSHISR MNDAVRYLLP AIPTLLYGGL GRHQVDRLAV LRKACERTWE RRALGRTVTV
DFATLFHDVL TQFDTQPDDF SPQRVQDELV GQMAELLEAD YDTLALEIND SESRQRALTS
EPAAPTPPAA PSVPSAPPPP VSAPQQPPAS SVPRDTTPAA PPAPAATPPA PPEAPEEQHG
QRDERLQGHI VTPAPTTERL QSIQRMVADQ LGDKLPDFEA DALRAIPVQA GGLYPISDVW
YIEPSLDVPD RLRVHIAQFA REIAGDAAVA DHIEASDGGI GFVCVAPAVG QAKALPVFAR
AVLTLLHALS AAPPAANGLD RARLADELAA LLHGHGGSAT RLSDAALVKL FRLLRLARRL
LDLEAGDPGH ES