Gene Mpe_A3370 details

Gene Information

Locus tag: Mpe_A3370
Symbol:
ID: 4786357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3581277
End bp: 3582365
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 71%
IMG OID: 640091946
Product: hypothetical protein
Protein accession: YP_001022558
Protein GI: 124268554
COG category: [R] General function prediction only
COG ID: [COG0491] Zn-dependent hydrolases, including glyoxylases
TIGRFAM ID:
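
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 3581277
END_BP = 3582365
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, the reported gene length (1089 bp) and protein length (362 aa) agree with the listed coordinates.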


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.499029
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCTC TCGAACACCA ATTGGGCTAC CCGTTCGGCG ATGCCCTGCC CGCGCTGGGC 
GACGCCCTCG ACGTCGCCCC GGGCGTGAAG TGGGTGCGGA TGCGGCTGCC GTTCGCGCTG
GACCACATCA ACCTCTGGCT GCTGCGCGAC CGGCTCGAGA GTCCGGACGG CCCGGTGGAA
GGCTGGGCCA TCGTCGACTG CGGCATCGGC GACGACACCA CCCGCGCCGC ATGGGAGCAG
GTGTTCGCCG ACCACCTGGA CGGCCTGCCG GTGCTGCGCG TGATCGTGAC CCACATGCAC
CCGGACCACA TCGGTCTGGC CGACTGGCTG ACCACGCGCT GGAGCGTGAT CGGGCGCGAC
TGCCCGCTGT GGATCAGTGC GACCGACTGG AACGCGGCTC GGATCGCCTG CCGCAGCACC
ACCGGCGTCG GCGGCGACGA AGCGGCAGCC CACTTCGCGC GCCATGGCGT GACCGACGCC
GACGCGCTGC AGAAGATCCG CCGGCGCTCG AACTACTACG CCTCGATGGT GCCGTCGGTG
CCGCAGCGCT ACCACCGCCT GCTCGACGGC GAGACGCTGC GCGTCGGGGA CCATGGCTGG
CGTTGCCACG TCGGCTACGG CCATGCGCCG GAGCACATCG CGCTGCATTG CGAGGCGCTC
GGCGTGCTGA TCTCGGGTGA CATGGTGCTG CCGCGCATCA GCACCAACGT CAGCGTGCTC
GACTTCGAAC CCGAGGCCGA CCCGCTGCGG CTCTACCTCG AGTCGATCGA GCGGATGCGC
GCGCTGCCGC CCGACACGCT GGTGCTGCCG GCCCATGGCC GCCCCTTCAC CGGCCTTCAC
ACCCGCGTCG ACCAGTTGCG CGACCACCAT GCCGAACGCT TCGCCGACGT GCTGGCCGCC
TGCGCCGCCG CCCCGAAGAC GGCGACCGAG CTGCTGCCGG TGCTGTTCAA GCGCCCGCTC
GACCTGCACC AGACCACCTT CGCGCTGGGC GAGGCGATCG CCCACCTGCA CGCGCTGTGG
TTCGACGGGC GGCTGGTGCG TCAGGTGGAC CGTGCGAGTG GCGTCTACCG CTTCGCCGTC
GCCGGCTGA
 
Protein sequence
MNALEHQLGY PFGDALPALG DALDVAPGVK WVRMRLPFAL DHINLWLLRD RLESPDGPVE 
GWAIVDCGIG DDTTRAAWEQ VFADHLDGLP VLRVIVTHMH PDHIGLADWL TTRWSVIGRD
CPLWISATDW NAARIACRST TGVGGDEAAA HFARHGVTDA DALQKIRRRS NYYASMVPSV
PQRYHRLLDG ETLRVGDHGW RCHVGYGHAP EHIALHCEAL GVLISGDMVL PRISTNVSVL
DFEPEADPLR LYLESIERMR ALPPDTLVLP AHGRPFTGLH TRVDQLRDHH AERFADVLAA
CAAAPKTATE LLPVLFKRPL DLHQTTFALG EAIAHLHALW FDGRLVRQVD RASGVYRFAV
AG
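
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a hedged sketch rather than the method used to generate the record. It uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons, and it does not handle the alternative start codons that table 11 permits (not needed here, since the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are assumptions made for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    first_block = "ATGAACGCTC TCGAACACCA ATTGGGCTAC"
    print(translate(first_block))  # MNALEHQLGY
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.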