Gene Mpe_A2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2841 
Symbol 
ID4785535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3020883 
End bp3021875 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content69% 
IMG OID640091412 
Productinositol-1(or 4)-monophosphatase 
Protein accessionYP_001022030 
Protein GI124268026 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.202978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGA CCCTGCACCC CATGCTCAAC ACCGCCGTCA AGGCTGCACG CACCGCGGGG 
GCGCTGATCA ACCGGGCCTC GCTGGACATC GAGCGCGTGA CGGTGACGGC CAAGTCGCAC
AACGACTTCG TGACCGAGGT GGACCAGGCG GCCGAGGCGG CGATCATCGA GACGCTGCTC
GGCGCCTACC CCGGCCACGG CATCCTCGCC GAGGAGACCG GGCGGACCCA CGGCGCGAAG
GACAGCGACT ACCTCTGGAT CATCGATCCG CTCGACGGCA CCACCAACTT CATTCACGGC
TTCCCGGTCT ACGCGGTCTC GATCGCACTC GCCTTCCGCG GCCAGATCCA GCAGGCGGTG
GTCTACGATC CCTCACGCAA CGACCTGTTC TACGCCTCCA AGGGACGCGG CGCCTTCCTC
AACGACAAGC GCCTGCGCGT CAGCAAGCGC AGCCGCCTGC TGGAGTCGCT GATCGGCACC
GGCTTCCCGT TCCGCAAGGG CGACAACTTC AAGCGTTACT TGAAGATGTT CGAGGAGGTC
ATGCAGCACT GCGCCGGCCT GCGCCGCCCG GGTGCCGCCG CGCTGGACCT GTGCTACGTG
GCCGCGGGCT GGTACGACGG CTTCTTCGAG ACCGGGCTGA ACCCCTGGGA CATCGCGGCC
GGCTCGCTGA TCATCACCGA GGCCGGCGGC CTGATCGGCA ATTTCACCGG CGAGTCCGAC
TTCCTCTACC AGCGCGAGAT CGTCGCGGGC AACCCGAAGA TCTATGCGCA GCTGGTGAGC
ATCCTCGCGC CCTACACCCG CATCATCAAG GACGACGACG CGGGCGCCAC GGCCCCCGCT
GCGGGCGCCG CAGCGGCGGC GCCGGACGCG ACCGCCGCCT TCGTCGCGAG CGTCGAGGCC
GATACACCGC CCGCCGCCAC CGCGAAGAAG CCGCCGGTGC GCATCCGCAA GACCGACCTC
GCCAAGGCCA AGGACAACGA CGCCCCGTTC TGA
 
Protein sequence
MSQTLHPMLN TAVKAARTAG ALINRASLDI ERVTVTAKSH NDFVTEVDQA AEAAIIETLL 
GAYPGHGILA EETGRTHGAK DSDYLWIIDP LDGTTNFIHG FPVYAVSIAL AFRGQIQQAV
VYDPSRNDLF YASKGRGAFL NDKRLRVSKR SRLLESLIGT GFPFRKGDNF KRYLKMFEEV
MQHCAGLRRP GAAALDLCYV AAGWYDGFFE TGLNPWDIAA GSLIITEAGG LIGNFTGESD
FLYQREIVAG NPKIYAQLVS ILAPYTRIIK DDDAGATAPA AGAAAAAPDA TAAFVASVEA
DTPPAATAKK PPVRIRKTDL AKAKDNDAPF