Gene Mpe_A1144 details


Gene Information

Locus tag: Mpe_A1144
Symbol:
ID: 4785719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1222251
End bp: 1224512
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 69%
IMG OID: 640089707
Product: phosphatase
Protein accession: YP_001020340
Protein GI: 124266336
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID:
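
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record does not state either convention explicitly). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1222251
end_bp = 1224512
gene_length_bp = 2262
protein_length_aa = 753


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons are consistency checks, not proofs of the conventions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks hold for the listed values: the span covers 2262 bp, which is 754 codons, or 753 residues plus the stop codon.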


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.136121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCAGC GAGTCGATCT CCCTCTGCTC GGCATCGATC GAGATGACGA GAACAGCAAT 
ACCAGCGGCA ACGACAACTT CATGGCCGTG CTCGACGCCC GGCTGAGCCG GCGCTCCGTG
CTGCGCGGCG GCGTGGGGAC CGCGGCGGCC GTGGTGCTGG GCGGGTTGAG CGTCAGTGCC
TGCGGTGGCG GGGACGACGA CGCGCCGCCC GCGAACCCGC CGGTCGACAA GCTGGGCTTC
ACCGCCGTGG CCAAGAGCAC CGCCGACGCC GTGAGCGTGC CGGCCGGCTA CACCGCCGCC
GTGATCTACG CACTCGGCGA TCCGCTGACG GCGGGCACGC CAGCCTATGC CAACGACGGC
AGCGACACCG ATTTCGACCA GCGCGCCGGC GACCACCACG ACGGCATGGA GTACTTCGGG
CTCAACGCTG CCGGCACCGC GCGCGACGCG AACGGGTCCG AGCGCGGCCT GCTGGCGATG
AACCACGAGG CACTGAGCGA CAACTACCTG CATGTGAACG GCAGCTCGGC GCGTCCGCGT
CCGGCATCGG AATCCGACAA GGAGATCCCG GCCCACGGCG TGTCGGTGGT CGAGGTCCGC
AAGACCGGTG GCAGCTGGGC CTACGTCCAG GACTCGGCCT ACAACCGCCG CGTCACGCCG
CTCACGCCGG TGGAGCTGTC GGGCGCGGTG CGCGGCAACG CGCTCGCGAA GACGCTGTAC
TCGACCGCCG GCACCGGCGC GCGCGGCACC ATCAACAACT GCGGTACCGG CTACACCCCC
TGGGGCACGC TGCTGACCGG CGAAGAGAAC TTCGAGGGCT ACTTCACCCG CTCGGCCGGC
GACGACGCGG CCCGAGGCGA CAAGAGTGTC ACGGCGCTCA ACCGCTACAA GCGCAAGCAG
GGCGCGGCCA GCCGTCACGG CTGGGAAACC AGCGGTGCGG ACGACAAGTA CCAGCGCTGG
GACATCAGCA AGATCGGCGC CTCGGCCGAC GGCAGCGACG ACTACCGCAA CGAGCTCAAC
ACCTTCGGCT ACATCGTCGA GATCGATCCG TACGACAAGG CGGCGACCAT CAAGAAGCGC
ACCACCCTGG GCCGCTTCGC GCACGAGAGC GCGGCCTTCG GCAAGGCGGT CGCCGGCAAG
CCGCTGGCCG TCTACATGGG CGACGACTCG CGTGGCGAGT ACATCTACAA GTTCGTGTCG
GCCACGAACT GGGATGCCGC CGACGCAGAG CCCTCGAACC GCATCGCCGC CGGCGACAAG
TACCTCGACA GCGGCAAGCT CTATGTGGCG AAGTTCAACG ACGACGGCAG CGGCGACTGG
ATCGAGCTGA GCTTCGACAA TCTCGACGTG AAGAACTACG CCGGCTACGC TTTCGCCGAC
GCAGGCGACG TGGCGATCCA CTCGCGCCTG GCCGGCGACG CGGTCGGCGC GACCGCGATG
GACCGCCCCG AGTGGTGCGC GGTGCACCCG ACGACCGGCG AGATCTACTA CACGCTCACC
AACAACAGCG TGCGCAAGCT CGAACCGGCG GCACCGGCGG TCGGGGCGTC GCCCGACTCG
ATCCAGCGCG CGCTCGACGC GGCCAACCCG CGCAGCTACA AGGACTCCTA CGGCGGCGCG
CCGGAAGGCT CGGCCGGCAA CATCAACGGC CACATCATCC GCATGAAGGA AGACGGCGGC
GAGCCCGCGG CGACCGGCTT CACCTGGGAT GTCTACCTGT TCGGTGCGCA GTCCGACGCC
GACACGTCCA AGATCAACCT GTCCTCGCTC ACCGCGGACC AGGACTTCTC CAGCCCCGAC
GGCCTGTGGT TCAGCCGCTC GACCGGGCTG TGCTGGATCC AGACCGACGA CGGCGCCTAC
ACCGACGTGA GCAACTGCAT GATGCTGCTC GGCGTGCCGG GCACGGTAGG CGACGGCGTG
AAGACCACGC TCAGCTACAC CCTGGCCGAC ACGTCCACGC TGAGCATCGA CACCTATGTC
GGCAAGAAGC CGACCGCCGA CACGCTCAAG CGCTTTCTTG TCGGACCGAA GGACTGCGAG
CTGACCGGCT GCACCGAGAC GCCCGATGGC AAGACCGTGT TCGCCAACAT CCAGCACCCG
GGCGAGACGA TCTCCGCCGC GACCATCGGC AACCCGGCGG GCTATGTGAG CCACTGGCCC
GGCAACGCCG GCTACCAGGC GACCGTGGGC AACACCACCT CACGGCCGCG CTCGGCCACG
CTCGCGATCA CCAAGGACGA CGGAGGCCGC GTCGGGGCCT GA
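
The gene sequence above is laid out in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so its GC value need not match the 69% reported for the whole gene.

```python
def clean_sequence(blocked_text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked_text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases among all bases in the sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence from this record, used as a small sample.
first_line = "ATGAATCAGC GAGTCGATCT CCCTCTGCTC GGCATCGATC GAGATGACGA GAACAGCAAT"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full listing to `clean_sequence` instead of the first line would allow the result to be compared with the Gene Length and GC content fields.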
 
Protein sequence
MNQRVDLPLL GIDRDDENSN TSGNDNFMAV LDARLSRRSV LRGGVGTAAA VVLGGLSVSA 
CGGGDDDAPP ANPPVDKLGF TAVAKSTADA VSVPAGYTAA VIYALGDPLT AGTPAYANDG
SDTDFDQRAG DHHDGMEYFG LNAAGTARDA NGSERGLLAM NHEALSDNYL HVNGSSARPR
PASESDKEIP AHGVSVVEVR KTGGSWAYVQ DSAYNRRVTP LTPVELSGAV RGNALAKTLY
STAGTGARGT INNCGTGYTP WGTLLTGEEN FEGYFTRSAG DDAARGDKSV TALNRYKRKQ
GAASRHGWET SGADDKYQRW DISKIGASAD GSDDYRNELN TFGYIVEIDP YDKAATIKKR
TTLGRFAHES AAFGKAVAGK PLAVYMGDDS RGEYIYKFVS ATNWDAADAE PSNRIAAGDK
YLDSGKLYVA KFNDDGSGDW IELSFDNLDV KNYAGYAFAD AGDVAIHSRL AGDAVGATAM
DRPEWCAVHP TTGEIYYTLT NNSVRKLEPA APAVGASPDS IQRALDAANP RSYKDSYGGA
PEGSAGNING HIIRMKEDGG EPAATGFTWD VYLFGAQSDA DTSKINLSSL TADQDFSSPD
GLWFSRSTGL CWIQTDDGAY TDVSNCMMLL GVPGTVGDGV KTTLSYTLAD TSTLSIDTYV
GKKPTADTLK RFLVGPKDCE LTGCTETPDG KTVFANIQHP GETISAATIG NPAGYVSHWP
GNAGYQATVG NTTSRPRSAT LAITKDDGGR VGA
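
The protein sequence appears to be the direct translation of the gene sequence under translation table 11. The following sketch illustrates that relationship with a hand-built codon table rather than a library call. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts. The `translate` function is illustrative, and it handles only uppercase A, C, G and T.

```python
# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
gene_start = "ATGAATCAGCGAGTCGATCTCCCTCTGCTC"
gene_end = "GTCGGGGCCTGA"
print(translate(gene_start), translate(gene_end))
```

The two fragments translate to `MNQRVDLPLL` and `VGA`, which match the first ten and last three residues of the protein sequence listed above.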