Gene Mpe_A1537 details


Gene Information

Locus tag: Mpe_A1537
Symbol:
ID: 4783555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1657479
End bp: 1659167
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 68%
IMG OID: 640090104
Product: PhoH-like ATPase
Protein accession: YP_001020734
Protein GI: 124266730
COG category: [T] Signal transduction mechanisms
COG ID: [COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH
TIGRFAM ID:
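
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. Both are assumptions rather than something the record states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1657479
end_bp = 1659167

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1689
print(protein_length)  # 562
```

Under these assumptions the computed values agree with the listed Gene Length (1689 bp) and Protein Length (562 aa).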


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.615224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGC CCAAACCGCC CGCCAAGAAA GCCTCCCTCC TCAGCGCCAG CGACTTCGAC 
TCCCAGGCCG CACCCAAACC GGGCACCCGC GCCACCAAGC AGGACAACCC CGGTCCTGCC
CTGCTCGACC TGTTCGACCC CCACCCCAAG GCGCCGGCCG CCAAGGCGCC CGCCCCGGCA
CCCGCTGCCA CGCCGCGAGC AGTACAGGCC CCGGCGGCCG GCTACCTGCC GGTCAAGGCC
ATCAGCCGCG GCACCGCCAC GGAACCGCGC CCGCGCAAGC CGCGCGCCAG CGGCCCGCCC
AAGCTCTTCG TCCTGGATAC CAACGTTCTG ATGCACGACC CGATGTCGCT GTTCCGCTTC
GACGAGCACG ACGTCTACCT GCCGATGATC ACGCTCGAGG AGCTCGACGG GCACAAGAAG
GGCATGAGCG AGGTCTCGCG CAACGTGCGC CAGGTCAGCC GCGAGCTCGA CGCACTGGCC
GGCGGCGCCG ACAGCGCGAA GTTCGATCCC GCGGCCGGCG TGGCGCTCGC GAAGACCGGT
CACAAGGAGG CCGGAGGCAC CCTGTTCTTC CAAACCACCT TCCTCGACGC CAAGCTGCCG
GCCGGGCTGC CGCAGGGCAA GGCCGACAAC CAGATCCTGG GCGTGGTGCA AGGCCTGCGC
GAGCAGCACC CGACACGAGA CGTGGTGCTG GTGTCCAAGG ACATCAACAT GCGCATCAAG
GCCCGTGCGC TGGGCCTGCC GGCCGAGGAC TACTTCAACG ACAAGACGCT GGAAGACGGT
GACCTGCTCT ACACCGGTGT GCTGCCGCTG CCGGCCGACT TCTGGGAGCG CCACGGCAAG
ACCATGGAGA GCTGGCAGCA AGGCGGCTCC ACGTTCTACC GCATCGCCGG GCCGCTGGTG
CCGGTGCTGA TGGTCAACCA GTTCGTCTAC CTTGAGACCC CGGGCGCCGC ACCGCTCTAC
GCGAAGGTCA CCGAGATCAC CGGCAAGACC GCGGTGCTGA AGACGCTGAA GGACTACTCG
CATGCCAAGA ACGCCATCTG GGGCGTGACC GCGCGCAACC GCGAGCAGAA CTTCGCGCTG
AACCTGCTGA TGGACCCGGA CTGCGACTTC ATCACCCTCA CCGGCACCGC CGGCACCGGC
AAGACGCTGA TGACGCTGGC CGCCGGCCTG TCGCAGGTGA TGGACGATCG CCGCTACAGC
GAGATCATCG TGACCCGCGT GACCGTGCCG GTCGGCGAGG ACATCGGCTT CCTGCCCGGC
ACCGAGGAAG AGAAGATGGG CCCGTGGATG GGCGCGCTCG ACGACAACCT CGAGGTGCTG
TCCAAGAATG ACTCGGGCGC CGGCGAATGG GGCCGCGCCG CGACCAATGA CCTGGTGCGC
AGCAAGATCA AGATCAAGAG CCTGAACTTC ATGCGCGGGC GCACCTTCCT GAACAAGTTC
GTGCTGATCG ACGAGGCGCA GAACCTGACG CCCAAGCAGA TGAAGACGCT GATCACGCGC
GCCGGGCCGG GCACGAAGAT CGTCTGCCTG GGCAACCTGG CGCAGATCGA CACGCCCTAC
CTCACCGAAG GCAGTTCGGG CCTCACCTTC GCGGTCGACC GCTTCAAGGG CTGGCCACAC
AGCGGCCACG TGATGCTGGC GCGCGGCGAG CGCTCGCGGC TGGCCGATTA CGCGTCGGAC
GTGTTGTAG
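
A GC content figure such as the one listed under Gene Information is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation; the function name `gc_content` is hypothetical, and it assumes the sequence is pasted as plain text in which whitespace between the 10-base blocks should be ignored. The example call uses only the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring any whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence above (60 bases), used only as a demo input.
first_row = (
    "ATGCCCCTGC CCAAACCGCC CGCCAAGAAA GCCTCCCTCC TCAGCGCCAG CGACTTCGAC"
)
print(f"{gc_content(first_row):.1%}")
```

Passing the full gene sequence to the same function would give the whole-gene figure to compare against the listed GC content.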
 
Protein sequence
MPLPKPPAKK ASLLSASDFD SQAAPKPGTR ATKQDNPGPA LLDLFDPHPK APAAKAPAPA 
PAATPRAVQA PAAGYLPVKA ISRGTATEPR PRKPRASGPP KLFVLDTNVL MHDPMSLFRF
DEHDVYLPMI TLEELDGHKK GMSEVSRNVR QVSRELDALA GGADSAKFDP AAGVALAKTG
HKEAGGTLFF QTTFLDAKLP AGLPQGKADN QILGVVQGLR EQHPTRDVVL VSKDINMRIK
ARALGLPAED YFNDKTLEDG DLLYTGVLPL PADFWERHGK TMESWQQGGS TFYRIAGPLV
PVLMVNQFVY LETPGAAPLY AKVTEITGKT AVLKTLKDYS HAKNAIWGVT ARNREQNFAL
NLLMDPDCDF ITLTGTAGTG KTLMTLAAGL SQVMDDRRYS EIIVTRVTVP VGEDIGFLPG
TEEEKMGPWM GALDDNLEVL SKNDSGAGEW GRAATNDLVR SKIKIKSLNF MRGRTFLNKF
VLIDEAQNLT PKQMKTLITR AGPGTKIVCL GNLAQIDTPY LTEGSSGLTF AVDRFKGWPH
SGHVMLARGE RSRLADYASD VL
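
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; the names `CODON_TABLE` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so the standard assignments are used here. Because this gene begins with ATG, no special handling of alternative start codons is attempted, which is a simplification.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCCCTGC CCAAACCGCC CGCCAAGAAA"))  # MPLPKPPAKK
# Last 9 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GTGTTGTAG"))  # VL
```

The two outputs match the first ten residues and the last two residues of the protein sequence listed above.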