Gene Mpe_A3006 details

Gene Information

Locus tag: Mpe_A3006
Symbol:
ID: 4784695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3197631
End bp: 3199169
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 74%
IMG OID: 640091577
Product: Ppx/GppA phosphatase
Protein accession: YP_001022194
Protein GI: 124268190
COG category: [F] Nucleotide transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
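
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that belong to no database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3197631, 3199169)
print(length, protein_length_aa(length))
```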


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.487592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACGA ACGCCCTGCC CGAGTTCCCC GAAGCCGCCC CGCTGGCCGC CATCGACATC 
GGCTCCAACA GCTTCCGCCT CGAGATCGGC CAGCTGACCC GGGGCCGCTA CAAGCGCATC
GACTACCTGA AGGAGACCGT GCGGCTCGGC GGCGGGCTCG ACGCCGACGG CCGCCTGAAC
GAGGAGGCGC AGCTGCGCGG CCTGGCCTGC CTGGCGCGCT TCGCGCTGCG GCTGCGCGGC
TTCGCGCCGG CCCAGGTGCG CGCCGTGGCG ACCCAGACGC TGCGCGAGGC CCGGAACCGC
GACGCCTTCC TGGCGCGAGC GCGCACCGTG CTCGGCCACC CGATCGAGGT CATCTCGGGC
CGCGAGGAAG CCCGCCTCAT CTTCGCCGGC GTGGCGCGAC TGCAGCCGAG CGAGCGGCCG
CGCATCGTGA TCGACATCGG CGGCCGCTCC ACCGAGATGA TCCTCGGCCA GGGCCGCACG
CCGCGCCAGG CCGAGAGCTT CCAGGTCGGC AGCGTGAGCC TGTCGATGCG CTACTTCCCC
GACGGCCGCT TCACCGCTGA CGCCTTCCGC GCCGCGCAGG TGGCGGCCGG CGCCGAGCTC
GAGGAGGCGC TGCAGCCCTT CGCACCGGGA CAGTGGATCG AGGCGCTGGG CTCCTCGGGC
ACGGTGGGTG CGGTGTCACA GCTGCTGGCG GCCAACGGCA TCAGCGACGG CGTCATCACC
CCGGTGGGCC TGCGCTGGTG CATCGAGACC TGCCTTGCCG CCGGTCACCA GGACGCGCTG
GACCTGCCAG GACTCAAGCC CGAACGCCGC GCGGTGCTGG GCGGCGGCCT GTCGATCCTC
TACACGCTGG CACTGCAGTT CGGCATCGAC GCGCTGCAGC CTGCACGCGG CGCCCTGCGC
CAGGGCGTGC TGTTCGACCT GGCCGAGCGC CTGGAGGCGG CGCAGGCCCC GGCCCGCCAC
GCGCACCGGC AGGACATGCG CGACACCTCG GTGCACGAAC TGCAGCGCCG CTTCGGCAGC
GACCTTTCCC AGGCCGCGCG CGTGCAGCGC CTGGCCGGTT CGCTGTACCG GAGCACCTCG
ACGCCGCGCA ACGGGCACAC CGAGGCCGCG CGCGAGCTGG CCTGGGCCGC CGCGCTGCAC
GAGATCGGCA TGTCGGTGTC GCACCACGAC CACCACCGCC ACAGCGCCTA CCTGTTGGCG
CACGTGGACG CGCCGGGCTT CTCGCAGAGC CAGCAGCGGC GCGTGGCGGA GCTGGTGCTC
GGCCACCGCG GCAGCCTGCG CAAGCTCGAC TCCACGCTGG ACCAGGAGGC CACGCTGTGG
CCGGTGCTCA GCCTCCGCCT CGCGGCGCTG TTGTGCCATG CGCGCAACGA CGTGCCGGAG
CGGGTGGTGG CGTTGCGGCG CACCGACGAC GGCGCGCTGC TCCGCATCGA CCGCGCCTGG
GCCGACGGCC ATCCCCGCAC GATGCACTTG CTGGGCGAGG AGGTACGGGC CTGGGAACGG
GCTGGCCGCC TGAAGTTGGC GGTCCGCACG GACGGCTGA
 
Protein sequence
MPTNALPEFP EAAPLAAIDI GSNSFRLEIG QLTRGRYKRI DYLKETVRLG GGLDADGRLN 
EEAQLRGLAC LARFALRLRG FAPAQVRAVA TQTLREARNR DAFLARARTV LGHPIEVISG
REEARLIFAG VARLQPSERP RIVIDIGGRS TEMILGQGRT PRQAESFQVG SVSLSMRYFP
DGRFTADAFR AAQVAAGAEL EEALQPFAPG QWIEALGSSG TVGAVSQLLA ANGISDGVIT
PVGLRWCIET CLAAGHQDAL DLPGLKPERR AVLGGGLSIL YTLALQFGID ALQPARGALR
QGVLFDLAER LEAAQAPARH AHRQDMRDTS VHELQRRFGS DLSQAARVQR LAGSLYRSTS
TPRNGHTEAA RELAWAAALH EIGMSVSHHD HHRHSAYLLA HVDAPGFSQS QQRRVAELVL
GHRGSLRKLD STLDQEATLW PVLSLRLAAL LCHARNDVPE RVVALRRTDD GALLRIDRAW
ADGHPRTMHL LGEEVRAWER AGRLKLAVRT DG
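
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The Python sketch below shows one way to do that, then computes a GC fraction and translates the DNA. It uses a hand-written codon table rather than a library call, on the assumption that translation table 11 assigns amino acids exactly as the standard code does and differs only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for 10-character grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
fragment = clean("ATGCCGACGA ACGCCCTGCC CGAGTTCCCC")
print(translate(fragment))
print(round(gc_fraction(fragment), 3))
```

Running the same functions on the full gene sequence should give a translation that can be compared against the protein sequence listed here, and a GC fraction that can be compared against the GC content field.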