Gene Mpe_A3106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3106 
Symbol 
ID4786679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3305541 
End bp3306707 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content65% 
IMG OID640091677 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_001022294 
Protein GI124268290 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.122564 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCG CGTCCTACGC CAGCGATCCG GCCAGCGGCC GTGGCCGCCT TCACGCCGAA 
GCGCCCGCAC CGACCCGGGA CGATTACCAG CGCGATCGCG ATCGCATCGT TCATTCCACC
GCGTTCCGAC GCCTCGTCTA CAAGACCCAG GTCTTCCTCA ACCACGAGGG CGACCTGTTC
CGGACGCGGC TCACGCACTC GCTCGAGGTG GCGCAGCTCG CGCGTTCGAT CGCCCGTGCT
CTGCAGCTCA ACGAAGACCT CACGGAAGCC ATCGCGCTCG CGCACGATCT GGGACACACC
CCGTTCGGAC ATGCGGGGCA GGATGAGTTG AACGGCTGCT TGCGACGCAT CGATCCGCAG
GGCTCGGGGT TCGAGCACAA CCTTCAGTCG CTGAGGGTCG TCGACCGACT CGAACAACGA
TACGCTAGCT TCGAAGGATT GAATCTCAGT TTCGAGACAA GGGAAGGCAT CCTCAAGCAC
TGCTCCAGAC GCAACGCGCA GGCGCTCGAG CTGCAGGAGC CGGGCGGTGT CGGGCGCCGT
TTTCTCGATG GCACGCAGCC CAGCCTGGAA GCGCAGTTGT GCAACCTCGC CGACGAGATC
GCCTACAACG CACACGATGT CGACGACGGC GTGCGTTCGG GTTTGATCGG TCTCGAGCAG
CTTCGCGAAG TGCCCTTGGT GGCAACCCAC ATGGATGCCG CTCGACGTGA GCATCCTGGC
CTCCAGGGCC GCAGGCTCTT GTTCGAAACC TTGCGGCGCT TGCTGTCCGC CCAGGTCTAC
GACGTGATCG ATGCCACTCG CGACGCGCTG CGGCAGCATC GCATCGAGAA GGTGTCCGAT
GTGAGGATGT CGCCGTTGCT GGTGCAGTTC ACGCCGGCGA TGCGGGACGA GAGCCTGGTA
CTGAAGCGCT TTCTGTTCGC GCAGCTCTAC CGCCACCCGC AGGTCGAGAG CACGACGCAC
CGCGCAAGAT GCGTGGTACG CGAACTGTTT GAGGCCTACG TGACGGGAGC TGCCGAGCTG
CCGTCGGCTG CGGCCGAAAG CGCCACGCCG TTCCGCGCTG TTACCGACTA TGTGGCGGGC
ATGACGGATC GCTTCGCGAT CAGCGAGCAC CGGCGCATCA CGGGCCGCAC TGTCTTCGAA
GACCACGGAC GGCTGTCTGG AGGGTAG
 
Protein sequence
MSLASYASDP ASGRGRLHAE APAPTRDDYQ RDRDRIVHST AFRRLVYKTQ VFLNHEGDLF 
RTRLTHSLEV AQLARSIARA LQLNEDLTEA IALAHDLGHT PFGHAGQDEL NGCLRRIDPQ
GSGFEHNLQS LRVVDRLEQR YASFEGLNLS FETREGILKH CSRRNAQALE LQEPGGVGRR
FLDGTQPSLE AQLCNLADEI AYNAHDVDDG VRSGLIGLEQ LREVPLVATH MDAARREHPG
LQGRRLLFET LRRLLSAQVY DVIDATRDAL RQHRIEKVSD VRMSPLLVQF TPAMRDESLV
LKRFLFAQLY RHPQVESTTH RARCVVRELF EAYVTGAAEL PSAAAESATP FRAVTDYVAG
MTDRFAISEH RRITGRTVFE DHGRLSGG