Gene Mpe_A1606 details


Gene Information

Locus tag: Mpe_A1606
Symbol:
ID: 4787230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1733196
End bp: 1734794
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 78%
IMG OID: 640090174
Product: phosphomethylpyrimidine kinase / thiamine-phosphate pyrophosphorylase
Protein accession: YP_001020803
Protein GI: 124266799
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
        [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
            [TIGR00693] thiamine-phosphate pyrophosphorylase
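
Note on the coordinate fields: the start, end, gene length and protein length values in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the gene length includes the terminal stop codon. Both readings are assumptions inferred from the numbers, not something the record states. The function names below are hypothetical and only illustrate the arithmetic.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1733196, 1734794)
print(length, protein_length(length))  # prints: 1599 532
```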


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.41103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG AGCGCACGAC CCGTCCGGTG GTCTGGAGCA TCGCCGGCAC CGACAGCGGC 
GGCGGCGCCG GCCTCAGCGC CGACCAGCGC GCGGCCGACG CCTTCGGCGT GCACCTCTGC
CCGGTGGTGT CGGCCGTCAC CGCGCAGAAC TCGCTCGCGG TCACGCGCAT CGAGCGGCTC
CCGCCGCTTG CGCTGGAGGC GCAACTGGAG GCGCTGGCCG ACGACCTGCC GCCCCAGGTG
GTGAAGACCG GGCTGCTCGG CGGCGCGGAA CATGTGCGGC GGGTCGCGCA CTGGATCGAC
CGGCTGCGGC GCCGTCAGCC GGTGGCGCTG GTGGTCGACC CGGTGCTGGC GGCCAGCAGC
GGCGCCACCT TCGCCGACCC CGACACGCTG GCCGCCTACC GCGAGCTGCT GCTGCCCCGC
GCCACGCTGA TCACCCCGAA CCGCCGCGAG GCGGCTGCGC TGCTGGGCCA GCCCGAGGCT
GGCACGGCCG GCCTGCCGGC TCAGGCGCTG GCGCTGCGGC GCCGCGGCGC GCAGGCGGTC
TGCATCACCG GCGGCGACGC CGCCGACCTC GACGGCCGCG TGCTGGACTG GATGGCCACC
GAGCAGGCCG ACGGCTGGCT CGCCGCGCCG CGCATCGCCA CGCCACACCA CCACGGTAGC
GGCTGCACCT TCGCCAGCAG CGCGGCGGCC GCCCTGGCGC TCGGCTTCGT GCCGGCCGAC
GCGCTGGTAC TGGCCAAGAT GGCGACCGGC CACGCACTGC GCCACGCTCA CCCGGCCGGC
GCGGGCCGCG GGCCGGTGCG GGCCGCGGCC GGGTTCGCCA GCGATCCGAG CCTGATGCCC
TGGCTGTCGT GGGGCGCTGC GCCGGTGTTC GCGCCGGCTG CCGAGTCTGA GGCACCGCCC
GCCTCGCTCG GCCTCTATGC CATCGTCGAC AGCGCCGCGC GGGCGGAGCA GGTGCTGGCC
GCCGGCGTCC GCACGCTGCA GCTGCGCCTC AAGACGCCAG CCGTCCCCGA CGCCGGCTGG
CATCCCGCGT TGCGTGCCAC CTTGCAGCGC GGCATCGCCG CAGCGCGCGG GACCGGCGCC
GCGCTGTTCG TCAACGACCA CTGGCAGCTC GCCGCCGAAC TGGGTGCCCC CGGCGTGCAC
CTGGGCCAGG AGGACCTGCT GGCGCTGGGC GACGAGGGCC GGGCGCGGCT GCGCGCCAGC
GGGCTCGCGC TCGGCATCAG CTCGCACAGC CTGTGGGAGC TGGCGCGCGC CCGCACGCTG
GCGCCCCACT ACGTCGCCTG CGGCCCGGTC TGGCCCACGC TGACCAAGGC CATGCCGTGG
CGGCCCCAGG GCCTGGACAA CCTGGCGTGG TGGTGCGCCA TGGCCGGCCG GCCGGTGGTC
GCGATCGGCG GCATCCTGAC GCCCGAGCGG GTGGAAGCCG CGGCGCGCTG CGGCGCCGAC
GGCGTGTGTG CGGTCCGCGT GCTGGGCGAC GACCCGGCGC GCGTGCTGCC GTCGCTGCAG
CGCGCGCTGC AGGCCGGGCG CCAGGCTCCC ACGCCGGCGC CGGTGCCCGC GCTGCCGCAC
CCCTCGCTCG CCCTGAGCGT CGCACCCCGG CCCTCCTGA
 
Protein sequence
MTAERTTRPV VWSIAGTDSG GGAGLSADQR AADAFGVHLC PVVSAVTAQN SLAVTRIERL 
PPLALEAQLE ALADDLPPQV VKTGLLGGAE HVRRVAHWID RLRRRQPVAL VVDPVLAASS
GATFADPDTL AAYRELLLPR ATLITPNRRE AAALLGQPEA GTAGLPAQAL ALRRRGAQAV
CITGGDAADL DGRVLDWMAT EQADGWLAAP RIATPHHHGS GCTFASSAAA ALALGFVPAD
ALVLAKMATG HALRHAHPAG AGRGPVRAAA GFASDPSLMP WLSWGAAPVF APAAESEAPP
ASLGLYAIVD SAARAEQVLA AGVRTLQLRL KTPAVPDAGW HPALRATLQR GIAAARGTGA
ALFVNDHWQL AAELGAPGVH LGQEDLLALG DEGRARLRAS GLALGISSHS LWELARARTL
APHYVACGPV WPTLTKAMPW RPQGLDNLAW WCAMAGRPVV AIGGILTPER VEAAARCGAD
GVCAVRVLGD DPARVLPSLQ RALQAGRQAP TPAPVPALPH PSLALSVAPR PS
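
Note on using the sequences: both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is one possible way to do that, to compute a GC fraction, and to translate the gene with the codon assignments of translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The sketch also assumes the sequence is given in coding orientation and starts with ATG; table 11 permits alternative start codons, which this code does not convert to M.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
# Translation table 11 uses these same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACGGCGG AGCGCACGAC CCGTCCGGTG"))  # prints: MTAERTTRPV
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.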