Gene Mpe_A2154 details


Gene Information

Locus tag: Mpe_A2154
Symbol:
ID: 4785818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2309777
End bp: 2311099
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 70%
IMG OID: 640090722
Product: bifunctional protein: folylpolyglutamate synthase and dihydrofolate synthase
Protein accession: YP_001021345
Protein GI: 124267341
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
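
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that contributes no amino acid. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2309777
end_bp = 2311099

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in codons of three bases. The last codon is
# assumed to be the stop codon, so it adds no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1323 bp) and protein length (440 aa).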


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.10427
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.190806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCC CGTCCACGCT CGCCGACTGG CTGGCCCATT GCGAGCGCCT GCATCCCCGC 
ACCATCGACC TCACGCTGGA GCGCGTGCAG CGCGTGCGCA AGCGGCTCGG GCTGGGCTTC
GACTGTCCGG TGTTCACGGT GGCTGGCACC AACGGCAAGG GCTCGACCTG CGCGATGCTC
GAAGCCATCC TGCTGCATTC GGGTTATCGG GTTGGCGTCT ACACCTCGCC GCACCTGGTG
CATTTCGAGG AACGCTGCCG CATCGGCGGC GAGATCGTCG AGGCGGCAGC GCTGCTGCCG
CACTTCGCGG CCGTCGAGAC GGCCCGCCAA GGTGAGACGC TGACCTACTT CGAGTTCACC
ACGCTGGTGA TCCTGCGGCT TCTCAGCGAA GCCTCGCTTG ACGCGGTGGT GCTGGAGGTC
GGCCTCGGGG GGCGGCTTGA CGCCGTCAAT TCGATCGATA CCGACTGCGC CATCCTCACC
AGCATCGACC TCGACCACAT GGACTATCTC GGGCCCGACC GCGAGGCGAT CGGGCGCGAG
AAGGCAGGCA TCCTGCGCAC CGGCCGGCCG GCGATCGTCG GCGACCCGGT GCCGCCGCAG
AGCGTGATCG ACCACGCGCG CGAGATCGGC GCCGACCTGT GGCGCTTCGG GCACGACTTC
AACTACCGCG GCGACAAGCA GCAGTGGGGC TGGGCCGGTC GCGCGCGCCG CCACAACGGG
CTGGCCTATC CGGCCCTGCG CGGCGCGAAC CAGCTGCTCA ACGCCTCGGG CGTGCTGGCG
GCGCTGGAGG CGCTGCGCGA CCGGCTCCCG GTCACAGCGC AGGCCGTTCG CAACGGACTC
GCGATGGTGG AACTGCCAGG GCGCTTCCAG ATCGTGCCGG GCGCACCGAC GCTGGTGCTC
GATGTCGCGC ACAACCCGCA CGCCGTCGCC ACGCTGGCCG AGAACCTCGA CCAGATGGGC
TACTACCCCT GCACCCACGT CGTGTTCGGT GCGATGAAGG ACAAGGACAT CGAGACCATG
TTCGCGCGTC TGTTGCCGCT GGTGGATCGC TGGTACTTCA GCGACCTGCC GACACCCCGC
GCTGCGACGG CGCAGGAGCT CCGGGCCCTG CACGTGCGGG CCGTGGCCGC GCGCGCGCCG
GGGGCGACGC CGCTGCCGCC GCAAGTGCAG GCCAGCGAGC ATCCAGCGCC CCGCGCCGCA
CTCGTCGCGG CCCTCGAAGC GGCTGACCCC GCTGATAGAA TTGTGGTGTT CGGATCGTTC
TTCACCGTCG GTGGCGTGCT CGAGAACGGG TTGCCACGCC GCACTGCCAA GCATCTGGGT
TGA
 
Protein sequence
MTTPSTLADW LAHCERLHPR TIDLTLERVQ RVRKRLGLGF DCPVFTVAGT NGKGSTCAML 
EAILLHSGYR VGVYTSPHLV HFEERCRIGG EIVEAAALLP HFAAVETARQ GETLTYFEFT
TLVILRLLSE ASLDAVVLEV GLGGRLDAVN SIDTDCAILT SIDLDHMDYL GPDREAIGRE
KAGILRTGRP AIVGDPVPPQ SVIDHAREIG ADLWRFGHDF NYRGDKQQWG WAGRARRHNG
LAYPALRGAN QLLNASGVLA ALEALRDRLP VTAQAVRNGL AMVELPGRFQ IVPGAPTLVL
DVAHNPHAVA TLAENLDQMG YYPCTHVVFG AMKDKDIETM FARLLPLVDR WYFSDLPTPR
AATAQELRAL HVRAVAARAP GATPLPPQVQ ASEHPAPRAA LVAALEAADP ADRIVVFGSF
FTVGGVLENG LPRRTAKHLG
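
As a rough consistency check between the gene and protein sequences, the following sketch translates the first 30 bases of the gene sequence and computes their GC fraction. It is a minimal illustration, not the database's own method. The helper names `translate` and `gc_fraction` are hypothetical, and the codon table is built by hand from the amino-acid assignments of NCBI translation table 11 (which match the standard code for sense and stop codons) rather than taken from a library.

```python
# Codon table in NCBI order: bases T, C, A, G, with the first base
# varying slowest. Amino-acid assignments for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = dna.replace(" ", "").upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGACGACCC CGTCCACGCT CGCCGACTGG"
peptide = translate(prefix)
gc = gc_fraction(prefix)
print(peptide, gc)
```

The translated prefix can be compared with the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would extend the check to the whole record.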