Gene Mext_0414 details


Gene Information

Locus tag: Mext_0414
Symbol:
ID: 5832652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 458330
End bp: 460003
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 67%
IMG OID: 641366198
Product: formate--tetrahydrofolate ligase
Protein accession: YP_001637907
Protein GI: 163849864
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
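
The coordinates and lengths listed above are mutually consistent. A minimal Python sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 458330
end_bp = 460003

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1674
print(gene_length % 3)  # 0, a whole number of codons
print(protein_length)   # 557
```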


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.119816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.285646
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCAG ATATCGAGAT CGCCCGCGCG GCGACCCTGA AGCCGATCGC CCAGGTCGCC 
GAAAAGCTCG GCATCCCGGA CGAGGCGCTT CACAACTACG GCAAGCACAT CGCCAAGATC
GACCACGACT TCATCGCCTC GCTCGAGGGT AAGCCCGAGG GCAAGCTGGT GCTCGTCACC
GCAATCTCGC CGACGCCTGC GGGCGAGGGC AAGACCACCA CCACGGTGGG TCTCGGCGAC
GCGCTCAACC GCATCGGCAA GCGGGCGGTG ATGTGCCTGC GCGAGCCCTC GCTCGGCCCC
TGCTTCGGCA TGAAGGGCGG CGCGGCCGGC GGCGGCAAGG CGCAGGTCGT GCCGATGGAG
CAGATCAACC TGCACTTCAC CGGCGATTTT CACGCCATCA CCTCGGCGCA CTCGCTCGCC
GCCGCTCTGA TCGACAACCA CATCTACTGG GCCAATGAGC TCAACATCGA CGTGCGCCGC
ATCCACTGGC GCCGCGTGGT CGACATGAAC GACCGGGCGC TGCGCGCCAT CAACCAGTCG
CTCGGCGGCG TCGCGAACGG CTTTCCGCGT GAGGACGGCT TCGACATCAC CGTCGCCTCC
GAGGTGATGG CGGTGTTCTG CCTCGCCAAG AATCTGGCCG ACCTCGAAGA GCGGCTCGGC
CGCATCGTCA TCGCCGAGAC CCGCGACCGC AAGCCGGTGA CGCTGGCCGA CGTGAAGGCG
ACCGGCGCCA TGACCGTTCT GCTCAAGGAT GCGCTGCAGC CGAATCTCGT GCAGACGTTG
GAGGGCAACC CGGCCCTGAT CCATGGCGGC CCGTTCGCCA ACATCGCCCA CGGCTGCAAC
TCGGTGATCG CCACCCGCAC CGGCCTGCGG CTCGCCGACT ACACCGTCAC CGAGGCCGGC
TTCGGCGCGG ATCTCGGCGC GGAGAAGTTC ATCGACATCA AGTGCCGCCA GACCGGCCTC
AAGCCGTCGG CGGTGGTGAT CGTCGCCACG ATCCGCGCCC TCAAGATGCA TGGCGGCGTC
AACAAGAAGG ATCTCCAGGC TGAGAACCTC GACGCGCTGG AGAAGGGTTT CGCCAACCTC
GAGCGCCACG TGAACAACGT GCGCAGCTTC GGCTTGCCGG TGGTGGTGGG CGTGAACCAC
TTCTTCCAGG ACACCGACGC CGAGCATGCC CGGTTGAAGG AGCTCTGCCG CGACCGGCTC
CAGGTCGAGG CGATCACCTG CAAGCACTGG GCGGAGGGCG GCGCGGGCGC CGAGGCGCTG
GCGCAGGCCG TGGTGAAGCT CGCCGAGGGC GAGCAGAAGC CGCTGACCTT CGCCTATGAG
ACCGAGACGA AGATCACCGA CAAGATCAAG GCGATCGCGA CCAAGCTCTA CGGCGCGGCC
GACATCCAGA TCGAGTCGAA GGCCGCCACC AAGCTCGCCG GCTTCGAGAA GGACGGCTAC
GGCAAATTGC CGGTCTGCAT GGCCAAGACG CAGTACTCGT TCTCGACCGA CCCGACCCTG
ATGGGCGCGC CCTCGGGCCA CCTCGTCTCG GTGCGCGATG TGCGCCTCTC GGCGGGCGCC
GGCTTCGTCG TGGTGATCTG CGGTGAGATC ATGACCATGC CGGGCCTGCC CAAAGTGCCG
GCGGCGGACA CCATCCGCCT CGACGCCAAC GGTCAGATCG ACGGGCTGTT CTAG
 
Protein sequence
MPSDIEIARA ATLKPIAQVA EKLGIPDEAL HNYGKHIAKI DHDFIASLEG KPEGKLVLVT 
AISPTPAGEG KTTTTVGLGD ALNRIGKRAV MCLREPSLGP CFGMKGGAAG GGKAQVVPME
QINLHFTGDF HAITSAHSLA AALIDNHIYW ANELNIDVRR IHWRRVVDMN DRALRAINQS
LGGVANGFPR EDGFDITVAS EVMAVFCLAK NLADLEERLG RIVIAETRDR KPVTLADVKA
TGAMTVLLKD ALQPNLVQTL EGNPALIHGG PFANIAHGCN SVIATRTGLR LADYTVTEAG
FGADLGAEKF IDIKCRQTGL KPSAVVIVAT IRALKMHGGV NKKDLQAENL DALEKGFANL
ERHVNNVRSF GLPVVVGVNH FFQDTDAEHA RLKELCRDRL QVEAITCKHW AEGGAGAEAL
AQAVVKLAEG EQKPLTFAYE TETKITDKIK AIATKLYGAA DIQIESKAAT KLAGFEKDGY
GKLPVCMAKT QYSFSTDPTL MGAPSGHLVS VRDVRLSAGA GFVVVICGEI MTMPGLPKVP
AADTIRLDAN GQIDGLF
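
The gene and protein sequences above can be cross-checked by translation. The sketch below is one way to do it in plain Python, not a tool used by the source database. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical. Only the first row of the gene sequence and its last nine bases are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence and the final nine bases, copied from above.
first_row = "ATGCCCTCAG ATATCGAGAT CGCCCGCGCG GCGACCCTGA AGCCGATCGC CCAGGTCGCC"
gene_tail = "CTGTTCTAG"

n_terminus = translate(first_row)
c_terminus = translate(gene_tail)

print(n_terminus)  # MPSDIEIARAATLKPIAQVA
print(c_terminus)  # LF*
```

The first twenty residues match the start of the listed protein sequence, and the final codon TAG translates to a stop (`*`) after the terminal `LF`.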