Gene Information |
Locus tag | Mext_0414 |
Symbol | |
ID | 5832652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 458330 |
End bp | 460003 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641366198 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_001637907 |
Protein GI | 163849864 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
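The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mext_0414.
length = gene_length(458330, 460003)
print(length)                  # 1674
print(protein_length(length))  # 557
```

Under these assumptions, both results agree with the Gene Length (1674 bp) and Protein Length (557 aa) fields.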
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.119816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.285646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCAG ATATCGAGAT CGCCCGCGCG GCGACCCTGA AGCCGATCGC CCAGGTCGCC GAAAAGCTCG GCATCCCGGA CGAGGCGCTT CACAACTACG GCAAGCACAT CGCCAAGATC GACCACGACT TCATCGCCTC GCTCGAGGGT AAGCCCGAGG GCAAGCTGGT GCTCGTCACC GCAATCTCGC CGACGCCTGC GGGCGAGGGC AAGACCACCA CCACGGTGGG TCTCGGCGAC GCGCTCAACC GCATCGGCAA GCGGGCGGTG ATGTGCCTGC GCGAGCCCTC GCTCGGCCCC TGCTTCGGCA TGAAGGGCGG CGCGGCCGGC GGCGGCAAGG CGCAGGTCGT GCCGATGGAG CAGATCAACC TGCACTTCAC CGGCGATTTT CACGCCATCA CCTCGGCGCA CTCGCTCGCC GCCGCTCTGA TCGACAACCA CATCTACTGG GCCAATGAGC TCAACATCGA CGTGCGCCGC ATCCACTGGC GCCGCGTGGT CGACATGAAC GACCGGGCGC TGCGCGCCAT CAACCAGTCG CTCGGCGGCG TCGCGAACGG CTTTCCGCGT GAGGACGGCT TCGACATCAC CGTCGCCTCC GAGGTGATGG CGGTGTTCTG CCTCGCCAAG AATCTGGCCG ACCTCGAAGA GCGGCTCGGC CGCATCGTCA TCGCCGAGAC CCGCGACCGC AAGCCGGTGA CGCTGGCCGA CGTGAAGGCG ACCGGCGCCA TGACCGTTCT GCTCAAGGAT GCGCTGCAGC CGAATCTCGT GCAGACGTTG GAGGGCAACC CGGCCCTGAT CCATGGCGGC CCGTTCGCCA ACATCGCCCA CGGCTGCAAC TCGGTGATCG CCACCCGCAC CGGCCTGCGG CTCGCCGACT ACACCGTCAC CGAGGCCGGC TTCGGCGCGG ATCTCGGCGC GGAGAAGTTC ATCGACATCA AGTGCCGCCA GACCGGCCTC AAGCCGTCGG CGGTGGTGAT CGTCGCCACG ATCCGCGCCC TCAAGATGCA TGGCGGCGTC AACAAGAAGG ATCTCCAGGC TGAGAACCTC GACGCGCTGG AGAAGGGTTT CGCCAACCTC GAGCGCCACG TGAACAACGT GCGCAGCTTC GGCTTGCCGG TGGTGGTGGG CGTGAACCAC TTCTTCCAGG ACACCGACGC CGAGCATGCC CGGTTGAAGG AGCTCTGCCG CGACCGGCTC CAGGTCGAGG CGATCACCTG CAAGCACTGG GCGGAGGGCG GCGCGGGCGC CGAGGCGCTG GCGCAGGCCG TGGTGAAGCT CGCCGAGGGC GAGCAGAAGC CGCTGACCTT CGCCTATGAG ACCGAGACGA AGATCACCGA CAAGATCAAG GCGATCGCGA CCAAGCTCTA CGGCGCGGCC GACATCCAGA TCGAGTCGAA GGCCGCCACC AAGCTCGCCG GCTTCGAGAA GGACGGCTAC GGCAAATTGC CGGTCTGCAT GGCCAAGACG CAGTACTCGT TCTCGACCGA CCCGACCCTG ATGGGCGCGC CCTCGGGCCA CCTCGTCTCG GTGCGCGATG TGCGCCTCTC GGCGGGCGCC GGCTTCGTCG TGGTGATCTG CGGTGAGATC ATGACCATGC CGGGCCTGCC CAAAGTGCCG GCGGCGGACA CCATCCGCCT CGACGCCAAC GGTCAGATCG ACGGGCTGTT CTAG
|
Protein sequence | MPSDIEIARA ATLKPIAQVA EKLGIPDEAL HNYGKHIAKI DHDFIASLEG KPEGKLVLVT AISPTPAGEG KTTTTVGLGD ALNRIGKRAV MCLREPSLGP CFGMKGGAAG GGKAQVVPME QINLHFTGDF HAITSAHSLA AALIDNHIYW ANELNIDVRR IHWRRVVDMN DRALRAINQS LGGVANGFPR EDGFDITVAS EVMAVFCLAK NLADLEERLG RIVIAETRDR KPVTLADVKA TGAMTVLLKD ALQPNLVQTL EGNPALIHGG PFANIAHGCN SVIATRTGLR LADYTVTEAG FGADLGAEKF IDIKCRQTGL KPSAVVIVAT IRALKMHGGV NKKDLQAENL DALEKGFANL ERHVNNVRSF GLPVVVGVNH FFQDTDAEHA RLKELCRDRL QVEAITCKHW AEGGAGAEAL AQAVVKLAEG EQKPLTFAYE TETKITDKIK AIATKLYGAA DIQIESKAAT KLAGFEKDGY GKLPVCMAKT QYSFSTDPTL MGAPSGHLVS VRDVRLSAGA GFVVVICGEI MTMPGLPKVP AADTIRLDAN GQIDGLF
|
| |
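The sequences above are displayed in space-separated blocks of 10, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence might be normalised and checked against the GC content and Translation table fields. The helper names are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First block of the gene sequence, used here only as a short demonstration.
first_block = "ATGCCCTCAG"
print(gc_fraction(first_block))  # 0.6
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported 67% GC content, and `ends_with_stop` offers a quick sanity check that the sequence was copied in full.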