Gene M446_3236 details


Gene Information

Locus tag: M446_3236
Symbol:
ID: 6134958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3580147
End bp: 3581823
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 71%
IMG OID: 641643423
Product: formate--tetrahydrofolate ligase
Protein accession: YP_001770075
Protein GI: 170741420
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
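
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 3580147
END_BP = 3581823


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1677 558
```

Under these assumptions the result agrees with the listed gene length (1677 bp) and protein length (558 aa).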


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.932652
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.61785
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACTG ACATCGAGAT TGCCCGCGCG GCCACCCTGC AGCCGATCAG CGCCATCGCC 
GAGACGCTCG GCATCCCGGA CGAGGCCCTG CACCCCTACG GGCGGCACAT CGCCAAGATC
GACCATGCCC ACATCGCCTC GCTGGAGGCC AAGCCCGAGG GCAAGCTGGT CCTGGTGACG
GCGATCAGCC CGACCCCCGC GGGCGAGGGC AAGACCACCA CCACCGTCGG TCTCGGCGAC
GCGCTGAACC GGATCGGCAA GCGGACGGTG ATCTGCCTGC GCGAGCCCTC GCTCGGCCCC
TGCTTCGGCA TGAAGGGCGG CGCGGCCGGC GGCGGCAGGT CCCAGGTCGT GCCCATGGAG
GCGATCAACC TCCACTTCAC GGGCGATTTC CACGCCATCA CCTCGGCCCA CAGCCTCGCG
GCGGCGCTGA TCGACAACCA CATCTACTGG GGCAACGCAC TCGGCATCGA CCCGCGCCGG
GTCGCTTGGC GGCGCGTCGT CGACATGAAC GACCGGTCGC TGCGCTCGAT CGTGCAGTCG
CTCGGCGGCG TCGCCAACGG CTACCCGCGC GAGGACGGGT TCGACATCAC GGTCGCCTCC
GAGGTGATGG CGGTGTTCTG CCTCGCCCGC GACCTCGCGG ATCTGGAGGC GCGGCTCGGC
CGGATCGTCG TCGCCGAGAG CCGCGAGCGC AAGCCCGTGA CCCTCGCCGA CCTGAAGGCG
ACGGGCGCCA TGACGGTGCT GCTCAAGGAC GCGCTGCAGC CGAACCTCGT CCAGACCCTG
GAAGGGAGCC CGGCGCTGAT CCATGGCGGC CCCTTCGCCA ACATCGCGCA TGGCTGCAAC
TCGGTGATCG CGACCCGCTC GGGCCTGCGC CTCGGCGAGT ACGCGGTCAC GGAGGCCGGG
TTCGGGGCCG ATCTCGGCGC CGAGAAGTTC ATCGACATCA AGTGCCGCCA GACCGGCCTG
TCGCCGAGCG CCGTGGTCAT CGTGGCCACG GTGCGGGCCC TCAAGATGCA CGGCGGCGTC
GAGAAGAAGG CGCTCGGCGG GGAGAACGTC GCGGCCCTGG AGAAGGGCTT CGCCAACCTC
CAGCGCCACG TCGAGAACGT GCGCCGCTTC GGACTCCCGG TGGTGGTGGC GGTGAACCAC
TTCCACGCCG ACACGGAGGC CGAGCACGCC GCCCTCAAGG CCCTGTGCCG CGACAGGCTC
GACGTCCAGG CGATCACCTG CCGCCACTGG GCGGAGGGCG GCGCGGGGGC GGAGGATCTC
GCCCGGGCGG TGGTGTCCCT CGCCGAGGGC GGCGCGCCCG CGACCCCGAA CTTCGTCTAC
CCGGAAGAGG CCAAGCTCAC CGACAAGATC CGCACCATCG CCCAGACGCT GTACGGGGCG
GCGGACATCC AGGTCGAGTC GAAGGCCGCC GCCAAGCTCG CCCAGTTCGA GAAGGACGGC
TACGGCAGGC TCCCGGTCTG CATGGCCAAG ACCCAGTACT CGTTCTCGAC CGATCCCGGC
CTGATCGGGG CGCCGAGCGG CCACGTGGTG GCGGTGCGCG ACGTGCGCCT CTCGGCCGGG
GCCGGCTTCG TGGTGGTGAT CTGCGGGGAG ATCATGACCA TGCCGGGCCT GCCCAAGGTG
CCGGCCTCCG AGGGCATCTA CCTCGACGCG AACGGGCAGA TCGAGGGCCT GTTCTGA
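
The GC content reported in the Gene Information table can be recomputed from the gene sequence. This is a minimal sketch that assumes GC content means the fraction of G and C among all bases; how the source page rounds the value is not stated. The helper `gc_percent` is a hypothetical name, and the space-separated 10-base blocks are handled by stripping whitespace.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used here as a small example.
first_block = "ATGCCGACTG"
print(gc_percent(first_block))  # prints: 60.0
```

Passing the full gene sequence (all lines joined into one string) to the same function gives the figure to compare against the table.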
 
Protein sequence
MPTDIEIARA ATLQPISAIA ETLGIPDEAL HPYGRHIAKI DHAHIASLEA KPEGKLVLVT 
AISPTPAGEG KTTTTVGLGD ALNRIGKRTV ICLREPSLGP CFGMKGGAAG GGRSQVVPME
AINLHFTGDF HAITSAHSLA AALIDNHIYW GNALGIDPRR VAWRRVVDMN DRSLRSIVQS
LGGVANGYPR EDGFDITVAS EVMAVFCLAR DLADLEARLG RIVVAESRER KPVTLADLKA
TGAMTVLLKD ALQPNLVQTL EGSPALIHGG PFANIAHGCN SVIATRSGLR LGEYAVTEAG
FGADLGAEKF IDIKCRQTGL SPSAVVIVAT VRALKMHGGV EKKALGGENV AALEKGFANL
QRHVENVRRF GLPVVVAVNH FHADTEAEHA ALKALCRDRL DVQAITCRHW AEGGAGAEDL
ARAVVSLAEG GAPATPNFVY PEEAKLTDKI RTIAQTLYGA ADIQVESKAA AKLAQFEKDG
YGRLPVCMAK TQYSFSTDPG LIGAPSGHVV AVRDVRLSAG AGFVVVICGE IMTMPGLPKV
PASEGIYLDA NGQIEGLF
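
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes the input contains only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCCGACTG ACATCGAGAT TGCCCGCGCG GCCACCCTGC AGCCGATCAG CGCCATCGCC"
print(translate(first_line))  # prints: MPTDIEIARAATLQPISAIA
```

The output matches the first 20 residues of the protein sequence listed above.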