Gene Msil_1712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1712 
Symbol 
ID7093172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1860272 
End bp1861948 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content61% 
IMG OID643465036 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_002362021 
Protein GI217977874 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.311565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGG ATATTGAAAT CGCCCAGCGC GCCACAATGG AGCGCATCAC CAAAGTCGCC 
GCGGAAAAGC TCGGCATCGA CGACGAGCAT CTTGAACCCT ACGGACATTT CAAGGCGAAG
ATTTCGCTGG ATTATGTCGA CAGCCTGAAA GACCGTCCGG ACGGAAAGCT CATTCTCGTC
ACGGCGATCA GCCCGACGCC GGCCGGCGAA GGCAAGACGA CGACGACGGT GGGTCTCGGC
GACGCGCTGA ACCGCATCGG CAAAAAGGCC TCGATCTGCC TGCGCGAGCC TTCGCTCGGA
CCCGTATTCG GCATGAAGGG CGGCGCGGCC GGCGGCGGCT ATGCGCAGGT CGTTCCGATG
GAGGACATCA ATCTCCATTT CACCGGCGAC TTCAGCGCCA TCGCGCTGGC GACCAATCTC
CTCGCGGCGA TGATCGACAA TCACATCCAT CACGGCAACG AACTCAACAT CGACGTGCGC
CGGATCGCCT GGAAGCGCGT CGTCGACATG AACGATCGCG CCCTGCGCGA CATCACCATC
GCCCTCGGCG GCACGGCGAA CGGCTTTCCG CGCCAGGACG GTTTCGACAT CGTCGTCGCC
TCGGAGGTGA TGGCGATTTT CTGCCTCGCC AATTCGATCG AGGATTTGAA AGCGCGTCTC
GGCGCCATCG TGATCGGCTA CACGACCGAC CAGAAGCCCG TCCATGCGCG CGATCTACAG
GCTAACGGCG CGATGGCGGT ATTGCTCAAG AACGCTTTGA AGCCGAATCT CGTGCAGACG
CTCGAGAACA ACCCGGCCTT CATCCATGGC GGTCCCTTCG CCAATATCGC GCATGGCTGC
AATTCGGTGC TCGCCACCAA GACGGCGCTC AAACTGTCGG ACTATGTCGT GACAGAGGCC
GGCTTCGGCG CTGATCTTGG CGCGGAGAAA TTCATCGACA TCAAATGCCG CAAATCGGGC
CTGCGGCCGC AGGCGGTCGT CATCGTCGCC ACGATTCGTG CGCTAAAATA TCATGGCGGC
GTGGAGCTCA AGGAGCTCAA CACGGAAAAC CTCGATGCGC TGCGCAAGGG GCTCTCCAAT
CTCGAACGCC ATATCAACAA CATCCGCAAC CATTACGGAC TGCCTGTTGT CGTCGCGATC
AACCATTTCA CAGCGGACAC CGCGGCCGAG GTCGATCTGT TGAAGAAGAG CGTCGCTGAT
CTTGGCGCGC CAATCGTCGT CTGCCGCCAT TGGGCGGAAG GCGGCAAGGG CGCCGAAGAT
CTGGCCCGCG TCGTCGTCGA GATGATCGAC AAGGTCCCGA GCGACTTCCA TTTCGTCTAT
GAGGATTCCG CCTCGTTGTG GGACAAGGCG ACGGCGGTCG CGACCAAGCT CTATGGCGCG
ACCAAGGTGA CGGCCGACGC CAAGGTCCGC AACCAGATCA AGAAGCTTGA AGAGAGCGGC
TACGGCAATT TCCCGATCTG CATCGCGAAA ACGCAATATT CCTTCTCGAC CGACGCCAAG
CTGCGCGGCG CGCCGACGGG CCATGACATC AACATCCGCG AGGTCCGGCT TGCGGCCGGA
GCGGAGTTCA TCGTGCTCGT CTGCGGCGAC GTCATGACCA TGCCGGGGCT GCCCAAAATT
CCCTCGGCGA CCAAGATCGA CCTCAACGAA AAGGGCGAGG TCGTCGGGCT GTTCTAG
 
Protein sequence
MASDIEIAQR ATMERITKVA AEKLGIDDEH LEPYGHFKAK ISLDYVDSLK DRPDGKLILV 
TAISPTPAGE GKTTTTVGLG DALNRIGKKA SICLREPSLG PVFGMKGGAA GGGYAQVVPM
EDINLHFTGD FSAIALATNL LAAMIDNHIH HGNELNIDVR RIAWKRVVDM NDRALRDITI
ALGGTANGFP RQDGFDIVVA SEVMAIFCLA NSIEDLKARL GAIVIGYTTD QKPVHARDLQ
ANGAMAVLLK NALKPNLVQT LENNPAFIHG GPFANIAHGC NSVLATKTAL KLSDYVVTEA
GFGADLGAEK FIDIKCRKSG LRPQAVVIVA TIRALKYHGG VELKELNTEN LDALRKGLSN
LERHINNIRN HYGLPVVVAI NHFTADTAAE VDLLKKSVAD LGAPIVVCRH WAEGGKGAED
LARVVVEMID KVPSDFHFVY EDSASLWDKA TAVATKLYGA TKVTADAKVR NQIKKLEESG
YGNFPICIAK TQYSFSTDAK LRGAPTGHDI NIREVRLAAG AEFIVLVCGD VMTMPGLPKI
PSATKIDLNE KGEVVGLF