Gene Msil_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1404 
Symbol 
ID7091742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1516319 
End bp1517635 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content66% 
IMG OID643464742 
ProductFolC bifunctional protein 
Protein accessionYP_002361731 
Protein GI217977584 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.327079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCCC TCGACGCCTT GCTGGCGCGC CTTTATCTGC TGCACAAGAA AGACATCGAG 
CTGACGCTCG GGCGGATGGA GCGGCTGCTC GTCGCGCTGG GTTCGCCGCA AAAGAAACTG
CCGCCGGTCA TCCATGTCGC CGGCACCAAT GGCAAGGGCT CGACCATCGC CTTCATGCGG
GCGATCCTTG AGGCCGCGGG CCGGCGCGTC CATGTTTACA CCTCGCCGCA TCTTGTGCGG
TTCCACGAGC GGATCCGGCT CGGCGCCGCC GGAGGCGGCA AACTCGTCGG CGACGCCCAA
TTGTTCGACG CCCTCAGCCA TTGCGAGGAG GTCAACGCCG GCGCGCCGAT CACGTTCTTC
GAGTTCACGA CAGCCGCCGC CTTCAAGATC TTCAGCGAAG CGCCGGCCGA CTATCTGCTG
CTCGAAGTCG GTCTTGGCGG GCGCGGCGAC GCCACCAATG TGATCTCCGA TCCGCTGGCG
ACGGTGGTGA CCTCGATCTC AATCGACCAT CCGGAATTTC TCGGCTCGAC AATCGAGAAG
ATCGCCTATG AGAAGGCGGG CATTTTCAAG CCGGGCGTTC CCGCCGTGCT CGGCGCGCAG
GATGAACGGG CGTTTGCCGT GCTGCGCATG GAGGCGGCGC GAAAAGGCGC GCCGGTGACG
GCGGCCTCGC GCGATTATTC GGCGCGGGAG GAGCATGGCC GCTTCATCTT CGAGGATGAA
CGCGGCCTCA TCGATCTGCC GCTGCCCCGG CTTCTCGGCC GCCACCAGCA CCAGAACGCC
GCAACCGCCA TCGCCGTCAC GCGGCTGATC GAGCCGGAGC TTGATGCGAA GGTCTATGAG
CGGGGCCTCA TCGAGGTCGA TTGGCCGGCG CGCCTGCAAA ATCTGGTCAA GGGCCGGATC
GCCGCGCTTG GCCCGAAAGG CGCCGAAATC TGGATCGACG GCGGCCACAA TGAAGACGGC
GGCCGCGCCA TCGCCGAAGC CATGGCGGAT TTCGCCGACA AGAGCGAACG CCCGCTCGTC
ATCATCTGCG GCACGTTGAC GACAAAGGAC ACCGGCGCTT TTCTGCGCTC CTTCAAGGGA
CTCGCCCAGC AGGTGATCGC CGTTCCCGTC GAGGCCGAGC ATTATGGCAA GCCCGCGCGT
GACGTCGCCG CCGCCGCAAG CGAAGTTGGC CTGCCAGCGG TCGCCTCCGA GAGCATCGAG
GCCGCGCTGC GTTTCCTCGC GGTCCAGCAC TGGCGTCAGC CGCCCCGCAT CCTGATCGCC
GGCAGCCTTT ATCTCGCCGG CGAAGCTTTG CTTTTAAACG GAACGCCGCC CGCTTGA
 
Protein sequence
MTPLDALLAR LYLLHKKDIE LTLGRMERLL VALGSPQKKL PPVIHVAGTN GKGSTIAFMR 
AILEAAGRRV HVYTSPHLVR FHERIRLGAA GGGKLVGDAQ LFDALSHCEE VNAGAPITFF
EFTTAAAFKI FSEAPADYLL LEVGLGGRGD ATNVISDPLA TVVTSISIDH PEFLGSTIEK
IAYEKAGIFK PGVPAVLGAQ DERAFAVLRM EAARKGAPVT AASRDYSARE EHGRFIFEDE
RGLIDLPLPR LLGRHQHQNA ATAIAVTRLI EPELDAKVYE RGLIEVDWPA RLQNLVKGRI
AALGPKGAEI WIDGGHNEDG GRAIAEAMAD FADKSERPLV IICGTLTTKD TGAFLRSFKG
LAQQVIAVPV EAEHYGKPAR DVAAAASEVG LPAVASESIE AALRFLAVQH WRQPPRILIA
GSLYLAGEAL LLNGTPPA