Gene Msil_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2402 
Symbol 
ID7093954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2617160 
End bp2618068 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content65% 
IMG OID643465724 
Productformylmethanofuran--tetrahydromethanopterin formyltransferase 
Protein accessionYP_002362694 
Protein GI217978547 
COG category[C] Energy production and conversion 
COG ID[COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase 
TIGRFAM ID[TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCTA TGATCGCCAA TGGCGTCCGC ATCGACGAAT CCTTCGCCGA AGCGTTCCCG 
ATGCGCGGAA CCGCGATCAT CATCACCGCG CCCAATCTGA AATGGGCGCG GCAGGCCGCA
GTGACGATGA CCGGCTTCGC CACCTCCGTC ATCGGCTGCA AGGTCGAGGC CGGAATCGAC
CGCGATGCCC CGGAGAGCGA GACGCCGGAC GGACGGCCCG GCGTCCGCGT CCTCATGTTT
TCCATGTCGA CCGACATGCT GCAGACGCAG CTTGTGACGC GCGCGGGCCA ATGCGTGCTG
ACCTCGCCGG GATCGGCCTG CTTCAACGAC CTCGACGCGC CGGACCGCAT GCCGATCGGC
GACCAGCTGC GCTATTTCGG CGACGGCTGG CAGATTTCGA AGAAATTTCT CGGTCGCCAT
TTCTGGCGCG TGCCGGTGAT GGACGGCGAA TTCTTGTGCG AAGGGACCGT CGGCCTCACC
AAAAAGGCCG TCGGCGGCGG CAATCTTCTC GTCATGGGCG CGAATTTCGC GACCACCATG
AACGCCTGCG AACACGCCAT CGAGGCCATG AATGCGGTCG ACGGCGCGAT CATGCCGTTT
CCGGGCGGCA TCGTGCGCTC GGGATCGAAG GTCGGCTCCA AATATGCCGG CGTTCCGGCC
TCGACCAATG ACGCCTATTG TCCGACCCTG CGCGGCGTCG CCAAAAGCGC GCTCGAGGAA
GACATCGGCT GCGTGCTCGA GATCGTCATC GACGGCCTCG ACGAGAAGGC GGTCGCGGAG
GCGATGCGCG CCGGCCTTGC GGCCATCGTC AAGCTCGGGC CCAAGGACGG CGCACTGCGC
GTGGGCGCCG GTAATTACGG CGGCAAGCTC GGCCCGTTCC ACTTCCATTT GAAGGATCTG
CTGCCGTGA
 
Protein sequence
MRAMIANGVR IDESFAEAFP MRGTAIIITA PNLKWARQAA VTMTGFATSV IGCKVEAGID 
RDAPESETPD GRPGVRVLMF SMSTDMLQTQ LVTRAGQCVL TSPGSACFND LDAPDRMPIG
DQLRYFGDGW QISKKFLGRH FWRVPVMDGE FLCEGTVGLT KKAVGGGNLL VMGANFATTM
NACEHAIEAM NAVDGAIMPF PGGIVRSGSK VGSKYAGVPA STNDAYCPTL RGVAKSALEE
DIGCVLEIVI DGLDEKAVAE AMRAGLAAIV KLGPKDGALR VGAGNYGGKL GPFHFHLKDL
LP