Gene Msil_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2052 
Symbol 
ID7094250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2226184 
End bp2227173 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content63% 
IMG OID643465376 
Productchlorophyllide reductase iron protein subunit X 
Protein accessionYP_002362354 
Protein GI217978207 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1348] Nitrogenase subunit NifH (ATPase) 
TIGRFAM ID[TIGR02016] chlorophyllide reductase iron protein subunit X 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0401365 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGA AGCTTCGCGC CGAGGCCGCG CTTGAAGCGG AAGCCGCGCT TGAAGCGGAA 
GCCGCGCCCG CCGCCGCGCC GACGAAAGCG ACACAGATCA TCGCCATCTA CGGCAAGGGC
GGCATCGGCA AGAGCTTCAC CCTCGCCAAT CTCTCCTACA TGATGGCGCA GCAGGGCAAG
AAAGTGCTGC TCATTGGCTG CGATCCCAAA AGCGACACCA CCTCTCTCCT TTTCGGCGGC
AAGGCCTGCC CGACAATCCT TGAAACCTCG AGCCGCAAGA AACTTGCGGG CGCGCAGGTC
GAGATCGGCG ATGTCTGCTT CAAGCGCGAC GGCGTGTTCG CGATGGAGCT CGGCGGCCCG
GAAGTCGGCC GCGGCTGCGG CGGCCGCGGC ATCATTCACG GCTTCGAGCT ACTTGAAAAG
CTCGGCTTCC ACGAATGGGA TTTCGACTAT GTGCTGCTCG ATTTCCTCGG CGACGTGGTC
TGCGGCGGCT TCGGCCTGCC GATCGCGCGC GACATGTGTC AGAAAGTGAT CGTCGTCGGA
TCGAACGATC TGCAGTCATT ATATGTCGCT AATAATGTTT GTTCCGCCGT CGATTATTTC
CGCAGGCTCG GAGGCAATGT CGGCGTCGCC GGCCTCGTCA TCAACAAGGA CGACCATACC
GGAGAGGCGC AGGCTTTCGC AAAATCCGTC GGCATTCCGG TTCTGGCCTC GATCCCGGCC
GACGACGACA TCCGGCGGAA GAGCGCCAGC TACGAGATCA TTGGCCGGCC TGGCGGACAA
TGGGCGTCCG TGTTCGAAGA GCTCGCCCGC AACATCGCCG AGGCGCCGCC AGTGCGGCCG
TCGCCACTGA CGCAGGACGG GCTGCTCGAA TTGTTCTCCG GCGACGCCGT CGGCAGGGGC
GTCGTGCTCC AGTCGGCGAG CGCGACCGAC ATGATGGGCG CGGCCCGCCT TGAGAAGAAA
TCGCTCGAAA TCATCTACGA CGCCGTTTGA
 
Protein sequence
MALKLRAEAA LEAEAALEAE AAPAAAPTKA TQIIAIYGKG GIGKSFTLAN LSYMMAQQGK 
KVLLIGCDPK SDTTSLLFGG KACPTILETS SRKKLAGAQV EIGDVCFKRD GVFAMELGGP
EVGRGCGGRG IIHGFELLEK LGFHEWDFDY VLLDFLGDVV CGGFGLPIAR DMCQKVIVVG
SNDLQSLYVA NNVCSAVDYF RRLGGNVGVA GLVINKDDHT GEAQAFAKSV GIPVLASIPA
DDDIRRKSAS YEIIGRPGGQ WASVFEELAR NIAEAPPVRP SPLTQDGLLE LFSGDAVGRG
VVLQSASATD MMGAARLEKK SLEIIYDAV