Gene Msil_2042 details

Gene Information

Locus tag: Msil_2042
Symbol:
ID: 7094240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2215686
End bp: 2216966
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 67%
IMG OID: 643465366
Product: light-independent protochlorophyllide reductase subunit N
Protein accession: YP_002362344
Protein GI: 217978197
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01279] light-independent protochlorophyllide reductase, N subunit
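
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the gene record.
START_BP = 2215686
END_BP = 2216966


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1281 426
```

Under these assumptions the computed values agree with the listed gene length (1281 bp) and protein length (426 aa).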


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.147701
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC TGGCGCAGGC CTTTCCGGCC GGCTGCGGAA CGGCCCCCGT GCTGCGCGAG 
CGCGGCCAGC GCGAAGTATT TTGCGGCCTA ACCGGCATCG TCTGGCTGCA CCGCAAGATC
AGCGACGCCT TTTTCCTGGT CGTCGGCTCG CGCACCTGCG CGCATCTCAT CCAGTCCGCC
GCCGGCGTGA TGATTTTCGC CGAGCCGCGT TTTGCGACCG CGATCATCGA CGAGCGCGAT
CTCGCCGGCC TCGCCGACAT GCATGAAGAG CTCGACCGCG TCGTTGCGGA GCTGATGCGG
CGGCGTCCCG ACATCAAGCT GCTGTTCCTC GTCGGCTCAT GCCCGTCGGA AGTGATCAAG
CTCGACCTTG CCCGCGCGGC GCAGACGCTC AGCCGGAAAT TCGCCCCTGG CCTCAGGGTG
CTCAACTATT CTGGCAGCGG CATCGAGACG ACCTTTACGC AGGGTGAGGA CGCCTGCCTT
GCGGCGCTGG TCCCCGAGCT GCCGCAGGCC AGCGCCGATG CGCCGCCGTC TCTTCTCATA
GCCGGCGCGC TCGCTGATAT TGTCGAAGAC CAGCTGCGCC GCATTTTTGG CGAGCTTGGC
GTCGGCGAAG TTTCTTTCCT GCCGCCGCGC GGCAGCGGCG AACTTCCTGC GGTCGGCCCG
AAGACCAGGC TTCTGCTGGC GCAGCCCTTC CTCGCGGCCA CAGCCAAGGC GCTCGAAGAG
CGCGGCGCGC GGCGCCTGCC CGCGCCTTTT CCGCTCGGCG CGGAGGGAAC GGCGGCCTGG
ATCGCAGAGG CGGCGCAGGC CTTCGGCGTC GATCCCGCGC GCGTCGCGGC GGTAACGGCG
CCGCGCCGCA AACGCGCGCA AGAGGCGATG GAGCCATTTC GCCGCGCTCT TGCTGGCAAG
AGCGTCTTTT TCTTCCCGGA TTCCCAGCTT GAGCCGCCGC TTGCTCGCTT CCTCTCGCGC
GAATGCGGCA TGCGCCTCAT CGAGGTCGGA ACGCCCTTCC TGCATCGGCA GCACCTCCAG
CCCGAACTGG ATCTGCTGCC GGAGGGAACG CTAATCAGCG AAGGCCAGGA CGTCGACCGT
CAGCTTGACC GCTGCCGGGC GGAGAAACCC GATCTCGTCG TCTGCGGCCT TGGCCTCGCC
AATCCACTGG AAGCCGAGGG CATGACCACC AAATGGTCGA TCGAGCTTCT CTTCTCGCCA
ATCCAGGGCT TCGAACAGGC GGCCGATCTC GCCGCGTTGT TCGCCCGCCC GATCGACCGC
AGACTTCGGC TGAGGATCTA G
 
Protein sequence
MNALAQAFPA GCGTAPVLRE RGQREVFCGL TGIVWLHRKI SDAFFLVVGS RTCAHLIQSA 
AGVMIFAEPR FATAIIDERD LAGLADMHEE LDRVVAELMR RRPDIKLLFL VGSCPSEVIK
LDLARAAQTL SRKFAPGLRV LNYSGSGIET TFTQGEDACL AALVPELPQA SADAPPSLLI
AGALADIVED QLRRIFGELG VGEVSFLPPR GSGELPAVGP KTRLLLAQPF LAATAKALEE
RGARRLPAPF PLGAEGTAAW IAEAAQAFGV DPARVAAVTA PRRKRAQEAM EPFRRALAGK
SVFFFPDSQL EPPLARFLSR ECGMRLIEVG TPFLHRQHLQ PELDLLPEGT LISEGQDVDR
QLDRCRAEKP DLVVCGLGLA NPLEAEGMTT KWSIELLFSP IQGFEQAADL AALFARPIDR
RLRLRI
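
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool, and only the first 30 and last 21 nucleotides of the gene are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 21 nt of the gene sequence, copied from the record.
print(translate("ATGAACGCCC TGGCGCAGGC CTTTCCGGCC"))  # prints: MNALAQAFPA
print(translate("AGACTTCGGC TGAGGATCTA G"))           # prints: RLRLRI
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final `TAG` codon is the stop that is counted in the gene length but not in the protein length.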