Gene Information |
Locus tag | Msil_2042 |
Symbol | |
ID | 7094240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2215686 |
End bp | 2216966 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643465366 |
Product | light-independent protochlorophyllide reductase subunit N |
Protein accession | YP_002362344 |
Protein GI | 217978197 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01279] light-independent protochlorophyllide reductase, N subunit |
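The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes one stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2215686, 2216966)
print(length, protein_length(length))  # prints: 1281 426
```

Under these assumptions the results match the Gene Length (1281 bp) and Protein Length (426 aa) fields.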
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.147701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACGCCC TGGCGCAGGC CTTTCCGGCC GGCTGCGGAA CGGCCCCCGT GCTGCGCGAG CGCGGCCAGC GCGAAGTATT TTGCGGCCTA ACCGGCATCG TCTGGCTGCA CCGCAAGATC AGCGACGCCT TTTTCCTGGT CGTCGGCTCG CGCACCTGCG CGCATCTCAT CCAGTCCGCC GCCGGCGTGA TGATTTTCGC CGAGCCGCGT TTTGCGACCG CGATCATCGA CGAGCGCGAT CTCGCCGGCC TCGCCGACAT GCATGAAGAG CTCGACCGCG TCGTTGCGGA GCTGATGCGG CGGCGTCCCG ACATCAAGCT GCTGTTCCTC GTCGGCTCAT GCCCGTCGGA AGTGATCAAG CTCGACCTTG CCCGCGCGGC GCAGACGCTC AGCCGGAAAT TCGCCCCTGG CCTCAGGGTG CTCAACTATT CTGGCAGCGG CATCGAGACG ACCTTTACGC AGGGTGAGGA CGCCTGCCTT GCGGCGCTGG TCCCCGAGCT GCCGCAGGCC AGCGCCGATG CGCCGCCGTC TCTTCTCATA GCCGGCGCGC TCGCTGATAT TGTCGAAGAC CAGCTGCGCC GCATTTTTGG CGAGCTTGGC GTCGGCGAAG TTTCTTTCCT GCCGCCGCGC GGCAGCGGCG AACTTCCTGC GGTCGGCCCG AAGACCAGGC TTCTGCTGGC GCAGCCCTTC CTCGCGGCCA CAGCCAAGGC GCTCGAAGAG CGCGGCGCGC GGCGCCTGCC CGCGCCTTTT CCGCTCGGCG CGGAGGGAAC GGCGGCCTGG ATCGCAGAGG CGGCGCAGGC CTTCGGCGTC GATCCCGCGC GCGTCGCGGC GGTAACGGCG CCGCGCCGCA AACGCGCGCA AGAGGCGATG GAGCCATTTC GCCGCGCTCT TGCTGGCAAG AGCGTCTTTT TCTTCCCGGA TTCCCAGCTT GAGCCGCCGC TTGCTCGCTT CCTCTCGCGC GAATGCGGCA TGCGCCTCAT CGAGGTCGGA ACGCCCTTCC TGCATCGGCA GCACCTCCAG CCCGAACTGG ATCTGCTGCC GGAGGGAACG CTAATCAGCG AAGGCCAGGA CGTCGACCGT CAGCTTGACC GCTGCCGGGC GGAGAAACCC GATCTCGTCG TCTGCGGCCT TGGCCTCGCC AATCCACTGG AAGCCGAGGG CATGACCACC AAATGGTCGA TCGAGCTTCT CTTCTCGCCA ATCCAGGGCT TCGAACAGGC GGCCGATCTC GCCGCGTTGT TCGCCCGCCC GATCGACCGC AGACTTCGGC TGAGGATCTA G
Protein sequence | MNALAQAFPA GCGTAPVLRE RGQREVFCGL TGIVWLHRKI SDAFFLVVGS RTCAHLIQSA AGVMIFAEPR FATAIIDERD LAGLADMHEE LDRVVAELMR RRPDIKLLFL VGSCPSEVIK LDLARAAQTL SRKFAPGLRV LNYSGSGIET TFTQGEDACL AALVPELPQA SADAPPSLLI AGALADIVED QLRRIFGELG VGEVSFLPPR GSGELPAVGP KTRLLLAQPF LAATAKALEE RGARRLPAPF PLGAEGTAAW IAEAAQAFGV DPARVAAVTA PRRKRAQEAM EPFRRALAGK SVFFFPDSQL EPPLARFLSR ECGMRLIEVG TPFLHRQHLQ PELDLLPEGT LISEGQDVDR QLDRCRAEKP DLVVCGLGLA NPLEAEGMTT KWSIELLFSP IQGFEQAADL AALFARPIDR RLRLRI
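The sequence fields can be cross-checked against the GC content and protein fields in a similar way. This is a minimal sketch, assuming the gene sequence is written 5' to 3' on the coding strand (it begins with ATG although the gene lies on the minus strand) and that the 10-base blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is built here. `clean`, `gc_fraction`, and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence listed above.
print(translate(clean("ATGAACGCCC TGGCGCAG")))  # prints: MNALAQ
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` gives values that can be compared with the GC content and Protein sequence fields.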