Gene Information |
Locus tag | Msil_3631 |
Symbol | |
ID | 7092904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3995068 |
End bp | 3996534 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643466919 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_002363878 |
Protein GI | 217979731 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
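
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length_bp = gene_length(3995068, 3996534)
print(length_bp)                  # 1467
print(protein_length(length_bp))  # 488
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.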
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.486898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGG CGCCTGCTGA ATCCATCGAA GACATCAAAG CCCGGAACAA GGCCCTCATC AAGGAGGTCC TCGAAGTCTA TCCGGAGAAA ACAGCCAAGC GACGCGCCAA GCATCTCGGC ACGTTCGAGG AAGGCAAGCC TGACTGCGGC GTTAAGTCGA ACATCAAATC GATCCCCGGC GTCATGACGA TCCGCGGCTG CGCTTATGCC GGCTCGAAGG GCGTCGTGTG GGGTCCGATC AAGGACATGA TCCACATCAG CCATGGCCCC GTCGGCTGCG GTCAGTATTC CTGGGCCTCG CGGCGCAACT ATTACATCGG CACCACGGGC GTCGACACCT TCGGCACGAT GCAGTTCACC TCCGATTTCC AGGAGAAGGA CATTGTGTTC GGCGGCGACA AGAAGCTCGC CAAAATCATG GACGAAATTC AGGTCCTGTT CCCCCTGAAC AAGGGCATCA CCGTTCAGTC CGAGTGCCCG ATCGGCCTCA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGTCGAA GGAATATGAA GGCAAGACGA TCGTTCCCGT CCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC CACCACATCG CCAACGACGC CGTCCGCGAC TGGGTGTTCG ACAAGCAGGA GGGCAAGCCC GCCCGCTTCG AGCAGACCGA CTATGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC GACGCCTGGT CTTCCCGCAT CCTGCTCGAG GAAATGGGTC TTCGCGTGAT TGCGCAATGG TCCGGCGACG GCACCATCGC CGAGCTCGAG GCGACGCCGA AGGCGAAGCT GAATGTGCTG CACTGCTACC GTTCGATGAA CTACATCTCG CGCCACATGG AAGAGAAATA CGGTATACCG TGGGTCGAGT ATAATTTCTT CGGGCCGACC AAGATCGAAG AATCGCTGCG CAAGATCGCC AGCCATTTCG ACGACAAGAT CAAGGACGGC GCCGAGCGCG TCATCGCCAA ATATCGCGCG CTGACCGACG CCGTCATCGC CAAATACCGG CCGCGCCTCG AAGGCAAGAC CGTTATGCTG TTCGTCGGCG GCCTGCGTCC GCGCCACGTG ATCGGCGCTT ACGAGGATCT CGGCATGGAG ATCGTCGGGA CGGGCTACGA GTTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAT GTGAAGGACG GCACGCTGAT CTATGACGAC GTGACCGGCT ACGAGTTCGA GAAATTCGTC GAAAAGATCC AGCCCGATCT CGTCGGCTCC GGCATCAAGG AAAAATACGT CTTCCAGAAA ATGGGCGTGC CTTTCCGCCA GATGCACTCG TGGGACTACT CGGGCCCCTA TCACGGCTAC GATGGCTTCG CGATCTTCGC GCGCGACATG GACATGGCGA TCAACTCGCC GGTCTGGAAG CTGGCCAAGA CCCCCTGGGC GGCCTGA
|
Protein sequence | MSLAPAESIE DIKARNKALI KEVLEVYPEK TAKRRAKHLG TFEEGKPDCG VKSNIKSIPG VMTIRGCAYA GSKGVVWGPI KDMIHISHGP VGCGQYSWAS RRNYYIGTTG VDTFGTMQFT SDFQEKDIVF GGDKKLAKIM DEIQVLFPLN KGITVQSECP IGLIGDDIEA VSKAKSKEYE GKTIVPVRCE GFRGVSQSLG HHIANDAVRD WVFDKQEGKP ARFEQTDYDV AIIGDYNIGG DAWSSRILLE EMGLRVIAQW SGDGTIAELE ATPKAKLNVL HCYRSMNYIS RHMEEKYGIP WVEYNFFGPT KIEESLRKIA SHFDDKIKDG AERVIAKYRA LTDAVIAKYR PRLEGKTVML FVGGLRPRHV IGAYEDLGME IVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV EKIQPDLVGS GIKEKYVFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DMAINSPVWK LAKTPWAA
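
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the feature lies on the minus strand) and uses the amino-acid assignments of translation table 11, which match the standard genetic code; table 11 differs in which codons may act as start codons, and that is not modelled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCCTGG CGCCTGCTGA ATCCATCGAA"))  # MSLAPAESIE
```

The ten residues produced match the first block of the protein sequence listed above; passing the full gene sequence to `translate` works the same way.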
|
| |