Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0230 |
Symbol | nifD |
ID | 3102880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 227222 |
End bp | 228682 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637169452 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_112765 |
Protein GI | 53802573 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.403364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCA CAGTTGAGCA AGTCAAACAG AGAAACAAGG ACCTCATCAA AGAGGTGCTT GAGGTTTATC CCGACAAGAC GGCCAAGCGC CGCGCCAAGC ACCTTGGCAC CTTCGAAGAA GGCAAGCCCG ACTGCGGGGT CAAATCCAAC ATCAAGTCGA TCCCCGGTGT CATGACCATC CGCGGCTGCG CCTACGCCGG TTCCAAGGGC GTGGTCTGGG GCCCGATCAA GGACATGATC CACATCAGCC ACGGTCCGGT CGGCTGCGGC CAGTATTCCT GGGCGTCGCG CCGCAACTAC TACATCGGCA CCACCGGCAT CGACACCTTC GTCACCATGC AGTTCACCTC CGACTTCCAG GAGAAGGACA TCGTCTTCGG CGGCGACAAG AAGCTGGAGA AGATCATCGA CGAAATCGAG GAACTGTTCC CGCTGAACCA CGGCATCACC GTTCAGTCGG AATGCCCGAT CGGCCTCATC GGCGACGACA TCGAAGCGGT TTCCAAGAAG AAATCCAAGG AATACAGCGG CAAGACCATC GTTCCCGTCC GTTGCGAAGG TTTCCGCGGC GTGTCCCAGT CGCTGGGCCA TCATATCGCC AACGACGCGG TGCGCGACTG GGTGTTCGAC AAGGCCGGCG ACAAGCACCC CGAGTTCCAG TCCACGCCTT ACGACGTCGC CATCATCGGC GACTACAACA TCGGCGGCGA TGCCTGGTCT TCCCGCATCC TGCTGGAAGA AATGGGGCTG CGCGTGATCG CCCAGTGGTC CGGTGACGGC ACCCTGGCCG AGCTGGAAAA CACGCCGAAG GCCAAGCTGA ACGTGCTGCA CTGCTACCGT TCGATGAACT ACATCTCCCG CCACCTGGAA GAGAAGTACG GCGTGCCCTG GGTCGAGTAC AACTTCTTCG GTCCCACCAA GATCGCCGAG TCGCTGCGCA AGATCGCCAG CTTCTTCGAC GACAAGATCA AGGAAGGTGC GGAGCGCGTC ATCGCCAAGT ACCAGCCGCT GATGGATGCG GTGATCGCGA AGTACCGTCC GCGTCTGGAA GGCAAGAAGG TCATGCTGTT CGTCGGTGGC CTGCGTCCGC GTCACGTGAT CGGCGCCTAC GAGGACCTGG GAATGGAGAT CGTCGGTACC GGCTACGAGT TCGGCCACAA CGACGACTAC CAGCGCACCA CGCACTACGT CAAGGACGGC ACGCTGATCT ATGACGACGT GACGGGCTAT GAGTTCGAGA AGTTCGTCGA AGCCATCCAG CCCGACCTGG TCGGCTCCGG CATCAAGGAA AAGTACGTCT TCCAGAAGAT GGGCGTGCCG TTCCGCCAGA TGCACTCCTG GGACTATTCC GGTCCGTACC ACGGTTATGA CGGCTTCGCC ATTTTCGCCA GGGACATGGA CATGGCCATC AACAACCCGG TCTGGGGGAT GACCAAGGCC CCGTGGAAGA GCGCGGCTTA A
|
Protein sequence | MSLTVEQVKQ RNKDLIKEVL EVYPDKTAKR RAKHLGTFEE GKPDCGVKSN IKSIPGVMTI RGCAYAGSKG VVWGPIKDMI HISHGPVGCG QYSWASRRNY YIGTTGIDTF VTMQFTSDFQ EKDIVFGGDK KLEKIIDEIE ELFPLNHGIT VQSECPIGLI GDDIEAVSKK KSKEYSGKTI VPVRCEGFRG VSQSLGHHIA NDAVRDWVFD KAGDKHPEFQ STPYDVAIIG DYNIGGDAWS SRILLEEMGL RVIAQWSGDG TLAELENTPK AKLNVLHCYR SMNYISRHLE EKYGVPWVEY NFFGPTKIAE SLRKIASFFD DKIKEGAERV IAKYQPLMDA VIAKYRPRLE GKKVMLFVGG LRPRHVIGAY EDLGMEIVGT GYEFGHNDDY QRTTHYVKDG TLIYDDVTGY EFEKFVEAIQ PDLVGSGIKE KYVFQKMGVP FRQMHSWDYS GPYHGYDGFA IFARDMDMAI NNPVWGMTKA PWKSAA
|
| |