Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0551 |
Symbol | |
ID | 3831451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 572664 |
End bp | 574124 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828492 |
Product | nitrogenase component I, alpha chain |
Protein accession | YP_429424 |
Protein GI | 83589415 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.39708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGG TAAAAATGAA GTGCGACGAG CTCATTCCCG AACGGTATAA GCATATTTAC TACACGGAAA AAGGGCGGTC CGTCATTCCA GCCTGCAATA TCGCCACCAT TCCCGGAGAT ATGACGGAAC GCGGGTGTGC CTTCGCCGGT GCCCGGGGCG TAATAGGTGG GCCCATTGCC GATGTTATTG CTATGGTTCA TGCACCCGTA GGGTGCGCCT GGTATACCTG GGGTACCCGC CGCCACCTGT CCGACCTCTA TACCTGGGCC ACTCCCACCC GCCTCACCAA TGTGGCCTTT AACCGGCGCT ACTGCGTCTG TACCGACATG CAGGAGAAAG ACGTGGTTTT CGGTGGCATA AAAAAGCTGG AGCAGGCCTG CCTGGAGGCC ATCAGGCTCT TCCCCGAAGC GAAGGGGTTG ATTATTTTCA CCACCTGTAC CACCGGCCTC ATCGGTGACG ATGTCCAGGC GGTGGCCCGG AGCGTGGAAA AGAAGACCGG CCGGCTGGTC TTCACCGCCG AATCCCCCGG CTGCTCCGGG GTGAGCCAGT CCAAAGGGCA CCACGACTTC AACATCCAGT TTTACCGCCA GGTACGCAGT TTAAGGGAGC GGCGGCCGGA ATTAAAGATG CCCGAAACAG AGAAAACCCC GTACGATATT TGCCTCATTG GCGACTATAA CATGGACTGG GACTTAAAGG CGATACGTCC CCTGTTTGAA AAGATGGGTT TGCGTATCGT GGCCGTTTTC TCAGGGAATG AACGCATCGA AAACCTGGTC AAGATGCCGG ACGTCAAATT GAACGTGGTC CACTGCCAGC GCTCCGCCGA ATATATCGCC CATATAGAGA AGGACGGCTA TAACATCCCC TTTATACGGG TCTCTCTCTA CGGTATCGAG CAGACCTGTA AGGCCCTGCG GGAAACGGCT GCTTTCTTCG GCCTGGAGGA GCGGGCCGAA GCGGTGATTG CCGAAGAGAT GGCCCGGGTG GAAAAGAGCC TGGCCTTTTA CCGTGAGAAG CTCCAGGGTA AGCGGGTGGC CATCTATGTG GGCGGGCCGC GGGTCTGGCA CTGGATCAAA TTGATGGAGG AACTGGGCAT GCAGGTGGTG GCAGTAGCCT GCACCTTTGC CCACGAAGAC GACTACGAAA AGATCAATGC CCGGGCGCCG GAGGGGATGC TGGTCATCGA CGCCCCCAAT GAGTTTGAGC TTGAAGAGAT GCTCACGTCA ACTAAACCCG ATCTCTTTTT AACTGGCTTG AAGGAGAAAT ATCTGGGGCG CAAAATGGGT ATTCCCACGG TGAATTCCCA CTCCTACGAG AAGGGCCCCT ATGAGGGGTT TGCCGGCATG GTTAATTTCG CCCGGGATAT CTACCAGGGC ATATACGCCC CGGTATGGAA GTTCCAGTGG GGCCTCGACA GCACGCCGGG TATGACGGGG AGGGATGAGC AATGCAGTTA A
|
Protein sequence | MPMVKMKCDE LIPERYKHIY YTEKGRSVIP ACNIATIPGD MTERGCAFAG ARGVIGGPIA DVIAMVHAPV GCAWYTWGTR RHLSDLYTWA TPTRLTNVAF NRRYCVCTDM QEKDVVFGGI KKLEQACLEA IRLFPEAKGL IIFTTCTTGL IGDDVQAVAR SVEKKTGRLV FTAESPGCSG VSQSKGHHDF NIQFYRQVRS LRERRPELKM PETEKTPYDI CLIGDYNMDW DLKAIRPLFE KMGLRIVAVF SGNERIENLV KMPDVKLNVV HCQRSAEYIA HIEKDGYNIP FIRVSLYGIE QTCKALRETA AFFGLEERAE AVIAEEMARV EKSLAFYREK LQGKRVAIYV GGPRVWHWIK LMEELGMQVV AVACTFAHED DYEKINARAP EGMLVIDAPN EFELEEMLTS TKPDLFLTGL KEKYLGRKMG IPTVNSHSYE KGPYEGFAGM VNFARDIYQG IYAPVWKFQW GLDSTPGMTG RDEQCS
|
| |