Gene Moth_0551 details

Gene Information

Locus tag: Moth_0551
Symbol:
ID: 3831451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 572664
End bp: 574124
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 57%
IMG OID: 637828492
Product: nitrogenase component I, alpha chain
Protein accession: YP_429424
Protein GI: 83589415
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
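
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (as the reported gene length implies) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(572664, 574124)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```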


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.39708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATGG TAAAAATGAA GTGCGACGAG CTCATTCCCG AACGGTATAA GCATATTTAC 
TACACGGAAA AAGGGCGGTC CGTCATTCCA GCCTGCAATA TCGCCACCAT TCCCGGAGAT
ATGACGGAAC GCGGGTGTGC CTTCGCCGGT GCCCGGGGCG TAATAGGTGG GCCCATTGCC
GATGTTATTG CTATGGTTCA TGCACCCGTA GGGTGCGCCT GGTATACCTG GGGTACCCGC
CGCCACCTGT CCGACCTCTA TACCTGGGCC ACTCCCACCC GCCTCACCAA TGTGGCCTTT
AACCGGCGCT ACTGCGTCTG TACCGACATG CAGGAGAAAG ACGTGGTTTT CGGTGGCATA
AAAAAGCTGG AGCAGGCCTG CCTGGAGGCC ATCAGGCTCT TCCCCGAAGC GAAGGGGTTG
ATTATTTTCA CCACCTGTAC CACCGGCCTC ATCGGTGACG ATGTCCAGGC GGTGGCCCGG
AGCGTGGAAA AGAAGACCGG CCGGCTGGTC TTCACCGCCG AATCCCCCGG CTGCTCCGGG
GTGAGCCAGT CCAAAGGGCA CCACGACTTC AACATCCAGT TTTACCGCCA GGTACGCAGT
TTAAGGGAGC GGCGGCCGGA ATTAAAGATG CCCGAAACAG AGAAAACCCC GTACGATATT
TGCCTCATTG GCGACTATAA CATGGACTGG GACTTAAAGG CGATACGTCC CCTGTTTGAA
AAGATGGGTT TGCGTATCGT GGCCGTTTTC TCAGGGAATG AACGCATCGA AAACCTGGTC
AAGATGCCGG ACGTCAAATT GAACGTGGTC CACTGCCAGC GCTCCGCCGA ATATATCGCC
CATATAGAGA AGGACGGCTA TAACATCCCC TTTATACGGG TCTCTCTCTA CGGTATCGAG
CAGACCTGTA AGGCCCTGCG GGAAACGGCT GCTTTCTTCG GCCTGGAGGA GCGGGCCGAA
GCGGTGATTG CCGAAGAGAT GGCCCGGGTG GAAAAGAGCC TGGCCTTTTA CCGTGAGAAG
CTCCAGGGTA AGCGGGTGGC CATCTATGTG GGCGGGCCGC GGGTCTGGCA CTGGATCAAA
TTGATGGAGG AACTGGGCAT GCAGGTGGTG GCAGTAGCCT GCACCTTTGC CCACGAAGAC
GACTACGAAA AGATCAATGC CCGGGCGCCG GAGGGGATGC TGGTCATCGA CGCCCCCAAT
GAGTTTGAGC TTGAAGAGAT GCTCACGTCA ACTAAACCCG ATCTCTTTTT AACTGGCTTG
AAGGAGAAAT ATCTGGGGCG CAAAATGGGT ATTCCCACGG TGAATTCCCA CTCCTACGAG
AAGGGCCCCT ATGAGGGGTT TGCCGGCATG GTTAATTTCG CCCGGGATAT CTACCAGGGC
ATATACGCCC CGGTATGGAA GTTCCAGTGG GGCCTCGACA GCACGCCGGG TATGACGGGG
AGGGATGAGC AATGCAGTTA A
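
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before computing statistics such as GC content. The following is a minimal sketch of that calculation; the helper names are hypothetical, and the demonstration uses only the first row of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, used as a small demonstration only.
first_row = "ATGCCCATGG TAAAAATGAA GTGCGACGAG CTCATTCCCG AACGGTATAA GCATATTTAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over all rows of the gene sequence gives a value that can be compared with the GC content reported in the Gene Information section.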
 
Protein sequence
MPMVKMKCDE LIPERYKHIY YTEKGRSVIP ACNIATIPGD MTERGCAFAG ARGVIGGPIA 
DVIAMVHAPV GCAWYTWGTR RHLSDLYTWA TPTRLTNVAF NRRYCVCTDM QEKDVVFGGI
KKLEQACLEA IRLFPEAKGL IIFTTCTTGL IGDDVQAVAR SVEKKTGRLV FTAESPGCSG
VSQSKGHHDF NIQFYRQVRS LRERRPELKM PETEKTPYDI CLIGDYNMDW DLKAIRPLFE
KMGLRIVAVF SGNERIENLV KMPDVKLNVV HCQRSAEYIA HIEKDGYNIP FIRVSLYGIE
QTCKALRETA AFFGLEERAE AVIAEEMARV EKSLAFYREK LQGKRVAIYV GGPRVWHWIK
LMEELGMQVV AVACTFAHED DYEKINARAP EGMLVIDAPN EFELEEMLTS TKPDLFLTGL
KEKYLGRKMG IPTVNSHSYE KGPYEGFAGM VNFARDIYQG IYAPVWKFQW GLDSTPGMTG
RDEQCS
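
The protein sequence can be reproduced from the gene sequence by translating it in frame 1. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed, which does not matter here because the gene starts with ATG. The `translate` helper is illustrative, not a library function, and it does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGCCCATGG TAAAAATGAA GTGCGACGAG"))  # MPMVKMKCDE
print(translate("AGGGATGAGC AATGCAGTTA A"))           # RDEQCS
```

The two fragments match the first and last residues of the protein sequence, and the final codon TAA is the stop codon.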