Gene Mpal_0558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0558 
Symbol 
ID7271974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp551392 
End bp553017 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content55% 
IMG OID643569205 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002465654 
Protein GI219851222 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000289733 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.238141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATAA CTGAAGAACA ACTGGAGGAG ATGCTCAGTC GTTACCCGGA CAAGGTCAAA 
AAGAACCGGA AGAAGCATCT CATCCTGAAG AACTCAGCCG AGGCCTGTCA GCAGATCGAG
GCGAACACCC GGACGATCCC GGGCATCATC TCCCAGCGTG GTTGTGCCTA TGCCGGCTGC
AAAGGTGTGG TCGTCGGACC GATCAAGGAC ATGGTACATA TCGTCCATGG TCCAGTCGGC
TGCTCATATT ACGCCTGGGG AACCCGGCGA AACAAAGCAC GGGCTGATGA CACGACTCCT
CCAGAGAAGG TCTTCACTCC CCTCTGCTTC ACCACCGATA TGCAGGAGAG CGACATCGTC
TTCGGCGGGG ACAAGAAACT GGCCAAGATG ATCGACCAGG TTGTCGAGAT CTTCCATCCA
CGGGCGGTCT CGATCTGCGC CACCTGTCCA GTCGGTCTGA TCGGTGATGA TATCAATGCA
GTGGCGAAGG CCGCTGAAGA GCGGCACGGG ATCCAGGTCC TCTCGTTCAA CTGCGAAGGG
TATAAAGGGG TCAGCCAGTC GGCAGGACAC CATATAGCGA ACAACAATCT GATGGAGCAT
GTGATCGGAA AGGGAACCGC CGAAAAGAAC CCGGACGATT ATGTGATCAA CATCCTCGGG
GAGTACAACA TCGGAGGGGA TGGATGGGAA CTGGAACGGA TCCTAAAGGA TATCGGCTAC
ACCCTGAACT GCATCATGAC AGGGGACGCC AGTTATGAGA AGATCCGGAA TCTGCACATT
GCCGACCTGA ATCTGGTTCA GTGCCACCGT TCGATCAACT ACATCGCCGA GATGATGGAG
ACCAAGTATG GTATCCCATG GCTGAAGGTC AACTTCATCG GGGTCACCGC CACCATGGCC
TCGCTCCGCG AGATCGCCCA GTGCTTCAAT GATGAAAAAC TGATCGAACG AACCGAGATG
GTGATCGCCA GGGAACTGGC TCGGGTGACC CCCGCATTGG AACAGTACCG GAAGATCTGC
CAGGGAAAGA CTGCATTCAT CTTTGTCGGA GGGTCCAGGA GTCACCATTA CCAGGCCCTT
CTCCGAGACC TGGGTATGGA CGTGGTGGTC GCAGGTTACG AGTTTGCCCA CCGCGATGAC
TATGAGGGAC GGCAAGTGAT CCCAACCATC AAGAGCGACG CCGACTCCAA GAACATTCCG
GAACTCCATC CAACCCCAGA CGAGGAGTTA TACCAGGAGG CACATGTCCA CCTGAAGATG
TCAAAGGAGA AGTACGACGA ACTATCCAGC AGGATTGCCT TCAACAGTTA CCAGGGGATG
ATCCCTGAGA TGAAGGACGG AGAGGTGATC CTGGATGACG CCAACCATCA CGAGGTCGAA
GAACTGATCA GGATGATAAA ACCGGATCTC TTCTTCTCAG GAGTCAGGGA CAAGTATATC
GTTCACAAGA TGGGTGTCCC GGCGAAACAG ATGCACTCGT ATGACTACAG CGGCCCTTAT
GCCGGCTTCA ACGGTTCCCT GATCTTCGCC GAGGACGTGG CGAACGCGCT GGTGACCCCG
GCCTGGAAAC TGGTAACGGC ACCATGGGAA GACCAAAACA GATCAAGGAG TGATAGCAAT
GCTTGA
 
Protein sequence
MSITEEQLEE MLSRYPDKVK KNRKKHLILK NSAEACQQIE ANTRTIPGII SQRGCAYAGC 
KGVVVGPIKD MVHIVHGPVG CSYYAWGTRR NKARADDTTP PEKVFTPLCF TTDMQESDIV
FGGDKKLAKM IDQVVEIFHP RAVSICATCP VGLIGDDINA VAKAAEERHG IQVLSFNCEG
YKGVSQSAGH HIANNNLMEH VIGKGTAEKN PDDYVINILG EYNIGGDGWE LERILKDIGY
TLNCIMTGDA SYEKIRNLHI ADLNLVQCHR SINYIAEMME TKYGIPWLKV NFIGVTATMA
SLREIAQCFN DEKLIERTEM VIARELARVT PALEQYRKIC QGKTAFIFVG GSRSHHYQAL
LRDLGMDVVV AGYEFAHRDD YEGRQVIPTI KSDADSKNIP ELHPTPDEEL YQEAHVHLKM
SKEKYDELSS RIAFNSYQGM IPEMKDGEVI LDDANHHEVE ELIRMIKPDL FFSGVRDKYI
VHKMGVPAKQ MHSYDYSGPY AGFNGSLIFA EDVANALVTP AWKLVTAPWE DQNRSRSDSN
A