Gene Mpal_0556 details


Gene Information

Locus tag: Mpal_0556
Symbol:
ID: 7271972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 548610
End bp: 549998
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 57%
IMG OID: 643569203
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_002465652
Protein GI: 219851220
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
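
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon (the record does not state either convention; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(548610, 549998)
print(length)                  # 1389
print(protein_length(length))  # 462
```

Under these assumptions the computed values agree with the 1389 bp and 462 aa reported in the record.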


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.263807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.366302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAGA CAGGCGTACT GGACGTTGGG TCTGGTGCAT GTATCGATGA ACGGCAGCGA 
TCGATCATAA CCACCGGGAG AAATAAGAGC AGTATCCACT GCTCTGACGA TAGCATGGCC
GGCTCAGTCA GCCAGCGGGC CTGCGTCTTC TGTGGAGCAC GGGTGGTACT GAACCCGGTA
ACGGACGCGA TTCACCTGGT CCACGGTCCG ATCGGCTGTG CCACGTACAC TATGGACATC
CGGGGGAGCC TCTCCAGCGG GTCAGAACTG TATCGAAACA GTTTCTCCAC CGATCTCAAA
GAGAAGGATG TCGTCTTTGG CGGAGAGAAG AAACTCGCCG CCTGCATCGA TGAGGTCGTT
ACCAAGTATC ATCCTCCGGC AGTGTTCGTG TACTCGACCT GCGTGGTTGG GATGATCGGC
GACGATATCA TCGCTGTCTG CAAGGCTGCC TCAGCGCGGC ACCAGATCGA TGTGATCCCG
GTGGAGTCCA CCGGGTTCAT GTCTGGGAAC AAGGTGATCG GCTACCGGGC AGCGGCTGAA
GCACTGCTGA AACTGATCAC CCCCAAAGAA GGGGAGACCG TCCAGATGAC CAGGAAGCTG
AACTTCCTTG GAGAATACAA TCTGGGCGGG GAGAAATGGC TCGTCGAACG CTATCTCAGA
GAGATCGGGA TCGAGATCAA CGTGGCCTTC ACCGGGGACT CTACAGTGGC CGCCCTGAAG
CAGGCACCAG GGGCATGCCT GAATATTGTG CAGTGCACCG GGTCCATGCA CTGGGTGGCT
CAGAACCTGG AACGGACATT CGGTACCCCT TATATCGATG TGAACTTCTT CGGCGCAGAG
AATACGGCAG AGAGCCTGCG AAAGATCGCC GAATTCTATG AGGACGAGGA GATCATGCGC
AGGACCGAGG TCCTGATCGA TCGGGAGATG AAGAATATCC AGCCAGCGAT CCAGAAGTAT
CGGGAGAAAC TGACCGGAAA ACGTGCGGCC ATCTATGTGG GCGGCGCCTT CAAAGCGGTG
GCCATCATCC GCCAGTTGCA GGAACTGGGG ATGGAGGTGG TCTTCACCGG GACGCAGACC
GGCAAACAGG AGGAGTACGA CCGGATCCGT GATATGGTAG ACGAAGGGAC GGTCATCATC
GACGACGCCA ATCCGGCAGA GCTTGAGAAG TTCCTGCTGG AGAAGGAGGT CGACATGATG
GCTGGCGGTG TGAAGGAACG GGTACTCGCC TTTAAACTGG GGATCGGCTT CGTCGATCAC
AACCATGACC GGAAGGAATG CCTGGCCGGC TTCGAAGGAG CCGTCCGGTT CGCGAGAGAG
GTATACATCA CCACCTGCTC CCCAGTCTGG AAGCACCTGA AACAGACTCC CCCTTGTAAG
GAGGACTGA
 
Protein sequence
MEETGVLDVG SGACIDERQR SIITTGRNKS SIHCSDDSMA GSVSQRACVF CGARVVLNPV 
TDAIHLVHGP IGCATYTMDI RGSLSSGSEL YRNSFSTDLK EKDVVFGGEK KLAACIDEVV
TKYHPPAVFV YSTCVVGMIG DDIIAVCKAA SARHQIDVIP VESTGFMSGN KVIGYRAAAE
ALLKLITPKE GETVQMTRKL NFLGEYNLGG EKWLVERYLR EIGIEINVAF TGDSTVAALK
QAPGACLNIV QCTGSMHWVA QNLERTFGTP YIDVNFFGAE NTAESLRKIA EFYEDEEIMR
RTEVLIDREM KNIQPAIQKY REKLTGKRAA IYVGGAFKAV AIIRQLQELG MEVVFTGTQT
GKQEEYDRIR DMVDEGTVII DDANPAELEK FLLEKEVDMM AGGVKERVLA FKLGIGFVDH
NHDRKECLAG FEGAVRFARE VYITTCSPVW KHLKQTPPCK ED
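
The sequence blocks above are printed in groups of 10 characters, 60 per line, so they need cleaning before any computation. The sketch below is one possible way to strip that layout, compute a GC fraction, and translate DNA using the amino acid assignments of NCBI translation table 11 (the table named in the record). The helper names are hypothetical, and the sketch deliberately ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate(clean_sequence("ATGGAAGAGA CAGGCGTACT GGACGTTGGG")))  # MEETGVLDVG
print(translate(clean_sequence("GAGGACTGA")))                         # ED
```

The two printed fragments match the start (MEETGVLDVG) and end (ED) of the protein sequence in the record. Passing the full gene sequence block to `clean_sequence` and then `gc_fraction` would give a value to compare against the reported GC content.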