Gene MmarC5_0661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC5_0661 
Symbol 
ID4928301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C5 
KingdomArchaea 
Replicon accessionNC_009135 
Strand
Start bp634526 
End bp635956 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content39% 
IMG OID640166163 
Productnitrogenase alpha chain 
Protein accessionYP_001097187 
Protein GI134045701 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01284] nitrogenase alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.949804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTCT GTTTATTGGA TGTAGATAAA GATATCCCTG AAAGAGAACA ACACGTTTAC 
ATCAAAGATT CAAAAGATAC AAACGGACAT TGCCAAAAAT GTAATACCAC CACGATCCCT
GGAAGTATGA CCGAAAGAGG CTGTGCTTTT GCAGGAGTTA AAGGTGTGAT TACTGGTGCA
ATAAAAGACG TACTACAAAT AGTACACTCG CCTGTTGGAT GTTCCGCATA CGGAAACGGT
ACAACAAAAA GATACCCAAC AAACTCAACA ATGCCTGATG GAAGTACATT CCCAGTTGAA
AACTTCAACC TCAAACACAT TGTCGGAACA GACTTAACTG AATCCGATGT TGTATTTGGT
GGAATGAACA AACTCAAAAA AGTAATTCGA GAAGGCGCAA AAGAGTACCC TTTCGTAAAT
GCAATCTACG TTTACGCAAC ATGTACAACG GGTCTTATCG GAGACGACTT AGATGCAGTA
TGTAAAGAAA TGCAAGCAGA ACTTGGAAAA GATGTTGTAG CATTCAATGC TCCAGGATTT
GCAGGACCAA CACAATCAAA AGGACACCAC GTAGGAAACT ACACGATATT TTCAAAATTG
GTTGGAACAA AAGAACCTCT AGAAACAACC GATTACGACA TCAACCTTAT TGGAGAATAT
AACATCGATG GTGACTACTG GGTCCTTGAA AAATACTTCG ATGCTATGGG CATCAGAGTT
CTCAGTAAAT TTACTGGAGA TGCATGCCAC GATGAGCTCT GCTGGATGCA CAAAGCAAAA
TTAAGCCTTG TAAGATGCCA AAGATCTGCA ACATACGTAG CAAAATTAAT TGAAGAAAAA
TACGGTGTAC CATACATTAA AGTAGATTTC TTCGGTCCAG AATACTGTGC TGAAAACTTA
AGAACAGTAG GTAAATTCTT CGGAAAAGAA ATTGAAGCTG AAGCTGTTAT TAAAAAAGAA
ATGGAAAAAA TCCAGCCTGA ACTTGATTTC TACAAATCAA AATTACAGGG TAAAAAAGTT
TGGATTTCAG CAGGAGGTCC AAAAAGCTGG CACTTATCCA AACCACTCGA AGAATACTTA
GGAATGGACG TGGTAGCACT TTCCGGTCTT TTCGAACACG AAGATGGATA CGAAAAAATG
CAGGAAAGGG CAAAAGATGG TACAATTATC ATTGACGACC TGAACACACT TGAAATGGAA
GAAGTTGTTG AAAAATACCA CCCCGAAATC GTTCTTGGAG GTATCAAAGA GAAATATTTC
TTCCACAAAT TGGGAGTATC TTCAGTAATG ATACACTCTT ACGAAAACGG CCCATACATC
GGATTCGAAG GATTCGTAAA CTTAGCAAAA GACATTTACA CAGCAATATA CAACCCAGCT
TGGAGTTTAA TGGAATTTGA AGACGAAGAG CCAGGTGATA CAAATGAGTG A
 
Protein sequence
MPFCLLDVDK DIPEREQHVY IKDSKDTNGH CQKCNTTTIP GSMTERGCAF AGVKGVITGA 
IKDVLQIVHS PVGCSAYGNG TTKRYPTNST MPDGSTFPVE NFNLKHIVGT DLTESDVVFG
GMNKLKKVIR EGAKEYPFVN AIYVYATCTT GLIGDDLDAV CKEMQAELGK DVVAFNAPGF
AGPTQSKGHH VGNYTIFSKL VGTKEPLETT DYDINLIGEY NIDGDYWVLE KYFDAMGIRV
LSKFTGDACH DELCWMHKAK LSLVRCQRSA TYVAKLIEEK YGVPYIKVDF FGPEYCAENL
RTVGKFFGKE IEAEAVIKKE MEKIQPELDF YKSKLQGKKV WISAGGPKSW HLSKPLEEYL
GMDVVALSGL FEHEDGYEKM QERAKDGTII IDDLNTLEME EVVEKYHPEI VLGGIKEKYF
FHKLGVSSVM IHSYENGPYI GFEGFVNLAK DIYTAIYNPA WSLMEFEDEE PGDTNE