Gene MmarC6_1800 details

Gene Information

Locus tag: MmarC6_1800
Symbol:
ID: 5737506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C6
Kingdom: Archaea
Replicon accession: NC_009975
Strand:
Start bp: 1692619
End bp: 1694052
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 38%
IMG OID: 641284300
Product: nitrogenase alpha chain
Protein accession: YP_001549843
Protein GI: 159906181
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
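
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1692619
end_bp = 1694052
gene_length_bp = 1434
protein_length_aa = 477

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span // 3
expected_protein_length = codons - 1

lengths_consistent = (
    span == gene_length_bp
    and span % 3 == 0
    and expected_protein_length == protein_length_aa
)
print(span, expected_protein_length, lengths_consistent)  # 1434 477 True
```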


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.515019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTCT GTTTATTGGA TGTAGATAAA GATATCCCTG AAAGAGAACA ACACGTTTAC 
ATCAAAGATT CAAAAGATAC AAACGGACAT TGCCAAAAAT GTAATACCAC CACAATCCCG
GGAAGTATGA CTGAAAGAGG TTGTGCTTTT GCAGGAGTAA AAGGTGTGAT TACAGGTGCA
ATAAAAGATG TACTACAAGT AGTACACTCG CCTGTTGGAT GTTCCGCATA CGGAAACGGT
ACAACAAAAA GATACCCAAC AGATTCAACA ATGCCTGATG GAAGCATATT CCCAGTTGAA
AATTTCAACC TCAAACACAT TGTCGGAACA GACTTAAGTG AATCCGATGT TGTATTTGGT
GGAATGAAAA AACTTAAAGC AACAATTAGG GAGGGTGCAA AAGAATACCC ATTCGTAAAT
GCAATCTATG TTTACGCAAC ATGTACAACA GGCCTTATTG GAGACGACCT AGATGCAGTA
TGTAAAGAAA TGCAAGCAGA ACTTGGAAAG GATGTTGTAG CATTCAACGC TCCAGGATTT
GCAGGACCAA CACAATCAAA AGGACACCAC GTAGGAAACT ACACGATATT TTCAAAATTG
GTTGGAACAA AAGAACCACC ATTTGAATTG GGTGATTACG ACATCAACCT CATTGGAGAA
TATAATATCG ATGGTGACTA CTGGGTCCTT CAAAAATACT TCGACGCTAT GGGCATCAGA
GTTCTCAGTA AATTTACTGG AGATGCATGC CACTATGAGC TCTGCTGGAT GCACAAAGCA
AAACTAAGTC TTGTAAGATG CCAAAGATCT GCAACATACG TAGCAAAATT AATTGAAGAA
AAATATGGTG TTCCATACAT TAAAGTAGAC TTCTTCGGTC CAGAATACTG TGCTGAAAAC
TTAAGGACAG TAGGTAAATT CTTTGGAAAA GAAATTGAAG CCGAAGCTGT TATTAAAAAA
GAAATGGAAA AAATCCAGCC TGAACTTGAT TTCTACAAAT CAAAATTACA AGGTAAAAAA
GTTTGGATTT CAGCAGGGGG GCCAAAAAGC TGGCACTTAG CTAAACCACT TGAAGAATTC
TTAGGAATGG ACGTGGTAGC ACTTTCAGGT CTCTTCGAAC ACGAAGATGG ATACGAAAAA
ATGCAAGAAA GGGCAAAAGA TGGTACAATT ATTATTGACG ACCCGAACAC CCTTGAAATG
GAAGAAGTTG TAGAAAAATA CCAACCAGAT ATAGTTCTTG GTGGTATCAA AGAGAAATAT
TTCTTCCACA AATTAGGAGT ATCTTCAGTA ATGATACACT CTTACGAAAA CGGTCCATAC
ATCGGATTTG AAGGATTCGT AAATCTTGCA AAAGACATTT ACACAGCAAT ATACAACCCA
GCCTGGAGTT TAATGGAATT TGAAGACGAA GAGCCAGGTG ATACAAATGA GTGA
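
The gene sequence is listed in space-separated blocks of 10 bases, so it needs to be joined before any calculation. As a minimal sketch, the hypothetical helpers `clean_sequence` and `gc_percent` below show one way to strip the block formatting and compute a GC percentage such as the one reported in the record. Only the first listing line is used here for brevity; pasting the full listing would be needed to compare against the reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First listing line of the gene sequence, used as a short demonstration.
first_line = "ATGCCATTCT GTTTATTGGA TGTAGATAAA GATATCCCTG AAAGAGAACA ACACGTTTAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```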
 
Protein sequence
MPFCLLDVDK DIPEREQHVY IKDSKDTNGH CQKCNTTTIP GSMTERGCAF AGVKGVITGA 
IKDVLQVVHS PVGCSAYGNG TTKRYPTDST MPDGSIFPVE NFNLKHIVGT DLSESDVVFG
GMKKLKATIR EGAKEYPFVN AIYVYATCTT GLIGDDLDAV CKEMQAELGK DVVAFNAPGF
AGPTQSKGHH VGNYTIFSKL VGTKEPPFEL GDYDINLIGE YNIDGDYWVL QKYFDAMGIR
VLSKFTGDAC HYELCWMHKA KLSLVRCQRS ATYVAKLIEE KYGVPYIKVD FFGPEYCAEN
LRTVGKFFGK EIEAEAVIKK EMEKIQPELD FYKSKLQGKK VWISAGGPKS WHLAKPLEEF
LGMDVVALSG LFEHEDGYEK MQERAKDGTI IIDDPNTLEM EEVVEKYQPD IVLGGIKEKY
FFHKLGVSSV MIHSYENGPY IGFEGFVNLA KDIYTAIYNP AWSLMEFEDE EPGDTNE
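
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one possible way to reproduce that translation, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` is hypothetical, and the input is assumed to be the coding strand in frame.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()  # strip block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGCCATTCT GTTTATTGGA TGTAGATAAA"))  # MPFCLLDVDK
print(translate("GAGCCAGGTG ATACAAATGA GTGA"))        # EPGDTNE
```

Both fragments match the start and end of the protein sequence listed above.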