Gene MmarC7_0101 details


Gene Information

Locus tag: MmarC7_0101
Symbol:
ID: 5328192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C7
Kingdom: Archaea
Replicon accession: NC_009637
Strand:
Start (bp): 116546
End (bp): 117976
Gene length: 1431 bp
Protein length: 476 aa
Translation table: 11
GC content: 38%
IMG OID: 640792622
Product: nitrogenase alpha chain
Protein accession: YP_001329322
Protein GI: 150402028
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
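
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 116546
END_BP = 117976
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 117976 - 116546 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 aa, matching the listed gene and protein lengths.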


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.299285
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0821221
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTCT GTTTATTGGA TGTAGATAAA GATATTCCTG AAAGAGAACA ACACGTTTAC 
ATCAAAGATT CAAAAGATAC AAACGGACAT TGCCAAAAAT GTAATACCAC CACAATCCCA
GGAAGTATGA CCGAAAGAGG CTGTGCTTTT GCAGGGGTAA AAGGTGTGAT TACTGGTGCA
ATAAAAGACG TACTACAAGT AGTACACTCG CCTGTTGGAT GTTCCGCATA CGGAAACGGT
ACAACAAAAA GATACCCAAC AAACTCAACA ATGCCTGATG GAAGCACATT CCCAGTTGAA
AATTTCAACC TCAAACACAT TGTCGGAACA GACTTAAGTG AATCCGATGT TGTATTTGGT
GGAATGAAAA AACTTAAAGC AACAATTAGA GAAGGTGCAA AAGAGTACCC ATTCGTAAAT
GCAATCTATG TTTACGCAAC ATGTACAACA GGTCTTATCG GAGACGACTT AGATGCAGTA
TGTAAAGAAA TGCAAGCAGA ACTTGGAAAA GATGTTGTAG CGTTCAACGC TCCAGGATTT
GCAGGACCAA CACAATCAAA AGGACACCAC GTAGGAAACT TCACGATATT CGAAAAATTA
GTTGGAACAA AAGAACCTCT TGAAACAACT GATTACGACA TCAACCTCAT TGGAGAATAT
AACATCGATG GTGACTACTG GGTTCTTGAA AAATACTTCG ATGCTATGGG CATTAGGGTT
CTCAGTAAAT TCACAGGAGA TGCATGCCAC GATGAGCTCT GCTGGATGCA CAAAGCAAAA
CTAAGCCTTG TAAGATGCCA AAGATCTGCA ACATACGTAG CAAAATTAAT TGAAGAAAAA
TACGGTGTTC CATATATTAA AGTAGATTTC TTCGGACCAG AATACTGTGC TGAAAACTTA
AGAACAGTAG GTAAATTCTT TGGAAAAGAA ATTGAAGCTG AAGCTGTTAT TAAAAAAGAA
ATGGAAAAAA TCCAGCCTGA AATTGATTTC TACAAATCAA AATTACAGGG TAAAAAAGTT
TGGATTTCAG CAGGAGGGCC AAAAAGCTGG CACTTAGCTA AACCACTTGA AGAATACTTA
GGAATGGACG TGGTAGCACT TTCAGGTCTT TTCGAACACG AAGATGGATA CGAAAAAATG
CAAGAAAGGG CAAAAGATGG TACAATTATC ATTGATGACC CGAACACCCT TGAAATGGAA
GAAGTAGTTG AAAAATACCA CCCAGATATA GTTCTTGGAG GTATCAAAGA GAAATATTTC
TTCCACAAAT TAGGAGTATC TTCAGTAATG ATACACTCTT ACGAAAACGG TCCATACATT
GGATTTGAAG GATTCGTAAA CCTTGCAAAA GATATTTACA CAGCAATCTA CAACCCAGCT
TGGAGTTTAA TGGAATTTGA AGACGAAGAG CCAGGTGATA CAAATGAGTG A
 
Protein sequence
MPFCLLDVDK DIPEREQHVY IKDSKDTNGH CQKCNTTTIP GSMTERGCAF AGVKGVITGA 
IKDVLQVVHS PVGCSAYGNG TTKRYPTNST MPDGSTFPVE NFNLKHIVGT DLSESDVVFG
GMKKLKATIR EGAKEYPFVN AIYVYATCTT GLIGDDLDAV CKEMQAELGK DVVAFNAPGF
AGPTQSKGHH VGNFTIFEKL VGTKEPLETT DYDINLIGEY NIDGDYWVLE KYFDAMGIRV
LSKFTGDACH DELCWMHKAK LSLVRCQRSA TYVAKLIEEK YGVPYIKVDF FGPEYCAENL
RTVGKFFGKE IEAEAVIKKE MEKIQPEIDF YKSKLQGKKV WISAGGPKSW HLAKPLEEYL
GMDVVALSGL FEHEDGYEKM QERAKDGTII IDDPNTLEME EVVEKYHPDI VLGGIKEKYF
FHKLGVSSVM IHSYENGPYI GFEGFVNLAK DIYTAIYNPA WSLMEFEDEE PGDTNE
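
The relationship between the two sequences above can be illustrated by translating excerpts of the gene sequence. This is a minimal sketch, not the database's own pipeline: it assumes the sense-codon assignments of translation table 11 match the standard genetic code, it translates only the first 30 and the last 51 nucleotides copied from the gene sequence, and `translate` is a hypothetical helper written for this example.

```python
# Codon table built in the conventional TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Excerpts copied from the gene sequence above (spaces are ignored).
FIRST_30_NT = "ATGCCATTCT GTTTATTGGA TGTAGATAAA"
LAST_51_NT = "TGGAGTTTAA TGGAATTTGA AGACGAAGAG CCAGGTGATA CAAATGAGTG A"

print(translate(FIRST_30_NT))  # MPFCLLDVDK
print(translate(LAST_51_NT))   # WSLMEFEDEEPGDTNE
```

The two outputs correspond to the first 10 and the last 16 residues of the protein sequence listed above. The final codon, TGA, is read as a stop and is not emitted.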