Gene Mmc1_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_1201 
Symbol 
ID4481194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp1496361 
End bp1497800 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content53% 
IMG OID639721944 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_865118 
Protein GI117924501 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.75707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00777378 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGTAC TGACTCGCGA AGAGGCACAA AGCCTCATCG AAGAGGTGCT GGAAGTCTAC 
CCTGAGGAGT CTAAAAAAGA TCGCCTCAAG CACCTGACAG TAAATGATCC CAGCATCACC
CAGTCTAAAA AGTGCATCAC CTCCAACCGT AAATCCTTGC CTGGTGTCAT GACCATCCGT
GGTTGCGCCT ATGCGGGTTC CAAGGGTGTG GTATGGGGTC CCATCAAAGA TATGATCCAC
ATCTCCCACG GTCCTGTGGG TTGTGGTCAA TACTCCCGCG CGGGTCGTCG TAACTACTAT
GTGGGTTACA CCGGCGTCAA TGCCTTCGGC ACCATGAACT TCACCTCGGA CTTCCAAGAG
CGTGACGTCG TGTTCGGTGG CGACAAAAAG CTGGAGAAGA TCGTTCACGA GTGCGAAGCG
CTGTTCCCCC TGATGAAGGG TATGTCCGTC CAGTCCGAGT GCCCCATTGG TCTGATCGGT
GACGACATTG AAGCGGTGGC CCGCAAGACC TCGGCGGCCA TCAACAAGCC TGTTATCCCT
GTACGTTGCG AAGGTTTCCG TGGGGTTTCT CAGTCCTTGG GTCACCACAT TGCCAACGAC
GCAATCCGTG ACTGGGTGCT GGAGAACCGT AAAGACAAAA TGCGTGAAAC CGGGCCTTAC
GATGTGGCCG TCATCGGCGA CTATAACATC GGTGGTGACG CTTGGGCCTC GCGTATTTTG
TTGGAAGAGA TGGGTCTGAA CGTGGTTGCT CAGTGGTCTG GCGACGGCAC CCTGGCGGAG
ATGGAGAACA CCCCTGCCGT TAAGCTGAAC CTGATCCACT GCTACCGTTC CATGAACTAC
ATCTCCCGTC ACATGGAAGC CAAGTATGGT ATTCCCTGGA TGGAGTATAA TTTCTTTGGT
CCCACCAAGA TTGCTGAGTC CCTACGCAAG ATTGCTGAGC AGTTCGACGA CAAGATCAAA
GAGGGCGCTG AGAAGGTTAT TGCCAAATAT ACCCCCATCA TGGAGGGCAT CATTGGTAAG
TACCGTCCCC GTCTGGAAGG CAAAAAAGTG ATGTTGTATG TGGGCGGTCT GCGTCCCCGT
CACGTGATCG GTGCTTACGA AGATCTGGGC ATGGAAGTGG TTGGTACCGG CTACGAATTT
GGCCACAACG ACGACTATGA CCGCACCATC AAAGAGATGG GTGACTCCAC CCTGATCTAT
GATGACGTCA CCGGTTACGA GTTTGAAAAG TTTGTGGAGA AGGTACAGCC CGATCTGGTT
GGTTCTGGCA TCAAAGAAAA ATACATCTTC CAGAAGATGG GTATTCCCTT CCGTCAGATG
CACTCCTGGG ACTACTCAGG CCCCTACCAT GGGTATGACG GTTTTGCCAT TTTCGCCCGC
GATATGGACA TGACCATCAA CAACCCTTGC TGGGACTCCT TTAAGGCGCC CTGGAAGTAA
 
Protein sequence
MSVLTREEAQ SLIEEVLEVY PEESKKDRLK HLTVNDPSIT QSKKCITSNR KSLPGVMTIR 
GCAYAGSKGV VWGPIKDMIH ISHGPVGCGQ YSRAGRRNYY VGYTGVNAFG TMNFTSDFQE
RDVVFGGDKK LEKIVHECEA LFPLMKGMSV QSECPIGLIG DDIEAVARKT SAAINKPVIP
VRCEGFRGVS QSLGHHIAND AIRDWVLENR KDKMRETGPY DVAVIGDYNI GGDAWASRIL
LEEMGLNVVA QWSGDGTLAE MENTPAVKLN LIHCYRSMNY ISRHMEAKYG IPWMEYNFFG
PTKIAESLRK IAEQFDDKIK EGAEKVIAKY TPIMEGIIGK YRPRLEGKKV MLYVGGLRPR
HVIGAYEDLG MEVVGTGYEF GHNDDYDRTI KEMGDSTLIY DDVTGYEFEK FVEKVQPDLV
GSGIKEKYIF QKMGIPFRQM HSWDYSGPYH GYDGFAIFAR DMDMTINNPC WDSFKAPWK