Gene Mnod_3995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_3995 
Symbol 
ID7307447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp4070494 
End bp4072011 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content62% 
IMG OID643601655 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002499185 
Protein GI220923883 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTCG ATTACGAAAA CGACGGCGCA TTCCACGAGA AGATTATCAA GGAAGTGCTG 
GCCGCCTATC CCGACAAGTT CGCCAAGCGC CGCCGCAAAC ACCTCTCGGT TGCCACCCCC
GCCGCCGCGG ACGAATCCGC GCCGGAGGAG GAAAAGGCGC TCACCGAGTG CGACGTGAAA
TCCAACATCA AGTCGATCCC GGGTGTCATG ACCATCCGAG GCTGCGCCTA TGCCGGCTCC
AAGGGCGTGG TCTGGGGACC GGTCAAGGAT ATGATCCACA TCTCGCACGG ACCGGTCGGG
TGCGGGCAGT ATTCCTGGTC CCAGCGCCGC AACTACTACA TCGGCACGAC CGGAGTCGAC
ACCTTCGTGA CGATGCAGTT CACTTCCGAC TTCCGGGAGA AGGACATCGT CTTCGGTGGC
GACAAGAAGC TTGACAAGGT CATCGACGAG ATCGAGACGC TGTTCCCGCT CAATCACGGC
GTGACGATCC AGTCGGAATG CCCGATCGGC CTGATCGGCG ACGACACCGA GGCGGTGGCC
AAGAAGAAGA TGACGGACAT CGGCAAGACC GTGGTGCCGG TGCGCTGCGA GGGGTTTCGC
GGCGTGTCGC AGTCGCTGGG CCACCACATC GCCAACGACG CGATCCGCGA CTGGGTGTTC
GAGAAGCAGG AGCGCGAGTT CGCCTTCGCG GGCACGCCCT ACGACGTCAA CGTGGTCGGC
GACTACAATA TCGGCGGCGA CGCCTGGGCC TCGCGCATCC TGTTGGAGGA GATGGGCCTT
CGCATCGTCG GCAACTGGTC GGGGGACGCC ACCCTCGCCG AGATCGAGCG CGCGCCGAAG
GCCAAGCTCA ACCTCATCCA CTGCTACCGG TCGATGAACT ACATCTGTCG CTATATGGAG
GAGAAGTACG CAATCCCGTG GATGGAGTAC AATTACTTCG GCCCGTCGCA GATCGCGGCC
TCTTTGCGCA ATATCGCTAA GCATTTCGGT CCCGAGATCG AGAAGAAGGC AGAGGCGGTG
ATCGCCAAGT ACCAGCCCCT CGTTGATGCG GTGGTCGCAA AGTACGGTCC GCGGCTGAAG
GGCAAGAGAG TCATGCTCTA TGTCGGTGGC CTGCGCCCGC GCCACGTGAT CACAGCCTAC
GAGGATCTCG GCATGGAGAT CGCAGGCACT GGCTACGAAT TCGCCCATAA CGACGACTAC
CAGCGCACCG GCCACTACGT TAAGAAAGGC ACGCTGATCT ACGACGACGT GACAGGCTAC
GAACTCGAGA AGTTCATTGA GACGATCCGG CCCGACCTCG TCGGCTCCGG CATCAAGGAG
AAGTACCCGG TCCAGAAGAT GGGCATTCCG TTCCGGCAGA TGCACTCCTG GGACTATTCG
GGCCCGTATC ACGGCTATGA TGGGTTCGCG ATCTTCGCCC GGGACATGGA CTTGGCGATC
AACAACCCGG TCTGGGGCCT GTTCGACGCG CCCTGGAAGA AGACGCCGGC GCCCTTGCTC
CAAGAGGCCG CCGCGTAA
 
Protein sequence
MSLDYENDGA FHEKIIKEVL AAYPDKFAKR RRKHLSVATP AAADESAPEE EKALTECDVK 
SNIKSIPGVM TIRGCAYAGS KGVVWGPVKD MIHISHGPVG CGQYSWSQRR NYYIGTTGVD
TFVTMQFTSD FREKDIVFGG DKKLDKVIDE IETLFPLNHG VTIQSECPIG LIGDDTEAVA
KKKMTDIGKT VVPVRCEGFR GVSQSLGHHI ANDAIRDWVF EKQEREFAFA GTPYDVNVVG
DYNIGGDAWA SRILLEEMGL RIVGNWSGDA TLAEIERAPK AKLNLIHCYR SMNYICRYME
EKYAIPWMEY NYFGPSQIAA SLRNIAKHFG PEIEKKAEAV IAKYQPLVDA VVAKYGPRLK
GKRVMLYVGG LRPRHVITAY EDLGMEIAGT GYEFAHNDDY QRTGHYVKKG TLIYDDVTGY
ELEKFIETIR PDLVGSGIKE KYPVQKMGIP FRQMHSWDYS GPYHGYDGFA IFARDMDLAI
NNPVWGLFDA PWKKTPAPLL QEAAA