Gene M446_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3535 
Symbol 
ID6131767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3944720 
End bp3946237 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content65% 
IMG OID641643704 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001770352 
Protein GI170741697 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.330335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0313062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG ATTACGAGAA CGACGGCGCA CTCCACGAGA AGATCATCGA GGACGTGCTG 
GCCGCCTACC CCGACAAGTT CGCCAAGCGC CGCCGCAAGC ACCTCTCGGT CGCCACCCCG
GCCGGCGCCG AGGAGACCGC GCCCGCGGAG GAGACGCTCC TCACCGAGTG CGACGTGAAG
TCCAACATCA AGTCCATTCC GGGCGTCATG ACCATCCGGG GCTGCGCCTA TGCCGGCTCC
AAGGGCGTGG TCTGGGGACC GGTCAAGGAC ATGGTCCACA TCTCGCACGG GCCGGTCGGC
TGCGGCCAGT ATTCCTGGTC GCAGCGCCGC AACTACTACA TCGGCACGAC CGGGATCGAC
ACCTTCGTGA CGATGCAGTT CACCTCCGAC TTCCAGGAGA AGGATATCGT CTTCGGCGGC
GACAAGAAGC TCGACAAGGT CATCTCCGAG ATCGAGAGCC TGTTCCCGCT CAATCACGGC
GTCACGATCC AGTCGGAATG CCCGATCGGC CTGATCGGCG ACGACATTGA GGCGGTGGCC
AGGAAGAAGA AGAAGGAGAT CGGCAAGACC GTGGTGCCGG TCCGCTGCGA GGGCTTCCGC
GGCGTGTCGC AGTCCCTTGG CCACCACATC GCCAACGACG CGATTCGCGA CTGGGTGTTC
GAGAAGCAGG ACGGCGAGAT CGCCTTCGAG GGCACGCCCT ACGACGTCAA CGTGATCGGC
GACTACAACA TCGGCGGCGA CGCCTGGGCC TCCCGCATCC TGCTGGAGGA GATGGGGCTG
CGCATCGTCG GCAACTGGTC GGGCGACGCC ACCCTGGCCG AGATCGAGCG GGCCCCGAAG
GCCAAGCTCA ACCTCATCCA CTGCTACCGG TCGATGAACT ACATCTGCCG CTACATGGAG
GAGAAGTACG CGATCCCGTG GATGGAGTAC AACTTCTTCG GCCCGTCCCA GATCGCGGCC
TCGCTGCGCA AGATCGCCAA GCACTTCGGC CCCGAGATCG AGGAGAAGGC GGAGGCGGTG
ATCGCCAAGT ACCAGCCGCT CGTCGATGCC GTGATCGCCA AGTACGGCCC GCGCCTGAAG
GGCAAGAGCG TCATGCTCTA CGTCGGCGGC CTGCGCCCGC GCCACGTGAT CACCGCCTAC
GAGGATCTCG GCATGGAGAT CGCGGGCACC GGCTACGAGT TCGCCCACAA CGACGACTAC
CAGCGCACCG GCCACTACGT GAAGAACGGC ACGCTGATCT ACGACGACGT CACCGGCTAC
GAGCTGGAGA AGTTCATCGA GAAGATCCGG CCCGACCTCG TCGGCTCCGG CATCAAGGAG
AAGTACCCGG TCCAGAAGAT GGGCATCCCG TTCCGGCAGA TGCACTCGTG GGACTATTCG
GGCCCGTACC ACGGCTACGA CGGGTTCGCG ATCTTCGCCC GCGACATGGA CCTGGCGATC
AACAACCCGG TCTGGGGCCT GTTCGACGCG CCCTGGAAGG CGAAGCCGGC CCCGGCGTTC
CTGGACGCCG CCGAGTAG
 
Protein sequence
MSLDYENDGA LHEKIIEDVL AAYPDKFAKR RRKHLSVATP AGAEETAPAE ETLLTECDVK 
SNIKSIPGVM TIRGCAYAGS KGVVWGPVKD MVHISHGPVG CGQYSWSQRR NYYIGTTGID
TFVTMQFTSD FQEKDIVFGG DKKLDKVISE IESLFPLNHG VTIQSECPIG LIGDDIEAVA
RKKKKEIGKT VVPVRCEGFR GVSQSLGHHI ANDAIRDWVF EKQDGEIAFE GTPYDVNVIG
DYNIGGDAWA SRILLEEMGL RIVGNWSGDA TLAEIERAPK AKLNLIHCYR SMNYICRYME
EKYAIPWMEY NFFGPSQIAA SLRKIAKHFG PEIEEKAEAV IAKYQPLVDA VIAKYGPRLK
GKSVMLYVGG LRPRHVITAY EDLGMEIAGT GYEFAHNDDY QRTGHYVKNG TLIYDDVTGY
ELEKFIEKIR PDLVGSGIKE KYPVQKMGIP FRQMHSWDYS GPYHGYDGFA IFARDMDLAI
NNPVWGLFDA PWKAKPAPAF LDAAE