Gene Information |
Locus tag | M446_3535 |
Symbol | |
ID | 6131767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3944720 |
End bp | 3946237 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641643704 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001770352 |
Protein GI | 170741697 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
|
|
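The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 3944720
end_bp = 3946237


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 1518 505
```

Both results agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields in the record.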
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.330335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0313062 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCG ATTACGAGAA CGACGGCGCA CTCCACGAGA AGATCATCGA GGACGTGCTG GCCGCCTACC CCGACAAGTT CGCCAAGCGC CGCCGCAAGC ACCTCTCGGT CGCCACCCCG GCCGGCGCCG AGGAGACCGC GCCCGCGGAG GAGACGCTCC TCACCGAGTG CGACGTGAAG TCCAACATCA AGTCCATTCC GGGCGTCATG ACCATCCGGG GCTGCGCCTA TGCCGGCTCC AAGGGCGTGG TCTGGGGACC GGTCAAGGAC ATGGTCCACA TCTCGCACGG GCCGGTCGGC TGCGGCCAGT ATTCCTGGTC GCAGCGCCGC AACTACTACA TCGGCACGAC CGGGATCGAC ACCTTCGTGA CGATGCAGTT CACCTCCGAC TTCCAGGAGA AGGATATCGT CTTCGGCGGC GACAAGAAGC TCGACAAGGT CATCTCCGAG ATCGAGAGCC TGTTCCCGCT CAATCACGGC GTCACGATCC AGTCGGAATG CCCGATCGGC CTGATCGGCG ACGACATTGA GGCGGTGGCC AGGAAGAAGA AGAAGGAGAT CGGCAAGACC GTGGTGCCGG TCCGCTGCGA GGGCTTCCGC GGCGTGTCGC AGTCCCTTGG CCACCACATC GCCAACGACG CGATTCGCGA CTGGGTGTTC GAGAAGCAGG ACGGCGAGAT CGCCTTCGAG GGCACGCCCT ACGACGTCAA CGTGATCGGC GACTACAACA TCGGCGGCGA CGCCTGGGCC TCCCGCATCC TGCTGGAGGA GATGGGGCTG CGCATCGTCG GCAACTGGTC GGGCGACGCC ACCCTGGCCG AGATCGAGCG GGCCCCGAAG GCCAAGCTCA ACCTCATCCA CTGCTACCGG TCGATGAACT ACATCTGCCG CTACATGGAG GAGAAGTACG CGATCCCGTG GATGGAGTAC AACTTCTTCG GCCCGTCCCA GATCGCGGCC TCGCTGCGCA AGATCGCCAA GCACTTCGGC CCCGAGATCG AGGAGAAGGC GGAGGCGGTG ATCGCCAAGT ACCAGCCGCT CGTCGATGCC GTGATCGCCA AGTACGGCCC GCGCCTGAAG GGCAAGAGCG TCATGCTCTA CGTCGGCGGC CTGCGCCCGC GCCACGTGAT CACCGCCTAC GAGGATCTCG GCATGGAGAT CGCGGGCACC GGCTACGAGT TCGCCCACAA CGACGACTAC CAGCGCACCG GCCACTACGT GAAGAACGGC ACGCTGATCT ACGACGACGT CACCGGCTAC GAGCTGGAGA AGTTCATCGA GAAGATCCGG CCCGACCTCG TCGGCTCCGG CATCAAGGAG AAGTACCCGG TCCAGAAGAT GGGCATCCCG TTCCGGCAGA TGCACTCGTG GGACTATTCG GGCCCGTACC ACGGCTACGA CGGGTTCGCG ATCTTCGCCC GCGACATGGA CCTGGCGATC AACAACCCGG TCTGGGGCCT GTTCGACGCG CCCTGGAAGG CGAAGCCGGC CCCGGCGTTC CTGGACGCCG CCGAGTAG
|
Protein sequence | MSLDYENDGA LHEKIIEDVL AAYPDKFAKR RRKHLSVATP AGAEETAPAE ETLLTECDVK SNIKSIPGVM TIRGCAYAGS KGVVWGPVKD MVHISHGPVG CGQYSWSQRR NYYIGTTGID TFVTMQFTSD FQEKDIVFGG DKKLDKVISE IESLFPLNHG VTIQSECPIG LIGDDIEAVA RKKKKEIGKT VVPVRCEGFR GVSQSLGHHI ANDAIRDWVF EKQDGEIAFE GTPYDVNVIG DYNIGGDAWA SRILLEEMGL RIVGNWSGDA TLAEIERAPK AKLNLIHCYR SMNYICRYME EKYAIPWMEY NFFGPSQIAA SLRKIAKHFG PEIEEKAEAV IAKYQPLVDA VIAKYGPRLK GKSVMLYVGG LRPRHVITAY EDLGMEIAGT GYEFAHNDDY QRTGHYVKNG TLIYDDVTGY ELEKFIEKIR PDLVGSGIKE KYPVQKMGIP FRQMHSWDYS GPYHGYDGFA IFARDMDLAI NNPVWGLFDA PWKAKPAPAF LDAAE
|
| |
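The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the block spacing, compute a GC fraction comparable to the GC content field, and run a basic coding-sequence sanity check. All helper names are hypothetical, and the check assumes an ATG start codon, which is a simplification because translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw):
    """Remove the whitespace that groups residues into blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq, stops=("TAA", "TAG", "TGA")):
    """Cheap check: whole codons, ATG start (simplification), terminal stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First three blocks of the gene sequence in the record, for illustration only.
fragment = clean_sequence("ATGAGCCTCG ATTACGAGAA CGACGGCGCA")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Running the same functions on the full gene sequence from the record would allow a comparison with the listed GC content and gene length.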