Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3537 |
Symbol | |
ID | 6131769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3947948 |
End bp | 3949459 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641643706 |
Product | nitrogenase MoFe cofactor biosynthesis protein NifE |
Protein accession | YP_001770354 |
Protein GI | 170741699 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.597441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0139782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAGG ACAAAATCGC GGACGTGTTC AACGAGCCCG GTTGCGAGAA GAACCAGGCC AAGGGCGCCA AGGAGCGCAA GAAGGGCTGC ACGAAGCCCC TCACCCCCGG GGCGGCGGCG GGCGGCTGCG CCTTCGACGG GGCCAAGATC GTGCTGCAGC CGATCACCGA CGTCGCCCAC CTGATCCACG CGCCCCTCGC CTGCGAGGGC AACAGCTGGG ACAATCGCGG GGCGGGCTCC TCCGGCTCCG ACCTCTGGCG GCGCAGCTTC ACCACCGACC TCACCGAACT CGACGTGGTG ATGGGCCAGG GCGAGAGGAA GCTCTACCGG GCCGTGCGCG AGATCGCCCG CACCTACGCG CCCCCGGCGA TTTTCGTCTA CTCCACCTGC GTCACCGCCC TGATCGGCGA CGACATCGAG GCGGTCTGCG CCAAGGCCTC CGAGACCTGC GGGCTGCCGG TGATCCCCGT GAACGCGCCG GGCTTCGTCG GCTCGAAGAA TCTCGGCAAC AAGCTCGCCG GCGAGGCGCT GCTCGACCAC GTCATCGGCA CGGTCGAGCC CGACGACGTC GGCCCCACCG ACATCAACAT CCTGGGCGAG TTCAACCTCG CGGGCGAGTT CTGGGCGGTG CGCCCGCTCT TCGAGCGGCT CGGCATCCGC ATCCGCGCCT GCATCCCGGG CGACGCGCGC TACCGCGAGG TCGCGGCCGC CCACACGGCG CGGGCGACGA TGATGGAATG CTCGACCGCC CTCATCAATC TCGCGCGCAA GATGGAGGAG CGCTGGGGCA TCCCCTTCTT CGAGGGCTCC TTCTACGGCA TCTCCGACAC CTCGGACGCC CTGCGCCAGA CCGCCCGGCT GCTGGTCGGG CGGGGCGCGC CCTCCGACCT CCTCGACCGC ACCGAGGCCC TGATCGCCGA GGAGGAGGCC CGGGCCTGGG CGCGGCTGGA GGCGTTCCGG CCGCGGCTCC AGGGCAAGCG GGTCCTGCTC AACACCGGCG GGGTCAAGTC GTGGTCGGTG GTGGCGGCGC TGATGGAGAT CGGCGTCGAG ATCGTCGGCA CCTCGGTCAA GAAGTCGACC GCCGAGGACA AGGAGCGGAT CAAGCAGCTC CTGAAGGACG AGAACCACAT GTTCGAGAGC ATGGCCCCGC GCGACCTCTA CGCGAAGCTG GCCTCGCACG AGGCCGACAT CATGCTGTCG GGCGGGCGGA CGCAGTTCAT CGCGCTCAAG GCGAAGATGC CCTGGCTCGA CATCAACCAG GAGCGGCATG TCGCGTATGC GGGCTACGAC GGCATGGTGG AGCTCGTCCG GCGCATCGAC CTCGCTCTCT CGAACCCGAT CTGGGCCGAC CTGCGCGATC CCGCGCCCTG GGACGCCGAG GGGCGGCTGA CCGCGGCCGG GGCGGCCCCG CGCGCGGAGC CCGGCCGGGA TCCCGCCGCG GATCCCACCT TCCTGGCCCA TCACCGCAGG AAGTTCGCCG GTGCGGGCGC CGACGACATG GCCGAGTGCT GA
|
Protein sequence | MLKDKIADVF NEPGCEKNQA KGAKERKKGC TKPLTPGAAA GGCAFDGAKI VLQPITDVAH LIHAPLACEG NSWDNRGAGS SGSDLWRRSF TTDLTELDVV MGQGERKLYR AVREIARTYA PPAIFVYSTC VTALIGDDIE AVCAKASETC GLPVIPVNAP GFVGSKNLGN KLAGEALLDH VIGTVEPDDV GPTDINILGE FNLAGEFWAV RPLFERLGIR IRACIPGDAR YREVAAAHTA RATMMECSTA LINLARKMEE RWGIPFFEGS FYGISDTSDA LRQTARLLVG RGAPSDLLDR TEALIAEEEA RAWARLEAFR PRLQGKRVLL NTGGVKSWSV VAALMEIGVE IVGTSVKKST AEDKERIKQL LKDENHMFES MAPRDLYAKL ASHEADIMLS GGRTQFIALK AKMPWLDINQ ERHVAYAGYD GMVELVRRID LALSNPIWAD LRDPAPWDAE GRLTAAGAAP RAEPGRDPAA DPTFLAHHRR KFAGAGADDM AEC
|
| |