Gene M446_3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3537 
Symbol 
ID6131769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3947948 
End bp3949459 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content70% 
IMG OID641643706 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_001770354 
Protein GI170741699 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.597441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0139782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGG ACAAAATCGC GGACGTGTTC AACGAGCCCG GTTGCGAGAA GAACCAGGCC 
AAGGGCGCCA AGGAGCGCAA GAAGGGCTGC ACGAAGCCCC TCACCCCCGG GGCGGCGGCG
GGCGGCTGCG CCTTCGACGG GGCCAAGATC GTGCTGCAGC CGATCACCGA CGTCGCCCAC
CTGATCCACG CGCCCCTCGC CTGCGAGGGC AACAGCTGGG ACAATCGCGG GGCGGGCTCC
TCCGGCTCCG ACCTCTGGCG GCGCAGCTTC ACCACCGACC TCACCGAACT CGACGTGGTG
ATGGGCCAGG GCGAGAGGAA GCTCTACCGG GCCGTGCGCG AGATCGCCCG CACCTACGCG
CCCCCGGCGA TTTTCGTCTA CTCCACCTGC GTCACCGCCC TGATCGGCGA CGACATCGAG
GCGGTCTGCG CCAAGGCCTC CGAGACCTGC GGGCTGCCGG TGATCCCCGT GAACGCGCCG
GGCTTCGTCG GCTCGAAGAA TCTCGGCAAC AAGCTCGCCG GCGAGGCGCT GCTCGACCAC
GTCATCGGCA CGGTCGAGCC CGACGACGTC GGCCCCACCG ACATCAACAT CCTGGGCGAG
TTCAACCTCG CGGGCGAGTT CTGGGCGGTG CGCCCGCTCT TCGAGCGGCT CGGCATCCGC
ATCCGCGCCT GCATCCCGGG CGACGCGCGC TACCGCGAGG TCGCGGCCGC CCACACGGCG
CGGGCGACGA TGATGGAATG CTCGACCGCC CTCATCAATC TCGCGCGCAA GATGGAGGAG
CGCTGGGGCA TCCCCTTCTT CGAGGGCTCC TTCTACGGCA TCTCCGACAC CTCGGACGCC
CTGCGCCAGA CCGCCCGGCT GCTGGTCGGG CGGGGCGCGC CCTCCGACCT CCTCGACCGC
ACCGAGGCCC TGATCGCCGA GGAGGAGGCC CGGGCCTGGG CGCGGCTGGA GGCGTTCCGG
CCGCGGCTCC AGGGCAAGCG GGTCCTGCTC AACACCGGCG GGGTCAAGTC GTGGTCGGTG
GTGGCGGCGC TGATGGAGAT CGGCGTCGAG ATCGTCGGCA CCTCGGTCAA GAAGTCGACC
GCCGAGGACA AGGAGCGGAT CAAGCAGCTC CTGAAGGACG AGAACCACAT GTTCGAGAGC
ATGGCCCCGC GCGACCTCTA CGCGAAGCTG GCCTCGCACG AGGCCGACAT CATGCTGTCG
GGCGGGCGGA CGCAGTTCAT CGCGCTCAAG GCGAAGATGC CCTGGCTCGA CATCAACCAG
GAGCGGCATG TCGCGTATGC GGGCTACGAC GGCATGGTGG AGCTCGTCCG GCGCATCGAC
CTCGCTCTCT CGAACCCGAT CTGGGCCGAC CTGCGCGATC CCGCGCCCTG GGACGCCGAG
GGGCGGCTGA CCGCGGCCGG GGCGGCCCCG CGCGCGGAGC CCGGCCGGGA TCCCGCCGCG
GATCCCACCT TCCTGGCCCA TCACCGCAGG AAGTTCGCCG GTGCGGGCGC CGACGACATG
GCCGAGTGCT GA
 
Protein sequence
MLKDKIADVF NEPGCEKNQA KGAKERKKGC TKPLTPGAAA GGCAFDGAKI VLQPITDVAH 
LIHAPLACEG NSWDNRGAGS SGSDLWRRSF TTDLTELDVV MGQGERKLYR AVREIARTYA
PPAIFVYSTC VTALIGDDIE AVCAKASETC GLPVIPVNAP GFVGSKNLGN KLAGEALLDH
VIGTVEPDDV GPTDINILGE FNLAGEFWAV RPLFERLGIR IRACIPGDAR YREVAAAHTA
RATMMECSTA LINLARKMEE RWGIPFFEGS FYGISDTSDA LRQTARLLVG RGAPSDLLDR
TEALIAEEEA RAWARLEAFR PRLQGKRVLL NTGGVKSWSV VAALMEIGVE IVGTSVKKST
AEDKERIKQL LKDENHMFES MAPRDLYAKL ASHEADIMLS GGRTQFIALK AKMPWLDINQ
ERHVAYAGYD GMVELVRRID LALSNPIWAD LRDPAPWDAE GRLTAAGAAP RAEPGRDPAA
DPTFLAHHRR KFAGAGADDM AEC