Gene M446_3538 details


Gene Information

Locus tag: M446_3538
Symbol:
ID: 6131604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3949470
End bp: 3950852
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 73%
IMG OID: 641643707
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifN
Protein accession: YP_001770355
Protein GI: 170741700
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
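
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 3949470
END_BP = 3950852


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1383 460
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.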


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00517693
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAATCCGG TCACCCCGCT CGCCAAGGCG GTCGCCACCA ACCCGCTGAA ATCCTCCCAG 
CCCCTCGGCG CGGCCCTGGC CTTCCTGGGC ATCGAGGGCG CCATGCCGCT CTTCCACGGC
TCGCAGGGCT GCACCTCCTT CGCGCTCGTC CTGCTGGTGC GGCACTTCAA GGAGACGGTC
CCGCTCCAGA CCACCGCGAT GGACGAGGTC GCCACGGTGC TCGGCGGCGC CGACCACCTG
GAGGAGGCGA TCCTCACCCT CAAGACCCGC ACCGGGCCGA GCCTGATCGG CATCTGCACC
ACGGCGCTGG TCGAGACGCG CGGCGAGGAT TTCTCCGGCG ACCTCGCCCT GATCCGCGGG
CGCCACGCCG AGACCCTCGG CGAGACCGAG CTGGTCCTCG CCCAGACGCC GGACTTCGCC
GGCGCCATGG AGGAGGGCTG GGCCAAGGCG GTCTGCGCCA CCATCGCGCA GGTCGTGCCG
GAGGCCGAAA GCCGCGTCGC CGCCCTCAAC CGGATCAACA TCCTGCCGGG CCAGCACCAC
ACGGTGGCGG ATGTCGAGTT CCTGCGCGAG AGCGCGCTCG CCTTCGGGCT CGAACCGGTG
ATCCTGCCCG ACATCGCCGG CTCCCTCGAC GGCACGGTGC CGGCGCGCTG GATCCCGACG
AGCTACGGCG GCACGCCGGT CGCGGCGATC CGCGGCATGG GCCGGGCCCT GCACACGATC
GCGCTCGCCG AGCACGTGCG CGGGGCGGCC GCCCTGCTCG AGGCCCGCAC CGGCGTGCCC
TTCACGGTCC TCGACACGCT GACCGGCCTC AAGGCGGCGG ACCGCTACGT CGCGCTCCTC
GCCTCCCTCT CGGGCAAGCC GGTCCCGGCC GCCCTGCGGC GGCGGCGCAG CCAGCTCGAG
GACGCGCTCC TCGACGGGCA TTTCCACATC GGCGGCCTGC GGGTGGCGAT CGCGGCCGAG
CCGGACCTGC TCTACGGGCT CGCCAGCTTC TTCGCCGGGC TCGGCGCCAC CGTCGCGCTC
GCCGTGACCA CCACGGGCGA CAGTCCGATC CTCGCCCGGG TCCCGGCCGA GACGGTGGTG
GTCGGCGATC TCGGCGATCT CGAGGCCCGG GCGTCGGGCT GCGACCTCCT CGTCACCCAC
AGCCATGGGC GCCAGGCCTC CGAGCGCCTC GGCATCCCGC TGATGCGGGT GGGCTTCCCG
ATCTTCGACC GGCTCGGCTC CCCGCACCGG CTCTCGATCG GCCACGAGGG CACCCGCGAC
CTGATCTTCG AGGTGGCGAA CCTGGTCCAG GCCAACCGCC GCGAGCCCAC CCCCCAATCC
CTCAATCCGT TCCGTGACCG GGGAGAATCC CATGACGGCC TTGCGCAGAT TGCGGCTCGT
TGA
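
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any composition statistic, such as the GC content field of the record, can be recomputed. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence above, used only as sample input.
first_line = "GTGAATCCGG TCACCCCGCT CGCCAAGGCG GTCGCCACCA ACCCGCTGAA ATCCTCCCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give the figure comparable to the GC content field.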
 
Protein sequence
MNPVTPLAKA VATNPLKSSQ PLGAALAFLG IEGAMPLFHG SQGCTSFALV LLVRHFKETV 
PLQTTAMDEV ATVLGGADHL EEAILTLKTR TGPSLIGICT TALVETRGED FSGDLALIRG
RHAETLGETE LVLAQTPDFA GAMEEGWAKA VCATIAQVVP EAESRVAALN RINILPGQHH
TVADVEFLRE SALAFGLEPV ILPDIAGSLD GTVPARWIPT SYGGTPVAAI RGMGRALHTI
ALAEHVRGAA ALLEARTGVP FTVLDTLTGL KAADRYVALL ASLSGKPVPA ALRRRRSQLE
DALLDGHFHI GGLRVAIAAE PDLLYGLASF FAGLGATVAL AVTTTGDSPI LARVPAETVV
VGDLGDLEAR ASGCDLLVTH SHGRQASERL GIPLMRVGFP IFDRLGSPHR LSIGHEGTRD
LIFEVANLVQ ANRREPTPQS LNPFRDRGES HDGLAQIAAR
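
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code. It assumes that the initiator codon (GTG in this gene) is read as methionine and that translation stops at the first in-frame stop codon. `translate_cds` is an illustrative name, not a database function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a CDS; the first codon is reported as M (assumed initiator)."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAATCCGG TCACCCCGCT CGCCAAGGCG"))  # prints: MNPVTPLAKA
```

The output matches the first ten residues of the protein sequence in this record.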