Gene M446_3536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3536 
Symbol 
ID6131768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3946338 
End bp3947876 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content67% 
IMG OID641643705 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_001770353 
Protein GI170741698 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0309229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAAT CCGCCGACCG GGTGCTCGAT CACGCACCCC TGTTCCGTCA GCCCGAATAC 
CAGGAGATGT TCGCCCGCAA GCGCGAGCAG TTCGAGTGCC CGGCCTCCGG CGAGGCGGTC
GAGGCGCAGC GCGACTACGC CAGGACCTGG GAGTACCGCG AGAAGAACCT CGCCCGCGAG
GCCCTGGTGG TGAACCCGGC CAAGGCCTGC CAGCCGCTCG GCGCGGTCTT CGCGGCGGCC
GGCTTCGAGC GCACGATGAG CTTCGTGCAC GGCTCGCAGG GTTGCGTCGC CTATTACCGC
TCCCACCTGT CGCGCCACTT CAAGGAGCCG TCCTCGGCGG TCTCCTCCTC GATGACCGAG
GACGCGGCGG TGTTCGGCGG CCTCAACAAC ATGATCGACG GCCTCGCCAA CACCTACTCG
CTCTACGACC CGAAGATGAT CGCGGTCTCG ACCACCTGCA TGGCCGAGGT GATCGGCGAC
GACCTGCACG CCTTCATCCA GAACGCCAAG AACAAGGGGT CGGTGCCGCA GGATTACGAC
GTTCCCTTCG CCCATACCCC GGCCTTCGTG GGCAGCCACG TCGACGGCTA CGACAACATG
ATCAAGGGCG TGCTGGAGCA TTTCTGGAAG GGCAGGACCC GCAGCGCCGG CGAGGGACTC
AACCTGATCC CGGGCTTCGA CGGGTTCTGC GTCGGCAACA ACCGCGAGCT GAAGCGCCTG
CTCGACCTGA TCGGGGTCTC CTACACGCTG ATCCAGGACG CCTCCGACAC CTACGACACG
CCCTCGGACG GCGAGTTCCG GATGTATTCG GGCGGCACCC GGCTCGAGGA CGTCGCCGCC
GCGCTGAACG CCAGGGCGAC CCTCTCGCTG CAGAAGTACT GCACGCGCAA GACCCTCGAC
TACGCGGCGG AGCAGGGCCA GGAGACCCAC AGCTTCCACT ACCCGCTCGG GGTGCGCGGC
ACCGACGAGC TCCTGCTCAA GATCTCGGAA CTGAGCGGCC GGCCGATCCC CGAGGCGATC
ACGATGGAGC GCGGCCGGCT CATCGACGCC ATGGCGGACA GCCAGTCCTG GCTGCACGGC
AAGAAGTACG CGATCTTCGG CGACCCGGAC GTGGTCTACG GCCTCGCGCG CTTCGTCATG
GAGACCGGCG GCGAGCCGAT CCACTGCCTC GCCACCAACG GCACCAAGGC CTGGGAGGAG
GAGATGCAGG CGCTCCTCGC CTCCTCGCCC TTCGGCGCCA GCGGCCAGGT CTGGGCCGGC
AAGGACCTCT GGCACCTGCG CTCGCTGCTC TTCACCGAGC CGGTCGATTT CGTGCTGGGC
AATTCCTACG CCAAGTACCT GGAGCGCGAC ACCGGCACGC CGCTGATCCG CACCGCCTTC
CCGATCTTCG ACCGGCACCA CCACCACCGC TTCCCGGTGA TGGGCTACCA GGGCGGCCTG
CGCCTGCTGA CGACGATCCT CGACAAGATC TTCGACAAGC TCGATCAGGA CACCATCGAC
CCGGCCAAGA CCGACTACTC GTTCGACCTC ACCCGCTGA
 
Protein sequence
MPQSADRVLD HAPLFRQPEY QEMFARKREQ FECPASGEAV EAQRDYARTW EYREKNLARE 
ALVVNPAKAC QPLGAVFAAA GFERTMSFVH GSQGCVAYYR SHLSRHFKEP SSAVSSSMTE
DAAVFGGLNN MIDGLANTYS LYDPKMIAVS TTCMAEVIGD DLHAFIQNAK NKGSVPQDYD
VPFAHTPAFV GSHVDGYDNM IKGVLEHFWK GRTRSAGEGL NLIPGFDGFC VGNNRELKRL
LDLIGVSYTL IQDASDTYDT PSDGEFRMYS GGTRLEDVAA ALNARATLSL QKYCTRKTLD
YAAEQGQETH SFHYPLGVRG TDELLLKISE LSGRPIPEAI TMERGRLIDA MADSQSWLHG
KKYAIFGDPD VVYGLARFVM ETGGEPIHCL ATNGTKAWEE EMQALLASSP FGASGQVWAG
KDLWHLRSLL FTEPVDFVLG NSYAKYLERD TGTPLIRTAF PIFDRHHHHR FPVMGYQGGL
RLLTTILDKI FDKLDQDTID PAKTDYSFDL TR