Gene Msil_3631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3631 
Symbol 
ID7092904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3995068 
End bp3996534 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID643466919 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002363878 
Protein GI217979731 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.486898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGG CGCCTGCTGA ATCCATCGAA GACATCAAAG CCCGGAACAA GGCCCTCATC 
AAGGAGGTCC TCGAAGTCTA TCCGGAGAAA ACAGCCAAGC GACGCGCCAA GCATCTCGGC
ACGTTCGAGG AAGGCAAGCC TGACTGCGGC GTTAAGTCGA ACATCAAATC GATCCCCGGC
GTCATGACGA TCCGCGGCTG CGCTTATGCC GGCTCGAAGG GCGTCGTGTG GGGTCCGATC
AAGGACATGA TCCACATCAG CCATGGCCCC GTCGGCTGCG GTCAGTATTC CTGGGCCTCG
CGGCGCAACT ATTACATCGG CACCACGGGC GTCGACACCT TCGGCACGAT GCAGTTCACC
TCCGATTTCC AGGAGAAGGA CATTGTGTTC GGCGGCGACA AGAAGCTCGC CAAAATCATG
GACGAAATTC AGGTCCTGTT CCCCCTGAAC AAGGGCATCA CCGTTCAGTC CGAGTGCCCG
ATCGGCCTCA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGTCGAA GGAATATGAA
GGCAAGACGA TCGTTCCCGT CCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC
CACCACATCG CCAACGACGC CGTCCGCGAC TGGGTGTTCG ACAAGCAGGA GGGCAAGCCC
GCCCGCTTCG AGCAGACCGA CTATGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC
GACGCCTGGT CTTCCCGCAT CCTGCTCGAG GAAATGGGTC TTCGCGTGAT TGCGCAATGG
TCCGGCGACG GCACCATCGC CGAGCTCGAG GCGACGCCGA AGGCGAAGCT GAATGTGCTG
CACTGCTACC GTTCGATGAA CTACATCTCG CGCCACATGG AAGAGAAATA CGGTATACCG
TGGGTCGAGT ATAATTTCTT CGGGCCGACC AAGATCGAAG AATCGCTGCG CAAGATCGCC
AGCCATTTCG ACGACAAGAT CAAGGACGGC GCCGAGCGCG TCATCGCCAA ATATCGCGCG
CTGACCGACG CCGTCATCGC CAAATACCGG CCGCGCCTCG AAGGCAAGAC CGTTATGCTG
TTCGTCGGCG GCCTGCGTCC GCGCCACGTG ATCGGCGCTT ACGAGGATCT CGGCATGGAG
ATCGTCGGGA CGGGCTACGA GTTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAT
GTGAAGGACG GCACGCTGAT CTATGACGAC GTGACCGGCT ACGAGTTCGA GAAATTCGTC
GAAAAGATCC AGCCCGATCT CGTCGGCTCC GGCATCAAGG AAAAATACGT CTTCCAGAAA
ATGGGCGTGC CTTTCCGCCA GATGCACTCG TGGGACTACT CGGGCCCCTA TCACGGCTAC
GATGGCTTCG CGATCTTCGC GCGCGACATG GACATGGCGA TCAACTCGCC GGTCTGGAAG
CTGGCCAAGA CCCCCTGGGC GGCCTGA
 
Protein sequence
MSLAPAESIE DIKARNKALI KEVLEVYPEK TAKRRAKHLG TFEEGKPDCG VKSNIKSIPG 
VMTIRGCAYA GSKGVVWGPI KDMIHISHGP VGCGQYSWAS RRNYYIGTTG VDTFGTMQFT
SDFQEKDIVF GGDKKLAKIM DEIQVLFPLN KGITVQSECP IGLIGDDIEA VSKAKSKEYE
GKTIVPVRCE GFRGVSQSLG HHIANDAVRD WVFDKQEGKP ARFEQTDYDV AIIGDYNIGG
DAWSSRILLE EMGLRVIAQW SGDGTIAELE ATPKAKLNVL HCYRSMNYIS RHMEEKYGIP
WVEYNFFGPT KIEESLRKIA SHFDDKIKDG AERVIAKYRA LTDAVIAKYR PRLEGKTVML
FVGGLRPRHV IGAYEDLGME IVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV
EKIQPDLVGS GIKEKYVFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DMAINSPVWK
LAKTPWAA