Gene Smed_6223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6223 
Symbol 
ID5320525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1142843 
End bp1144384 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content56% 
IMG OID640777828 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_001314760 
Protein GI150378165 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAGT CCGCCGAGAA AGTGCTCGAC CATGCTCCCC TTTTCCGCGA GCCGGAATAT 
AGGAAAATGC TGGCCGAAAA GAAGCGAAAC TTCGAACGAC CATACCCGGA TCGGACCGTC
ACTGATCAAC GCGAATTCAC CAAAACCTGG CATTATCGCG AAATAAATCT CGCCCGCGAA
GCGCTCGTCG TGAACCCGGC CAAGGCCTGT CAGCCGCTTG GCGCGGTTTA TGCGGCGGCC
GGGTTTGAGC GGACAATGTC GTTTGTACAT GGCAGTCAAG GGTGCGTTGC CTATTACCGT
TCGCACCTGT CGCGCCATTT CAAGGAGCCG TCGTCCGCGG TCTCATCTTC GATGACAGAA
GACGCTGCGG TATTTGGCGG CTTGAAGAAC ATGGTCGATG GCTTAGCCAA TACTTATAAG
CTCTACGATC CGAAGATGAT CGCCGTGTCG ACCACGTGCA TGGCGGAGGT CATTGGTGAC
GACTTGCACG GCTTCATTGA AAATGCCAAG GACGAAGGCG CCGTCCCCCA TGATTTCGAT
GTCCCTTTTG CGCACACGCC GGCATTCGTC GGCAGCCATG TCGATGGTTA TGACAGCATG
GTTAAAGGCG TGCTGGAGAA TTTCTGGAAG GGCGAGCAGC GCACTGTAAA ACCTGGCTCG
ATCAACATCA TCCCGGGCTT CGATGGATTC TGCGTCGGGA ACAATCGCGA GTTGAAGCGC
CTGCTCAATT TGATGGGTGT TTCCTATACG TTCATCCAAG ATGCCTCCGA CCAATTTGAT
ACGCCCTCGG ATGGTGAGTT CCGCATGTAT GACGGAGGCA CCAAGATCGA GGACGTGAGA
GCGGCACTGA ATGCCGAGGC GACAGTGTCA CTGCAGCAAT ACAATACTCG CAGAACACTG
GAATATTGCA AAGCGGCCGG ACAGGCGACG GTGTCCTTTC ACTATCCTCT CGGTGTCAAG
GCGACAGACG AGTTCCTGGT GAAAGTATCG GAGATCTCCG GCAGGGAAAT CCCCGAGGCA
ATCCGCCTGG AGCGCGGCCG GCTTGTCGAC GCGATGGCGG ACAGCCAGTC TTGGCTGCAC
GGTAAGAAAT ACGCGATCTA CGGCGACCCC GACTTCGTAT ACGCCGTAGC GCGATTCACC
ATGGAGACTG GAGGGGAGCC AACCCACTGC CTAGCAACCA ACGGCACCCC AGCCTGGGAA
GCTGAGATGA AAAAGCTGCT CGCATCCTCG CCCCTGGGCA ATGATGCACA GGTTTGGACG
AACAAGGATC TCTGGGCAAT GCGCTCACTC CTTTTCACCG AGCCGGTGGA CCTGCTGATC
GGCAATTCCT ATGGCAAGTA TCTGGAGCGC GATACCGGCA CGCCGCTGAT CCGGCTGATG
TTTCCGATTT TCGACCGGCA CCACCATCAC CGTTTTCCCC TGATGGGCTA TCAAGGTGGA
CTGCGTGTGT TGACGACGAT CCTCGATAAG ATCTTCGACC GACTCGATCG TGAGACGATG
CAGGTGGGAG TGACGGACTA TTCTTATGAC CTTACTCGCT AG
 
Protein sequence
MPQSAEKVLD HAPLFREPEY RKMLAEKKRN FERPYPDRTV TDQREFTKTW HYREINLARE 
ALVVNPAKAC QPLGAVYAAA GFERTMSFVH GSQGCVAYYR SHLSRHFKEP SSAVSSSMTE
DAAVFGGLKN MVDGLANTYK LYDPKMIAVS TTCMAEVIGD DLHGFIENAK DEGAVPHDFD
VPFAHTPAFV GSHVDGYDSM VKGVLENFWK GEQRTVKPGS INIIPGFDGF CVGNNRELKR
LLNLMGVSYT FIQDASDQFD TPSDGEFRMY DGGTKIEDVR AALNAEATVS LQQYNTRRTL
EYCKAAGQAT VSFHYPLGVK ATDEFLVKVS EISGREIPEA IRLERGRLVD AMADSQSWLH
GKKYAIYGDP DFVYAVARFT METGGEPTHC LATNGTPAWE AEMKKLLASS PLGNDAQVWT
NKDLWAMRSL LFTEPVDLLI GNSYGKYLER DTGTPLIRLM FPIFDRHHHH RFPLMGYQGG
LRVLTTILDK IFDRLDRETM QVGVTDYSYD LTR