Gene Rleg2_5054 details


Gene Information

Locus tag: Rleg2_5054
Symbol:
ID: 6978148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 701587
End bp: 703128
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 52%
IMG OID: 643394194
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_002279012
Protein GI: 209547094
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
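
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon that is not translated.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(701587, 703128)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

Both values agree with the Gene Length and Protein Length fields listed above.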


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAAT CGGCTGATAA GGTTCTCGAT CATGCTTCTC TGTTCTGCGA GCCTGAATAC 
AGGAATCTGC TTGCTAGAAA AAAGCTGAAA TTCGAATGTC CGCGTCCGAG GGAGGTCGTT
TCTTATCAGC GCGAGTTCAC CGAGACTTGG GAATATCGAG AAAGAAACCT GGCCCGCGAA
GCGCTTGTCG TTAATCCTGC CAAAGCCTGC CAGCCCCTCG GCGCGGTCTT CGCAGCCGCT
GGATTTGAGC GGACAATGTC CTTTGTCCAT GGCAGTCAGG GTTGCGTGGC ATATTACCGC
TCACACCTGT CACGTCATTT CAAGGAACCC GCTTCCGCGG TTTCATCTTC GATGACAGAA
GACGCTGCTG TATTCGGCGG GCTTAAAAAT ATGGTCGAAG GGCTCGCCAA CACCTATAGG
CTCTACGACC CCAAGATGAT CGCGGTCTCG ACGACTTGCA TGGCGGAGGT CATTGGGGAT
GATCTTCACG GCTTTATTGA AGCTGCAAAA AATGAAGGCT CTGTTTCAAT CGATTTCGAC
GTGCCTTTCG CCCACACCCC AGCTTTCGTC GGCAGTCATG TAGATGGCTA CGACAACATG
GTGAAAGGCA TTTTGGAAAA CTTCTGGAAG AACACGGAAC GTGTTGCGAC GCCTGGCCTC
GTCAACATCA TTCCGGGCTT CGATGGCTTC TGCGTTGGCA ACAATCGCGA GATAAAACGT
CTGCTCGACA CGATGGGAGT GAATTACGTT TTCATTCAAG ACGCTTCAGA CCAGTTCGAC
ACGCCTTCCG ACGGCGAATA CCGCATGTAT GATGGAGGGA CAAAGATCGA CGATGTCACG
GTAGCGAAAC ATGCGGAGGC AACAATATCG CTTCAGCACT ACAATACACG CAAGACGTTG
GACTATTGCC GTGAGCTTGG TCAGACTACT GCCTCGTTCC ACTACCCGCT CGGAATTGGG
GCAACAGACG AGTTCCTTTT GAAAATATCA GAGCTCTCCA AAAAGGAAAT TCCCGAGGCT
TTCGAACGTG ATCGAGGCCG GCTGGTCGAT GCCATGGCTG ATAGTCAATC CTGGTTGCAT
GGCAAAAAAT ACGGAATCTA TGGCGACCCG GACTTCGTCT ACGCAATGGC GCGTTTTGTT
ATGGAAACCG GTGGCGAACC CACACATTGC CTTGCGACCA ACGGCACCGC AGCATGGGAA
GCTGAAATGA AAGGATTGCT TGCAACTTCT CCGTATGGGG GAGGTGCACA GGTATGGGCT
GGCAAAGACC TCTGGGCAAT GCGCTCGCTT CTCTTGACGG AACCCGTCGA TCTGTTAATC
GGCAATTCCT ATGGCAAGTA TCTTGAGCGC GACACTGGCA CGCCACTGGT TCGGCTTTCC
TTTCCGATTT TCGATCGCCA TCATCATCAT CGCTTTCCAC TCATGGGCTA CCAAGGCGGA
TTGCGTCTCC TCACGGTTAT TCTCGACAAG ATCTTCGACA GGCTCGATCA GGAGACAATG
TTGCTGGGCG TGACAGATTA TTCCTACGAC CTCACGCGCT AA
 
Protein sequence
MPQSADKVLD HASLFCEPEY RNLLARKKLK FECPRPREVV SYQREFTETW EYRERNLARE 
ALVVNPAKAC QPLGAVFAAA GFERTMSFVH GSQGCVAYYR SHLSRHFKEP ASAVSSSMTE
DAAVFGGLKN MVEGLANTYR LYDPKMIAVS TTCMAEVIGD DLHGFIEAAK NEGSVSIDFD
VPFAHTPAFV GSHVDGYDNM VKGILENFWK NTERVATPGL VNIIPGFDGF CVGNNREIKR
LLDTMGVNYV FIQDASDQFD TPSDGEYRMY DGGTKIDDVT VAKHAEATIS LQHYNTRKTL
DYCRELGQTT ASFHYPLGIG ATDEFLLKIS ELSKKEIPEA FERDRGRLVD AMADSQSWLH
GKKYGIYGDP DFVYAMARFV METGGEPTHC LATNGTAAWE AEMKGLLATS PYGGGAQVWA
GKDLWAMRSL LLTEPVDLLI GNSYGKYLER DTGTPLVRLS FPIFDRHHHH RFPLMGYQGG
LRLLTVILDK IFDRLDQETM LLGVTDYSYD LTR
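
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. It is an illustration under stated assumptions, not the method the database used: the helper names are hypothetical, and the codon assignments are those of the standard code, which translation table 11 shares (table 11 differs only in the set of permitted start codons). Only the first three blocks of the gene sequence are used as input here.

```python
# Minimal sketch: clean a block-formatted sequence, compute GC fraction,
# and translate it. Helper names are hypothetical, chosen for illustration.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses these same codon assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCCGCAAT CGGCTGATAA GGTTCTCGAT")
print(translate(fragment))  # MPQSADKVLD
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.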