Gene Rleg2_5043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5043 
Symbol 
ID6978137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp690035 
End bp691126 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content54% 
IMG OID643394186 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_002279004 
Protein GI209547086 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0411446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAGG TTGCCAGAAT AGCGCCTGCC GCTGACAAGT CAGAAGCGAC AAATTTTGGT 
GGCTACGGAT CTTTGTCCGA TGGCAGTTCA GTGTCAACAG CTATAGATCC AGCGACCTAT
GAAAATATCA AAGACCACCC GTGCTTCTCG CGGGACGCTC ATCGGAATTA TGCGCGATTG
CATCTAGCGG TAGCGCCCGC CTGCAACATC CAGTGCAACT ACTGCAATCG AAAATATGAC
TGTGCGAATG AAAGCAGGCC AGGGGTAGCA TCGCATCGGC TAACTCCCGA TCAGGCTCTG
CGCAAGACCA TGGCTGTCGC CAGCGAGGTC CCGCAGCTTT CTGTAGTCGG CATCGCCGGG
CCAGGCGATG CTTGCTACGA TTGGAATAAG ACAAAAGCAA CACTCATACC TATCGCTCGG
GAAATTCCGG ACATCAAGCT GTGCGTCTCG ACCAATGGCC TCGCACTACC TGACCGTGTC
GAGGAGCTCG TCGACATGAA TGTCGGACAC GTCACGATCA CCGTCAATAT GGTAGATCCG
AATATCGGAA CGAAGATCTA TCCGTGGATA TTCTATCAGG GCCGCCACTA TAACGGCATC
GAGGCCGCGA AGATCCTTCA TGAGAGGCAA ATGCTCGGGC TGGAAATGCT TACAGAGCGC
GGAATCCTTA CGAAAATCAA TTCGGTAGTG ATCCCGGGGG TGAACGACGA GCATCTCATC
GAAGTTAATA AGTGGGTCAA GGACAGAGGC GCATTGGTGC ACAACATCAT GCCCCTGATT
TCAAAGGCGT CTCACGGTAC CTTCTATGGT TTGAACGGTC AGCGTAGCGC TGCGCCTTTT
GAACTAACCG CGCTTCGGGA CCGTCTCGAA GGCACCACGG AGGTTATGCG CCATTGCCGC
CATTGCCGCT CCGATGCAAT TGGACTGCTA AGTTATGATC GCGCGCGCGA GTTCACGCTT
GCCCAGCTGC CAGCCGAGCC AACCTACGAC GAGGAAAAAC GGCGCGCTTT TCGTCAGTTG
ATCGAGCGCG AGCAAGGCAG TCAAATATTG CAAGCAGGAG ATGCGATCAC AGCAGGTTTC
AGCGCGGCCT GA
 
Protein sequence
MRKVARIAPA ADKSEATNFG GYGSLSDGSS VSTAIDPATY ENIKDHPCFS RDAHRNYARL 
HLAVAPACNI QCNYCNRKYD CANESRPGVA SHRLTPDQAL RKTMAVASEV PQLSVVGIAG
PGDACYDWNK TKATLIPIAR EIPDIKLCVS TNGLALPDRV EELVDMNVGH VTITVNMVDP
NIGTKIYPWI FYQGRHYNGI EAAKILHERQ MLGLEMLTER GILTKINSVV IPGVNDEHLI
EVNKWVKDRG ALVHNIMPLI SKASHGTFYG LNGQRSAAPF ELTALRDRLE GTTEVMRHCR
HCRSDAIGLL SYDRAREFTL AQLPAEPTYD EEKRRAFRQL IEREQGSQIL QAGDAITAGF
SAA