Gene Rleg_4924 details

Gene Information

Locus tag: Rleg_4924
Symbol:
ID: 8007519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 301314
End bp: 302786
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 56%
IMG OID: 644821843
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_002973103
Protein GI: 241113268
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
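
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(301314, 302786)
print(length_bp)                  # 1473
print(protein_length(length_bp))  # 490
```

Both results agree with the Gene Length (1473 bp) and Protein Length (490 aa) values listed in the record.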


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGAG GGATGAGTAC GTTCAGGATC ACCGACACAG CACCTGCCGC ATGCGAGTCG 
GAGGCAACAG CGTTCGGTGA CTATCCGCCT TCGTCCCGTG GCAGTTCCGA GCCGGATGCT
CTGGCTCCAG CGATCCGTGA AAAGATCAAA GACCACCCGT GCTTCTCGCG GGAAGCGCAT
CTTTATTTCG CGCGGATGCA TCTCGCGGTG GCACCAGCCT GCAACATCCA ATGCAATTAC
TGCAATCGAA AATATGACTG CGCAAACGAA AGCAGACCAG GAGTAGCATC ACATAGGCTC
ACTCCCGACC AGGCGCTGCG CAGGGCCATT GCGGTTGCAA ACGAAGTGCC GCAGCTCTCA
GTGGTCGGCA TCGCCGGGCC AGGCGATGCC TGCTATGATT GGAGGAAGAC AAAAGCGACC
CTCATACCGA TCGCCCGCGA AATCACCGAC GTCAAGCTTT GCATCTCAAC CAATGGCCTC
GCCCTCCCTG AACATGTCGA CGAGCTTGTC GACATGAATG TCGGGCACGT TACAATCACC
ATCAACATGG TAGATCCGAA GATCGGGACC GAGATCTACC CCTGGATATT CTATGATGGC
CGCCGCTACA ACGGTATCGA CGCGTCCAGG ATCCTCCATG AGAGGCAAAT GTTGGGGCTC
GAAATGCTGA CAGAACGCGG CATACTCACC AAGGTCAATT CGGTGATGAT CCCGGGCGTC
AACGACGAGC ATCTCATCGA GGTTAACAAG TGGGTCAAGG ACAGGGGCGC GTTTATGCAT
AACGTAATGC CCCTGATCTC AGAGCCTTCA AATGGTACTC TATATGGTCT GAACGGTCAG
CGTTGCCCTA CCCCTTCTGA GTTAATCGCG CTTCGGGACC GGCTTGAAGG CAACACGAAG
GTGATGCGCC ATTGCCGTCA GTGCCGTTCC GATGCAGTCG GTCTGCTCAG TGATGATCGC
GCACACGAAT TCACGATTTC CCAACTCCCA GCTGAAGCGA CCAACGACAG TGGCAAGCGC
CATGCCTATC GCAAGTTGAT CGAGCGCGAG CGACGCGGCC AAACATTGGA AGCAAGGGGC
GCGGCCATAC CGGTCTCAGC TCCATCTGAC GAACTTCTCC TTATTGCTGT AACCACCAAT
GGTGGAGGCC GGGTCAATGA ACATTTCGGT CATGCGCAGG AAATACAGAT TTTCTCGGTC
TGTAAAAAAG GCCTCGGATT GATAGGTCAC CTGAAGATCG ACCCGTACTG CCTTGGTGGA
TGGGGGGAGG AGGCTAGTCT CAACAGTATC ATCAATGCGC TCGAAGGCTT AGATTTGCTG
ATTTGTTCTC AGATCGGCAA TGGCCCTACG AATAAGCTCG CACGCCGAGG TGTTCGAGCA
ACGGGCGCTT ATGGCGGCTC CTACATCGAG CAGGCAATCG ACGCCCATTA TAGCGCGGTG
CTTCACGACG ACGCTTTAGC AGCCGCGATT TGA
 
Protein sequence
MSRGMSTFRI TDTAPAACES EATAFGDYPP SSRGSSEPDA LAPAIREKIK DHPCFSREAH 
LYFARMHLAV APACNIQCNY CNRKYDCANE SRPGVASHRL TPDQALRRAI AVANEVPQLS
VVGIAGPGDA CYDWRKTKAT LIPIAREITD VKLCISTNGL ALPEHVDELV DMNVGHVTIT
INMVDPKIGT EIYPWIFYDG RRYNGIDASR ILHERQMLGL EMLTERGILT KVNSVMIPGV
NDEHLIEVNK WVKDRGAFMH NVMPLISEPS NGTLYGLNGQ RCPTPSELIA LRDRLEGNTK
VMRHCRQCRS DAVGLLSDDR AHEFTISQLP AEATNDSGKR HAYRKLIERE RRGQTLEARG
AAIPVSAPSD ELLLIAVTTN GGGRVNEHFG HAQEIQIFSV CKKGLGLIGH LKIDPYCLGG
WGEEASLNSI INALEGLDLL ICSQIGNGPT NKLARRGVRA TGAYGGSYIE QAIDAHYSAV
LHDDALAAAI
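
The gene and protein sequences above are printed in blocks of 10 characters. The sketch below shows one way to strip that layout, compute a GC percentage, and translate the nucleotide sequence. It assumes the gene sequence is given in the coding direction, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it allows, which this sketch does not model). The names `clean`, `gc_content` and `translate` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in the conventional T, C, A, G ordering.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCCAGAG GGATGAGTAC GTTCAGGATC"))  # MSRGMSTFRI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields of the record to be cross-checked in the same way.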