Gene Rleg2_5039 details

Gene Information

Locus tag: Rleg2_5039
Symbol:
ID: 6978133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 687046
End bp: 688083
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 57%
IMG OID: 643394182
Product: nodulation factor exporter subunit NodI
Protein accession: YP_002279000
Protein GI: 209547082
COG category: [V] Defense mechanisms
COG ID: [COG1131] ABC-type multidrug transport system, ATPase component
TIGRFAM ID: [TIGR01288] ATP-binding ABC transporter family nodulation protein NodI
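
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; neither is stated in the record, and the helper names are illustrative only.

```python
# Values copied from the record above.
START_BP = 687046
END_BP = 688083
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the 1038 bp span corresponds to 346 codons, that is, 345 residues plus one stop codon.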


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGTC AATCGGATCT AGAGAGAGCC GTCGCTGAGA CACTCCGCCG CGAAATCAAT 
CTGTTGGAGC GGCGCCACCT TTCGATTCCA GAAGCAATTG CTCGCCGCGT GGGATCGATG
TCTTCCGTTG CGATCGAGCT TGCCGGTGTT CGCAAATCAT ATCAGGGCAA GCCTGTTGTT
GACGGGGTGT CTTTTCACAT CGCATCAGGA GAGTGCTTTG GCCTGCTAGG TCCGAACGGC
GCGGGAAAGA GCACCATCAC CCGTATTATT CTGGGAATGA CGTCGCCTGA TGCGGGCAAC
ATTTCTGTGC TCGGAGTGCC GGTGCCTGAC AAGGCCCGCG CGGCGCGCGC GCGTATTGGG
GTCGTTCCAC AATTCGATCG TCTCGATTTG GAGTTCACGG TCCGGGAAAA CCTTGTGGTT
TACGGCCGGT ATTGCCGAAT GAAGGCCCGC GACATCGAAG CGGTTATCCC ATCGTTGCTG
GAATTTGCGC GCCTTGAGAA AAAGGCGGAT ACGCGCGTGG CGGATCTTTC GGGAGGCATG
AAACGGCGCC TCACGTTGGC GCGCGCATTA ATCAATGACC CGGAGATCCT GATACTAGAT
GAACCGACCA CCGGCCTCGA CCCGCACGCA CGCCACCTGA TCTGGGAGCG ACTGCGATCG
CTCTTGGCAA AAGGAATGAC AATTCTCTTG ACTACCCATT TCATGGAGGA GGCCGAGCGC
CTCTGTGACC GTCTATGTGT GCTCGAAAGT GGAGTTAAGA TCGCCGAAGG CCGCCCCCAG
GAACTGATAG ACGAACACAT CGGCTGTTCA GTTATCGAGA TCTATGGCGG TAACCCACAG
GAGCTCAGCA TTTTGATCAA GCCAAATGCA GGGCGGGTGG AGATTAGTGG CGAGACCCTG
TTCTGCTATA CCCCAGACCC GGATCAAGTT CGCGCGCAAC TGCGAGGATA CAAAGGTCTG
CGCCTCCTCG AGCGGCCGCC GAACCTAGAG GACGTATTCT TGCGATTAAC CGGACGTGAG
ATGGAGAAGC AGCAATGA
 
Protein sequence
MNGQSDLERA VAETLRREIN LLERRHLSIP EAIARRVGSM SSVAIELAGV RKSYQGKPVV 
DGVSFHIASG ECFGLLGPNG AGKSTITRII LGMTSPDAGN ISVLGVPVPD KARAARARIG
VVPQFDRLDL EFTVRENLVV YGRYCRMKAR DIEAVIPSLL EFARLEKKAD TRVADLSGGM
KRRLTLARAL INDPEILILD EPTTGLDPHA RHLIWERLRS LLAKGMTILL TTHFMEEAER
LCDRLCVLES GVKIAEGRPQ ELIDEHIGCS VIEIYGGNPQ ELSILIKPNA GRVEISGETL
FCYTPDPDQV RAQLRGYKGL RLLERPPNLE DVFLRLTGRE MEKQQ
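
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables are understood to differ in permitted start codons), and all function names are illustrative.

```python
# Compact codon table in TCAG order (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAACGGTC AATCGGATCT AGAGAGAGCC"))  # MNGQSDLERA
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the listed protein sequence and the GC content field.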