Gene Rleg2_4771 details

Gene Information

Locus tag: Rleg2_4771
Symbol:
ID: 6977865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 402637
End bp: 403797
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 61%
IMG OID: 643393935
Product: extracellular solute-binding protein family 1
Protein accession: YP_002278753
Protein GI: 209546835
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
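
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length(402637, 403797)
    print(length, protein_length(length))
```

With the values from this record, the two functions give 1161 bp and 386 aa, matching the Gene Length and Protein Length fields.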


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGG ACTACAAGGA TCATCTGCCT ATTACCCCTG AAGGCTTCAT GGACGAGTTC 
ATGCGCCTGA AGCGCGGCTC CGTCAGCCGC CGCCACTTCC TCGGCGTCAC CGGCCTCGGG
CTTGCGACCG CCGTGTTGTC GCGTTTCCCC GGTGCGCTGT CGACGCCCGC CTACGCCGAG
GACCTCGGAA CCCAGATGTC GATCGCCACC TGGCCGAATT ACCACGACCC TGCGACCTTC
GAGAATTTCA AAGCGGCGAC CGGCGTTGCC GTCGAGGTCA ACGTCTTCGG CTCCAACGAA
GAAATGCTGG CCAAGCTCCA AGCGGGCGCC TCCGGCTGGT CGCTCTTCGT GCCGACCAAC
TACACGATCT CCACCCACCA CAAGCTCGGC CTGATCGACG AACTCGATCT CTCCAAGATC
CCGAATTTCA GCCAGGCGAC GGAAAATCCG CGCTTCACCA AGGAAGGCAT GATCGACGGC
AAGACCTACG CCGTGCCGAA GAACTGGGGT ACGACCGGGT TCTCGGTCAA CACCGCAAAG
ATCAAGACCA AGCTTTCGAG CTGGAAGGAC TTCTTTGACA TCGCCCAGAC GGAAGCCGAC
GGCCGCGCCA TGGTGCATGA CTATCAGTTG ACGACGATCG GCAGCGCGCT GGTTTCGCTC
GGCTACGATT TCAATTCGAT CAAGGCGGAC GAACTCGCAA AGGCCGAGGA ACTGCTGATC
AAAGTCAAAC CGCACCTTTT CGCCATCAAC AGCGACTACC AACCGGCCAT GCGCGCCACC
GACGCCTGGC TCACCATGTG CTGGACCAAC GACGGAGCGC AGCTCAACCG TGACGTCCCC
GAGATCGCCT ATGTGCTTGG CACCGACGGC GGCGAGATCT GGACCGACTA TTACGCCATT
CCGAAGGACG CGCCGAACAA GGCGGCGGGT TACGCGCTGC TCAACTACCT CATGGATCCG
GCCAATGCCG TCAAGGAGCA CGTCGCCAAT GGCGCACCGA CGACCGACAG CCGGGTCATC
GCACTGCTGC CGAAGGAGAT CACAGCGAAC AAGATCGTCT ATCCGGACGA GGCGTCATTG
ACGCCGCTGG AATTCGGCGC GGCGGTGACG TTGACCGACC CCGGCCGGGC AGAACTGATG
GCGCGTTTCA AGTCGGCTTG A
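
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and the GC content recomputed; `clean_sequence` and `gc_content` are hypothetical helper names, and how the record rounds its GC value is not stated, so small differences from the listed percentage are possible.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence above, as they appear in the record.
    raw = "ATGAGCAAGG ACTACAAGGA"
    seq = clean_sequence(raw)
    print(seq, round(gc_content(seq), 1))
```

Pasting the full gene sequence into `raw` gives a cleaned string whose length can be compared with the Gene Length field before the GC percentage is interpreted.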
 
Protein sequence
MSKDYKDHLP ITPEGFMDEF MRLKRGSVSR RHFLGVTGLG LATAVLSRFP GALSTPAYAE 
DLGTQMSIAT WPNYHDPATF ENFKAATGVA VEVNVFGSNE EMLAKLQAGA SGWSLFVPTN
YTISTHHKLG LIDELDLSKI PNFSQATENP RFTKEGMIDG KTYAVPKNWG TTGFSVNTAK
IKTKLSSWKD FFDIAQTEAD GRAMVHDYQL TTIGSALVSL GYDFNSIKAD ELAKAEELLI
KVKPHLFAIN SDYQPAMRAT DAWLTMCWTN DGAQLNRDVP EIAYVLGTDG GEIWTDYYAI
PKDAPNKAAG YALLNYLMDP ANAVKEHVAN GAPTTDSRVI ALLPKEITAN KIVYPDEASL
TPLEFGAAVT LTDPGRAELM ARFKSA
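
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds a codon table from the compact amino acid string used for the standard genetic code, whose codon-to-residue assignments are the same as those of translation table 11 (the two tables differ in their permitted start codons). It assumes the CDS begins with ATG, as this one does, and stops at the first stop codon; `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned, in-frame CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGAGCAAGG ACTACAAGGA TCATCTGCCT"))
```

The first thirty bases translate to MSKDYKDHLP and the final twenty-one bases (GCGCGTTTCA AGTCGGCTTG A) to ARFKSA, which agree with the start and end of the protein sequence listed above.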