Gene Rleg2_5589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5589 
Symbol 
ID6978683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1233635 
End bp1234645 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content62% 
IMG OID643394687 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002279505 
Protein GI209547587 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGA CATTTGCTTT CGCGGCGGTT GCCGCATTCG TCGCGGTCTC CGCGCCGGCC 
AAAGCCGACA ACATGGTCTT CTCGAGCTGG GGCGGAACGA CCCAGGATGC GCAGAAGGCG
GCATGGGCGA GCCCCTTCAC CGAAAAGACC GGCATCACCG TCGTGCAGGA CGGCCCGACC
GACTACGGCA AGCTGAAGGC CATGGTCGAG GCCGGCGAAG TCACCTGGGA CGTCGTCGAC
GTCGAAGGCG ATTATGCCGC CCAGGCCGGC AAGAACGGCC AGCTCGAGAA GCTCGACTTC
TCCGTCATCG ACAAATCCAA GCTCGATCCG CGCTTCGTCA CCGACTATTC GGTCGGCAGC
TTCTATTATT CCTTCGTCAT CGGCTGCAAT GCCGATGCCG TCAAAGCCTG CCCGAAAACA
TGGGCTGACC TGTTCGACAC GGCAAAGTTC CCCGGCAAGC GCACATTCTA CAAGTGGTCG
GCTCCGGGCG TGATCGAAGC GGCGCTGCTT GCCGACGGCG TGGCTGCCGA CAAGCTTTAT
CCGCTTGACC TCGACCGCGC CTTCAAGAAG CTCGATACGA TCAAATCGGA CATCATCTGG
TGGTCGGGCG GCGCCCAGTC GCAGCAGCTC CTGGCATCCG CCGAAGCGCC CTTCGGCAGC
GTCTGGAACG GCCGCATGAC GGCGCTTGCC GCCACCGGCA TCAAGGTTGA AACCTCCTGG
GAACAGAACA TCACCGCTGC CGATGCGCTC GTCGTGCCGA AGGGCTCGCC GAATGTCGAA
TCCGCGATGA AGTTCATTGC GCTGGCGACC TCGGCCGAAC CGCAGGCCGC TCTGGCAAAA
GCCACCGGAT ATGCGCCGAT CAATGTCGAC TCCGCCAAGC TGATGGACCC GGAAACGGCC
AAGACCCTGC CGGACCAGCA GACAGCAAGC CAGGTCAATG CCGATATGAC CTATTGGGCT
GATAATCGCG ATGCCATCGG CGAGAAGTGG TACGCTTGGC AGGCGAAATA A
 
Protein sequence
MKTTFAFAAV AAFVAVSAPA KADNMVFSSW GGTTQDAQKA AWASPFTEKT GITVVQDGPT 
DYGKLKAMVE AGEVTWDVVD VEGDYAAQAG KNGQLEKLDF SVIDKSKLDP RFVTDYSVGS
FYYSFVIGCN ADAVKACPKT WADLFDTAKF PGKRTFYKWS APGVIEAALL ADGVAADKLY
PLDLDRAFKK LDTIKSDIIW WSGGAQSQQL LASAEAPFGS VWNGRMTALA ATGIKVETSW
EQNITAADAL VVPKGSPNVE SAMKFIALAT SAEPQAALAK ATGYAPINVD SAKLMDPETA
KTLPDQQTAS QVNADMTYWA DNRDAIGEKW YAWQAK