Gene Rleg_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2454 
Symbol 
ID8013432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2454273 
End bp2455151 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content58% 
IMG OID644825035 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002976265 
Protein GI241205169 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.459609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTTT CTCTTTCGAG AAGGATGGTG ACGTGCGCTC TGCTGGCAGG CAGCTTCATG 
TCCCTTTCAT CTATCGCCAA TGCTTTCGAA CTTGCCGAGC AGGGCAAGCT GACGGTTGCC
TTCACCGGCG ACATGCCCGG CTCCGGCTGG CAGGACGGGA AGCTGGTAGG CTATGACGGT
GAAATCATGC AACGCATTGC CGAGAAGCTC GGCCTCAAGA TTCAGCCTGC CCTCATGGAA
TGGTCGGGTA CGATCGCCTC CGTTCAGTCG GGCCGCGTCG ACGTCATGCT CGGCACCATG
GGCTGGACCG AAAAACGCAC GAAGATCATG ACGCTGTCCG AGCCGATCCA CTACTTCAAG
AACGGCATCA TGCAATCGAC GAAGACGAGC TGGGACAAGC TTTCTGATCT GGAAGGCAAG
AAGATCGGCA CGATCACCGG CTTTTCGTTC GTCCCTGAAC TGAAGAGCAT CAAAGATCTC
CAGCTTTCGC TCTACGACAC TTCCGATGCG GCCGTGCGCG ATCTTATCGC CGGGCGCATC
GACGCTGTCA TCGGCGATCC GCCGGTGATC TCCTACGCCA TCAAGCAGAA CCCCGATTGG
AATATGCACT TCCTCGCCTT CACCGATAAC AGTCCGGATT TTCCGCTGCT GACCGGTCTC
GGCCAGGTCG TTTATGGCCT CAACCAGAAG AACGACGATC TGCGCCAGAA GATGGACGCC
ATCATCGCCG ACATGTGGAA GACTTGCGAG ATGAAGGAAA TCGGCGCCCG CTACGGATTG
TCGTCGGATG TCTGGTTCAA GCCTGCGGGT CAAAACTTCC GCGCCGGTGT CGATCGCCCC
GCAGATTACA AGCTGCCGTC CTGCGCAGCT GGCGGCTGA
 
Protein sequence
MALSLSRRMV TCALLAGSFM SLSSIANAFE LAEQGKLTVA FTGDMPGSGW QDGKLVGYDG 
EIMQRIAEKL GLKIQPALME WSGTIASVQS GRVDVMLGTM GWTEKRTKIM TLSEPIHYFK
NGIMQSTKTS WDKLSDLEGK KIGTITGFSF VPELKSIKDL QLSLYDTSDA AVRDLIAGRI
DAVIGDPPVI SYAIKQNPDW NMHFLAFTDN SPDFPLLTGL GQVVYGLNQK NDDLRQKMDA
IIADMWKTCE MKEIGARYGL SSDVWFKPAG QNFRAGVDRP ADYKLPSCAA GG