Gene Rleg2_5223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5223 
Symbol 
ID6978317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp854809 
End bp855699 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content62% 
IMG OID643394337 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002279155 
Protein GI209547237 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.170331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTAA AGAAAATCAC CACTGTTGCG CTCGCCGGCG TCATGCTCTC CGGCGCAGCC 
TTCGCCGAAG ACGCAAGCCT GCCGAAACTC TCGGTCAACG AGGAGCTGAA GGCCAAGTTG
CCGGAGGCGA TCCGCACCGC CGGCAAGATG ATTTCCGTCA ACAACGGTTC CTTCCCTCCC
TATGAGATCG TCACCGGCAC CGAGATGACC GGTGCCAGCG CCGACCTGAC CGACGCGCTC
GGACAGGTGC TTGGCGTCAC GATCGAGCAT CAGACGGTCG GCGGCCTGCC CGCCCTCCTC
GCCGGCGTCA ATTCCGGCCG CTACCAGTTC GCCTTCGGCC CCGTCGGCGA CTTCAAGAGC
CGCGAAGAGG CCAACGACTT CGTCGACTGG GTCCAGGAAT TCGTGGTTTT CGCGGTCCAA
AAGAGCAATC CGAAAGCGAT CACCTCACTC GACACCGCCT GCGGCAACCG TATCGCCGTG
ATGGCCGGCG GCTCGGCGGA AAAGGTCATC CAGGTCCAGG CCGAGAAGTG CAAGACCGAT
GGCAAGGATC CGATCGAAGT CCAGTCCTTC ACCGATCAAC CGAGCTCAAT CCTCGCTGTT
CGATCGAAGC GTTCGGACGC CTTCTTCTCC TCCCAGGCGC CGCTCACCTA TTTCGTGTCG
CAGTCCAATG GCCAGCTGGA GCTCACCGGT GTCGGTCAGA AGAACGGCTT CGAAGCGCTC
TACCAGGGCG CCGTCGTTCC GAAAGGCTCG CCGCTCGGCC CGGTGCTCCG TGACGCGGTC
AAGTTTCTGA TGGATAATGG CACCTATGCC GCCATCATGA AGAAGTGGGG CCTCGAGAAC
AACATGATCA AGGAGCCGGG CATCAACCTC GGCGGGACGT TGCCGAAATG A
 
Protein sequence
MQLKKITTVA LAGVMLSGAA FAEDASLPKL SVNEELKAKL PEAIRTAGKM ISVNNGSFPP 
YEIVTGTEMT GASADLTDAL GQVLGVTIEH QTVGGLPALL AGVNSGRYQF AFGPVGDFKS
REEANDFVDW VQEFVVFAVQ KSNPKAITSL DTACGNRIAV MAGGSAEKVI QVQAEKCKTD
GKDPIEVQSF TDQPSSILAV RSKRSDAFFS SQAPLTYFVS QSNGQLELTG VGQKNGFEAL
YQGAVVPKGS PLGPVLRDAV KFLMDNGTYA AIMKKWGLEN NMIKEPGINL GGTLPK