Gene Rleg2_2327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2327 
Symbol 
ID6981066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2386807 
End bp2388081 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content63% 
IMG OID643397040 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002281828 
Protein GI209549911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.169538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTTT TGAAGCAATT TCCGTCAACC ACCCGCAGAC GCTTCCTGAA GGGCGCGGGC 
CTGGTGTCCG CCGCGGCCGT CACCGGCAGC TTCCCGATCC CGGCGATCGC GCAGGCGCAG
GAGGTCACGA TGATCTCGGC CGAGAACAAC GGCGAAGCGC TTGACGCGCT GAAGGCGATT
GCCGCGGGCT TCAGCAAGGA AGCCGGTGTC AATGTCGTCA TCAACAATAT GGATCACGAG
GCCCACAAGA CGGCGATCCG CAACTATCTC GTCGCCGGCG CGCCGGATGT CTGCTCCTGG
TTTTCCGGCA ACCGCATGCG CGCCTTCGTC AAGCGCGGCC TGTTCGACGA TATTTCCGAT
CTCTTCGAGA AGGAAAAGTA CAAGGATGTG CTGGGGGCGA CGGCCGGCGC CGTCACCGAA
GACGGCAAGC AGTACGGCCT TCCCACCGGC GGCACGCTCT GGGGCATGTT CTACCGCAAG
GACGTGTTCG AAGAGCATGG CCTCACCGTG CCGAAGACCG CCGAGGACTT CATGGCCTAT
GGCGACAAGT GCAAGGCGGC CGGCATCACC CCGGTTGCGA TCGGCACCAA GGAATTGTGG
CCGGCGGCCG GCTGGTTCGA TCAGATGAAC CTGCGCATCA ACGGGCTCGA CAAGCACATG
GCGCTGATGA ACGGCGAAAT GAGCTATCTC GACCCGTCGC TGACCGCCGT CTTCGACCAG
TGGGAAGCGA TGATTTCCAA GGGCTTCTTC ACCCCCAACC ATACCTCCTT CGGTTGGCAG
GAGGCCGCAG CGCTTCTGGC GCAGAAGAAG GCGGGGATGA TGAACCTCGG CGCCTTCCTG
CGTTCGGCCT TCACCGCCGA GGATCTGCCG CAGCTTGGCT ACGCGACCTT CCCGGTGCTC
GACGCCAAAG TCGGTCATTT CGAGGAGTTC TCGGTCAATT CGATCCACAT TCCCGCCAAG
GCGAAAAACA AGCAGGGGGC CCGCGACTTC CTCGCCTATT TCTACAGACC GGAGAACCTG
GCGGCCTATC TCGAGCCCGG CGGCAACGTG CCGCCGCGCA ACGACCTGCC TCAGAGCAAG
GATCCGCTGG TCAATATCGC TGTCGAGACG ATGAAGACGG TGCAGGGCAC CTCGCAATAT
TACGACCGCG ACAGCGACCC CGACATGGCC CAGGCCGGCC TCGTCGGCTT CCAGGAGTTC
ATGGCCAAAC CCGACCGGCG CAAGGCGATC CTCACGCGCC TCGAGGGGAC GCGCAAGCGG
ATCTATAAGA TCTGA
 
Protein sequence
MTFLKQFPST TRRRFLKGAG LVSAAAVTGS FPIPAIAQAQ EVTMISAENN GEALDALKAI 
AAGFSKEAGV NVVINNMDHE AHKTAIRNYL VAGAPDVCSW FSGNRMRAFV KRGLFDDISD
LFEKEKYKDV LGATAGAVTE DGKQYGLPTG GTLWGMFYRK DVFEEHGLTV PKTAEDFMAY
GDKCKAAGIT PVAIGTKELW PAAGWFDQMN LRINGLDKHM ALMNGEMSYL DPSLTAVFDQ
WEAMISKGFF TPNHTSFGWQ EAAALLAQKK AGMMNLGAFL RSAFTAEDLP QLGYATFPVL
DAKVGHFEEF SVNSIHIPAK AKNKQGARDF LAYFYRPENL AAYLEPGGNV PPRNDLPQSK
DPLVNIAVET MKTVQGTSQY YDRDSDPDMA QAGLVGFQEF MAKPDRRKAI LTRLEGTRKR
IYKI