Gene Rleg2_5608 details

Gene Information

Locus tag: Rleg2_5608
Symbol:
ID: 6978702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1254478
End bp: 1255761
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 61%
IMG OID: 643394706
Product: extracellular solute-binding protein family 1
Protein accession: YP_002279524
Protein GI: 209547606
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
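
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, although it agrees with the listed Gene Length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1254478, 1255761)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1284 427
```

Both results match the Gene Length and Protein Length fields of this record.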


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.959607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACATC GGATAAAGCG CATTCTTGCG GGCGCATCCA CGCTTCTGGC GTTGGCCGCG 
GCCGGCCCGT CGCATGCAGA AACCACGCTG TCTTTCCTGA TCGACAACAA TCCCGACACG
GTCGCAGCAG CCGAGGCCTT GGTTGCCGCC TATCAGACCA AGGCGCCCGG CGTGACGATC
GAAATCGAAC AGCGGCCGGG CGGCGGCGAG GGCGACAACA TCATCAAGAC GCGCCTGGCG
ACGGGGGAGA TGTCCGATGT ATTCCTCTAT AATTCGGGTT CGCTCTTGCA GGCGCTGAAG
CCCACGCAGA AACTGGTCGA TCTGAGCGGC CTGCCGTCAC AGGCAAAGGT GGACGAAAGC
TTCAAGGCGG TGGTCAGCGC CGACGGCAAG CTCTATGGCG TTCCCTTCGG CACGGCGATG
GCCGGCGGGA TCCTCTACAA CAGGAAGATC TATCAGGATC TCGGCCTCTC CGTTCCGAAG
ACATGGGCGG ATTTCATGGC GAACAACGCC AAGGTCAAGG CATCCGGCAA GGTCGCCGTG
GCGCAGACCT ATCGCGATAC GTGGACCTCG CAGCTGTTCG TTCTGGCCGA TTATTACAAT
CTGCATGCCG CCGTGCCGAA CTTTGCCGCC GACTATACCG CCAACAAGGC GAAATATGCC
GAGACGCCGG CGGCAATGAA GGGCTTCGAA CGGCTGAAGG ACGTTCATGA TGCCGGCCTG
ATGAACGAAG ACTTCGGCGC GGCAAGCTAC GACGACGGCT TGAGAATGGT GTCGACCGGC
GAGGCAGCGC ATTATCCGAT GCTGAGCTTC GCAGTCAGCG CGCTCAAGCA GAATTATCCG
GAGAACCTCG CAGATGTCGG CTTCTTCGCC CAGCCGAGCG ACGATGCCGC AACGAACGGC
CTGACGGTCT GGATGCCGCC GGGCCTTTAC ATTCCTGCGA CCAGTCAGCA TGCCGAGGAA
GCGAAAAAAT TCGTCGATTT CGCCGGGAGC GTCGAGGGCT GCAAGATCAT GGTGGAAACC
AACGCGGTCC AGGGCCCCTC CCTGGTCGAC GGCTGCGACC TGCCTGCCGA CGTGCCGCCG
GCGATCAAGG ATATGCTTCC CTATTTCGAG GCCAAGGACA AGACGACCCC GGCCCTGGAA
TTCGTTTCTC CCGTCAAGGG ACCGGCGCTC GAGCAGATCA CCGTCGAGGT CGGCTCCGGC
ATTCGCCAAC CAGCCGAGGC GGCGAAACTC TATGATGAGG ATGTGCGCAA GCAGGCCAAG
CAGCTCGGCC TGCCCAACTG GTAG
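
The sequence above is printed in space-separated blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation such as GC content. The following is a minimal sketch that uses only the first row of the listing as input; the helper names `clean_sequence` and `gc_percent` are hypothetical. The GC content reported for the record applies to the whole gene, so the value for a single row can differ from it.

```python
def clean_sequence(text: str) -> str:
    """Drop whitespace between the 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGACACATC GGATAAAGCG CATTCTTGCG GGCGCATCCA CGCTTCTGGC GTTGGCCGCG"
seq = clean_sequence(first_row)
print(len(seq), gc_percent(seq))  # 60 60.0
```

Passing the full listing (all rows joined into one string) through the same two functions gives the whole-gene figure.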
 
Protein sequence
MTHRIKRILA GASTLLALAA AGPSHAETTL SFLIDNNPDT VAAAEALVAA YQTKAPGVTI 
EIEQRPGGGE GDNIIKTRLA TGEMSDVFLY NSGSLLQALK PTQKLVDLSG LPSQAKVDES
FKAVVSADGK LYGVPFGTAM AGGILYNRKI YQDLGLSVPK TWADFMANNA KVKASGKVAV
AQTYRDTWTS QLFVLADYYN LHAAVPNFAA DYTANKAKYA ETPAAMKGFE RLKDVHDAGL
MNEDFGAASY DDGLRMVSTG EAAHYPMLSF AVSALKQNYP ENLADVGFFA QPSDDAATNG
LTVWMPPGLY IPATSQHAEE AKKFVDFAGS VEGCKIMVET NAVQGPSLVD GCDLPADVPP
AIKDMLPYFE AKDKTTPALE FVSPVKGPAL EQITVEVGSG IRQPAEAAKL YDEDVRKQAK
QLGLPNW
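
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are allowed, so the sketch builds the standard assignments and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 24 bases of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G,
# with the third base changing fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene: the first 10 residues of the protein.
print(translate("ATGACACATC GGATAAAGCG CATTCTTGCG"))  # MTHRIKRILA
# Last 24 bases of the gene: the final 7 residues plus the stop codon.
print(translate("CAGCTCGGCC TGCCCAACTG GTAG"))  # QLGLPNW*
```

The trailing `*` is the stop codon TAG, which is why the protein is one residue shorter than the number of codons in the gene.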