Gene Rleg_2906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2906 
Symbol 
ID8013836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2902898 
End bp2904157 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content61% 
IMG OID644825476 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002976705 
Protein GI241205609 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.194008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA CTGCTCTGAA AAGCCTGCTG CTTGCCTCCA GTCTGCTGAG TTCAGCGGGT 
CTCGTCCACG CCGCCGACGT CACGCTGACG GTCGAAAGCT GGCGTAACGA CGACCTGCAG
ATCTGGCAGG AAAAGATCAT CCCGGCTTTC GAAGCCAAGA ACCCGGGCAT CAAGATCGTC
TTCTCGCCGA CCGCGCCGAC CGAATACAAC GCCTCGCTGA ACGCCAAGCT CGATGCCGGC
TCCGCAGGCG ATATCATCAC CTGCCGTCCG TTCGATGCCT CGCTCGAACT CTTCAACAAG
AAGCAGCTCA CTGACATCAC CAGCCTGCCT GGCATGGAGA ACTTCTCGCC GGTCGCCAAG
GCCGCCTGGA GCACCGACGA CGGCAAGTCG ACCTTCTGCG TGCCGATGGC TTCTGTCATC
CACGGCTTCA TCTACAACAA GGATGCCTTC GACAAGCTCG GCATCTCGGT GCCGAAGACC
CAGGACGAGT TCTACGCAGC GCTCGACAAG ATCAAGGCCG ACGGCACCTA CATTCCGCTC
GCCATGGGCA CGAAGGACCT CTGGGAAGCC GCTACCATGG GCTACCAGAA TATCGGCCCG
AACTACTGGA AGGGTGAGGA TGGCCGCGCC GCCCTGATCG CCGGCAAGCA GAAGCTGACG
GATGCCGACT GGGTCAAGCC CTATGAAGAG CTTGCCAAGT GGAAGCCCTA TCTCGGCGAC
GGCTTCGAAG CCCAGACCTA TTCGGACAGC CAGAACCTGT TCACGCTCGG TCGTGCCGCG
ATCTATCCGG CCGGCTCGTG GGAAATCTCG CTGTTCAACA GCCAGGCCCG GTTCAAGATG
GGCGCCTTCC CGCCGCCGGT TCCGAAGGCC GGCGACACGG GCTACATCTC CGACCATCCG
GATATCGGTG TCGCTCTGAA CACCAAGAGC ACGCATGCCG AGGAAGCCAA GAAGTTCCTC
AGCTGGGTCG CTTCGCCTGA GTTCGCCGAT ATCTACGCCA ACGCGCTGCC CGGCTTCTTC
AGCCTGAACT CCAACCCGGT CAAGATGTCC GATCCGCTCG CTCAGGAGTT CGTTTCCTGG
CGCGGCCCGT ACAAGTCGAC GGTGCGCTCG ACTTACCAGA TCCTGTCGCG CGGCACGCCG
AACCTCGAAA ACGAGACCTG GGTCGAATCG GCCAATGTGA TCAACGGCAC GGATACGCCT
GCGGTTGCTG CCGAAAAGCT CCAGAAGGGC CTCGACAGCT GGTACAAGCC GGCCAAGTGA
 
Protein sequence
MKNTALKSLL LASSLLSSAG LVHAADVTLT VESWRNDDLQ IWQEKIIPAF EAKNPGIKIV 
FSPTAPTEYN ASLNAKLDAG SAGDIITCRP FDASLELFNK KQLTDITSLP GMENFSPVAK
AAWSTDDGKS TFCVPMASVI HGFIYNKDAF DKLGISVPKT QDEFYAALDK IKADGTYIPL
AMGTKDLWEA ATMGYQNIGP NYWKGEDGRA ALIAGKQKLT DADWVKPYEE LAKWKPYLGD
GFEAQTYSDS QNLFTLGRAA IYPAGSWEIS LFNSQARFKM GAFPPPVPKA GDTGYISDHP
DIGVALNTKS THAEEAKKFL SWVASPEFAD IYANALPGFF SLNSNPVKMS DPLAQEFVSW
RGPYKSTVRS TYQILSRGTP NLENETWVES ANVINGTDTP AVAAEKLQKG LDSWYKPAK