Gene Rleg2_2646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2646 
Symbol 
ID6981389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2692874 
End bp2694133 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content61% 
IMG OID643397358 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002282143 
Protein GI209550226 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0628324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA CTGCTCTGAA AAGCTTCCTG CTTGCCTCCA GCTTGCTGAC ATCGGCAGGT 
CTCGTCCATG CCGCCGACGT CACGCTGACT GTGGAAAGCT GGCGTAACGA CGACCTGCAG
ATCTGGCAGG AGAAGATCAT CCCGGCTTTC GAAGCCAAGA ACCCGGGCAT CAAGATCGTC
TTCTCGCCGA CCGCGCCGAC CGAATACAAC GCGTCGCTGA ACGCCAAGCT GGATGCCGGT
TCCGCAGGCG ATATCATCAC CTGCCGTCCG TTCGACGCCT CGCTCGAACT CTTCAACAAG
AAGCAACTCG TCGACATCAC CAGCCTGCCC GGTATGGAGA ACTTCTCGCC GGTCGCCAAG
GCCGCCTGGT CGACCGACGA CGGCAAGTCG ACCTTCTGCG TGCCGATGGC TTCGGTCATC
CACGGTTTCA TCTACAACAA GGATGCCTTC GACAAGCTCG GCATCTCCGT GCCGAAGACG
CAGGACGAAT TCTACGCGGC GCTCGACAAG ATCAAGGCCG ACGGCACCTA TATCCCGCTC
GCCATGGGCA CGAAGGACCT CTGGGAAGCC GCAACCATGG GCTACCAGAA CATCGGCCCG
AATTACTGGA AGGGCGAGGA CGGCCGCGAC GCCCTGATCG CCGGCAAGCA GAAGCTGACC
GATGCCGACT GGGTCAAGCC CTATGAAGAG CTTGCCAAGT GGAAGCCCTA TCTCGGCGAC
GGTTTCGAAG CCCAGACCTA TTCGGACAGT CAAAACCTCT TCACCCTCGG ACGCGCCGCC
ATCTATCCGG CCGGTTCCTG GGAAATCGCG CTTTTCAACA CGCAGGCGCA GTTCAAGATG
GGCGCCTTCC CGCCGCCGGT TCCGAAGGCC GGCGACCAGG GCTACATCTC CGACCATCCG
GATATCGGCG TCGCCCTGAA TGCCAAGAGC AAGCATGCCG AGGAAGCCAA GAAATTCCTC
AGCTGGGTCG CTTCGCCCGA GTTCGCCGAC ATTTACGCCA ACTCCCTGCC GGGCTTCTTC
AGCCTGAACT CCAACCCCGT CAAGATGTCC GATCCGCTTG CTCAGGAATT CGTTTCCTGG
CGCGGCCCGT ACAAGTCGAC CGTGCGCTCG ACCTACCAGA TCCTGTCGCG CGGCACGCCG
AACCTCGAAA ACGAGACCTG GGTCGAATCG GCCAACGTGA TCAACGGCAC GGATACGCCG
AAGGTCGCTG CCGAGAAGCT GCAGAAGGGC CTCGACAGCT GGTACAAGCC GGCCAAGTGA
 
Protein sequence
MKNTALKSFL LASSLLTSAG LVHAADVTLT VESWRNDDLQ IWQEKIIPAF EAKNPGIKIV 
FSPTAPTEYN ASLNAKLDAG SAGDIITCRP FDASLELFNK KQLVDITSLP GMENFSPVAK
AAWSTDDGKS TFCVPMASVI HGFIYNKDAF DKLGISVPKT QDEFYAALDK IKADGTYIPL
AMGTKDLWEA ATMGYQNIGP NYWKGEDGRD ALIAGKQKLT DADWVKPYEE LAKWKPYLGD
GFEAQTYSDS QNLFTLGRAA IYPAGSWEIA LFNTQAQFKM GAFPPPVPKA GDQGYISDHP
DIGVALNAKS KHAEEAKKFL SWVASPEFAD IYANSLPGFF SLNSNPVKMS DPLAQEFVSW
RGPYKSTVRS TYQILSRGTP NLENETWVES ANVINGTDTP KVAAEKLQKG LDSWYKPAK