Gene Rleg2_4758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4758 
Symbol 
ID6977852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp390207 
End bp391460 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content59% 
IMG OID643393925 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278743 
Protein GI209546825 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAA AATTCTTATC ACTGCTTGCC TCCGCCGCAC TGGCTGTCGC ACTTCCTGCC 
GCCGCCCAAG ACAAGCCGCT CGCCGGCAAG TCGATCACCG TATTGATGCC ATCCCCGCAG
GGGCCGGACA TCGCGTCCGC GTTTGAAGCC GAGACCGGCA TTCATGTCGA TCTCCAGACA
CTTTCCTGGG ACGACATCCG CCCGAAGTTG GTGACGGCGC TTGTCGCCGG CACCGCGCCT
GCTGACGTAA CCGAATTCGA CTGGTCCTGG ACCGGTCAGT TCGCGGCGGC AGACTGGTAT
ATGCCGCTGA ACGATTCTTT CGATGCGGAT ACACTCAAGG ACATCAGCGT TGCCAAGATC
TTCACCGTCG ATGGCAAGCT GCTGGGCATA CCCTACACCA ACGACTTTCG GGTGATGCTC
GTTAACAAGA AGCACTTCGC CGATGCCGGC ATAACCGAGA TGCCGAAGAC ACTTGAACAG
CTTGAAGCTG CCGCAAAGCA GATTAAGGAG AAAGGCGTCG CCACCTATCC GATCGGTCTG
CCGCTGTCGG CCACGGAAGG GGCTTCCACA AGCTGGTATC TCCTGACCAA GGCATTCGGA
GGCGAGCTGT TCGACAAGGA CTTCAACCCA CTCTTCACCA AGCCCGATTC CGCCGGCTAC
AAGGCGCTCG CCTTCGAACT GAAGCTGCTC AAGGAAGGTC TTGTTGATCC CGCGTCGACC
GGCCTCAAGG ACAGCCAGAT CAACGAAGGC ATGTTCTCCC AGGGCCTGAC GAGCATCATG
ATTTCGGGCG AACCGGGCCG TCTCGGTCAG ATGAACGATC CCAAACAGTC AAAGGTTGCC
GGCCAGGTCG AGGCGATCCT GGTTCCGACC GAAAGCGGCC AGACGCGCAG CTTCGGTCTA
CCGGAGGCCC TGGCGATTCC GAACGTCTCG TCCAACAAGG AAGCGGCCGT CGCCTTTGTC
AAATGGTTTA CGAGCCGCGA GTTCCAGAAG AAGAACGTCG CCAATGGCTT CCTTCCGACC
AGGACATCCG CCTTGTCTGA ACTAAATTCG GAAGGAAAGC TGAACAGCGG CGATGCGCTC
GTGGCGCAGT CGAAGACCGT TGAAGCGCTC TTTCCGCAGG GCACGCCCCC ATGGTACCCA
CAATTCTCGA GCGGCGTGAA CACCGCGATT AACAGCGCTG CCAAGGATCA GATGACGGTT
GACCAGGCGG TCGAGAGCAT TGCCTCTGCA GCAAAGCAGG CGATGGCACA ATGA
 
Protein sequence
MRKKFLSLLA SAALAVALPA AAQDKPLAGK SITVLMPSPQ GPDIASAFEA ETGIHVDLQT 
LSWDDIRPKL VTALVAGTAP ADVTEFDWSW TGQFAAADWY MPLNDSFDAD TLKDISVAKI
FTVDGKLLGI PYTNDFRVML VNKKHFADAG ITEMPKTLEQ LEAAAKQIKE KGVATYPIGL
PLSATEGAST SWYLLTKAFG GELFDKDFNP LFTKPDSAGY KALAFELKLL KEGLVDPAST
GLKDSQINEG MFSQGLTSIM ISGEPGRLGQ MNDPKQSKVA GQVEAILVPT ESGQTRSFGL
PEALAIPNVS SNKEAAVAFV KWFTSREFQK KNVANGFLPT RTSALSELNS EGKLNSGDAL
VAQSKTVEAL FPQGTPPWYP QFSSGVNTAI NSAAKDQMTV DQAVESIASA AKQAMAQ