Gene Rleg_2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2286 
Symbol 
ID8013285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2293753 
End bp2295207 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content59% 
IMG OID644824871 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002976101 
Protein GI241205005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.322369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAGAT TGCTATTGAG TTCAACGGCC GCAGGACTAC TTGCTGCGGC GGGCGTCACA 
TCCGCGCTCG CGTGCGAACC GGACTACACC GGTGTCACGC TCACCGCTAC GACGCAGACA
GGCCCCTATA TCGCCTCTGC GCTACAACTC GCGGCCAAGG GCTGGGAAGA AAAGACCTGC
GGCAAGATGA ATGTCGTCGA ATTTCCGTGG TCGGAACTCT ATCCGAAAAT CGTAACCTCG
TTGACCTCGG GCGAAGACAC GTTCGACGTG GTCGCCTTTG CGCCGGCCTG GGCACCGGAC
TTCACCGATT TTCTCTCGGA AATGCCCAAG GCGATGCAAT CAGGTGCCGA CTGGGAGGAC
ATCGCGCCGG TTTACCGCGA GCAACTGATG GTTTGGAACG GCAAGGTCCT GTCGCAGACC
ATGGACGGTG ACGCCCATAC CTATACCTAC CGCATTGATC TGTTTGAAAA CGCGGAAAAC
CAGAGCGCCT TCAACGCGAA GTATGGCTAC GATCTGGCCC CACCGAAGAC ATGGAAGCAG
TATCTCGACA TCGCTGAATT CTTCCAGCAG CCGGACAAGG GCCTTTGGGG CACGGCGGAA
GCCTTCCGCC GTGGTGGCCA GCAATTCTGG TTCCTGTTCA GTCACGTGGC GGGATACACC
AGCCATCCCG ACAATCCCGG CGGCATGTTC TTCGATCCTG ACACGATGGA TGCGCAGGTC
AACAATCCAG GCTGGGTGCG CGGCCTGGAG GAATATATTC GCGCCTCCAA ACTGGCACCG
CCAAATGCGC TGAACTTCTC GTTCGGCGAA GTGAACGCAG CCTTTGCCGG TGGCCAGGTC
GCGGAATCGA TCGGCTGGGG CGATACCGGC GTCATCGCCG CCGACCCGAA GCAGTCCAAG
GTTGCTGGCA ATGTCGGTTC GGCATCGCTG CCGGGATCCG ACGAGATCTG GAACTACAAG
ACCAAAAAGT GGGACAAGCA GCCCGAGGTC GTCCAGACTT CCTTCATGGC CTTCGGCGGT
TGGCAGGCAG CCGTACCGTC GTCCTCCAAG AACCAGGAGG CCGCTTGGAA CTATATCCAG
TTCCTGACGA GCCCGGCGGT TTCCGGTCAG GCGGCGATTA CCGGCGGCAC AGGCGTCAAT
CCATACCGTC TTTCGCACAC GACGAATACA GCGTTGTGGT CGAAGATCTT TTCCGAGCGT
GAGGCCAAGG AATATCTTGG AAGCCAGAAG GACGCGGTGA CCGCCAAGAA CACGGCGCTC
GACATGCGCC TGCCGGGCTA TTTCTCCTAT ACGGAAATTC TCGAAATCGA GCTTTCCAAG
GCATTGGCTG GAGAGGTGAC GCCGCAGCAG GCGCTGGATA CCGTGGCTGC CGGATGGAAC
AAGCTGACGG ACGAGTTCGG CCGCGACAAG CAACTGGCAG CCTATCGTTC GTCGATGGGC
CTGCCTGCGA AGTAA
 
Protein sequence
MRRLLLSSTA AGLLAAAGVT SALACEPDYT GVTLTATTQT GPYIASALQL AAKGWEEKTC 
GKMNVVEFPW SELYPKIVTS LTSGEDTFDV VAFAPAWAPD FTDFLSEMPK AMQSGADWED
IAPVYREQLM VWNGKVLSQT MDGDAHTYTY RIDLFENAEN QSAFNAKYGY DLAPPKTWKQ
YLDIAEFFQQ PDKGLWGTAE AFRRGGQQFW FLFSHVAGYT SHPDNPGGMF FDPDTMDAQV
NNPGWVRGLE EYIRASKLAP PNALNFSFGE VNAAFAGGQV AESIGWGDTG VIAADPKQSK
VAGNVGSASL PGSDEIWNYK TKKWDKQPEV VQTSFMAFGG WQAAVPSSSK NQEAAWNYIQ
FLTSPAVSGQ AAITGGTGVN PYRLSHTTNT ALWSKIFSER EAKEYLGSQK DAVTAKNTAL
DMRLPGYFSY TEILEIELSK ALAGEVTPQQ ALDTVAAGWN KLTDEFGRDK QLAAYRSSMG
LPAK