Gene Rleg_6631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6631 
Symbol 
ID8022881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp59912 
End bp61144 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content60% 
IMG OID644833498 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002984632 
Protein GI241666548 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.325237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCATAT CAATCAAGAC AGGCCTTATG GCGCTCGCCC TGCTCGGTTC GACGGCACTG 
ACGGCCGTCA CTGCCCAGGC AGCCGACAAG GAAATCAGCT GGATCTATTG CGGCGACACG
ATCGACCCGG TCCACACCAA ATACATCAAG CAGTGGGAAG AAAAGAACAC GGGCTGGAAG
ATTGCCCCTG AGGTCGTCGG ATGGGCACAG TGCCAGGACA AGGCAACGAC GCTCGCTGCC
GCCGGTACGC CGGTGGCGAT GGCCTATGTC GGCTCGCGCA CGCTGAAGGA ATTCGCGCAG
AACGACCTTA TCGTTCCTGT GCCGATGACG GACGACGAGA AGAAGACCTA CTATCCGAAC
ATCGTCAACA CCGTGACCTT CGAGGGCTCA CAGTGGGGCG TTCCGATCGC CTTCTCTACC
AAGGCGCTCT ATTGGAACAA GGATCTCTTC AAGCAGGCCG GCCTCGATCC CGAGACGCCG
CCGAAGACCT GGGCTGAAGA AATCGAGATG GCAAAGACCA TCAAGGAAAA GACCGGCATT
CCGGGCTTCG GTCTCTCCGC CAAGACCTTC GACAACACGA TGCACCAGTT CATGCATTGG
GTTTACACCA ACAACGGCAC GGTGATCGAT GCCGACGGCA AAGTTACGCT CGACAGCCCG
CAGATACTCG CCGCGCTAAA GGCCTACAAG GATATCGTCC CCTACTCCGA AGAAGGCCCG
ACGGCCTACG AGCAGAACGA AGTCCGCGCC ATCTTCCTCG ACGGCAAGGT GGCGATGATC
CAGGCAGGAT CGGGTGCAGC CGACCGCCTG AAGGCGACGA AGATCAGCTG GGGCATCACG
ACGCTGCCGC TCGGTCCCGA CGCCAAGGGT CCCGGCACGC TGCTGATCAC CGACAGCCTG
GCGATCTTCA AGGGTTCGGG GGTCGAGGAC AAGGCGACGG AATTCGCCAA GTTCATCACC
TCGCCCGATG TGCAGTCCGA ATACGAATTG CAGGGCGGCG CCGGCCTCAC CCCGCTGCGG
CCGTCTGCAA AGGTCGATGA ATTCGTCGCC AAGGATCCCC ATTGGAAGCC GCTCATCGAC
GGCATCAGCT ACGGTGGTCC CGAGCCGCTC TTCACCGACT ACAAGGGCTT CCAGAACTCG
ATGATCGAGA TGGTACAGTC CGTGGTGACG GGCAAGGCCG AGCCGGAGGC TGCTCTCAAG
AAGGCTGCCG GCGAAGTCGA GGCGTTCAAG TAA
 
Protein sequence
MSISIKTGLM ALALLGSTAL TAVTAQAADK EISWIYCGDT IDPVHTKYIK QWEEKNTGWK 
IAPEVVGWAQ CQDKATTLAA AGTPVAMAYV GSRTLKEFAQ NDLIVPVPMT DDEKKTYYPN
IVNTVTFEGS QWGVPIAFST KALYWNKDLF KQAGLDPETP PKTWAEEIEM AKTIKEKTGI
PGFGLSAKTF DNTMHQFMHW VYTNNGTVID ADGKVTLDSP QILAALKAYK DIVPYSEEGP
TAYEQNEVRA IFLDGKVAMI QAGSGAADRL KATKISWGIT TLPLGPDAKG PGTLLITDSL
AIFKGSGVED KATEFAKFIT SPDVQSEYEL QGGAGLTPLR PSAKVDEFVA KDPHWKPLID
GISYGGPEPL FTDYKGFQNS MIEMVQSVVT GKAEPEAALK KAAGEVEAFK