Gene Rleg_5385 details

Gene Information

Locus tag: Rleg_5385
Symbol:
ID: 8007343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 797285
End bp: 798559
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 63%
IMG OID: 644822289
Product: extracellular solute-binding protein family 1
Protein accession: YP_002973549
Protein GI: 241113714
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
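
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 797285
end_bp = 798559

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon
# and therefore does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")     # 1275 bp
print(protein_length, "aa")  # 424 aa
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields.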


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.832628
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATA TCATGTCATT TGGAATTCTG GCTTCCACGG TGCTGGCCTT CGCTTCGCCG 
GTGCTGGCCC AGACCGTTTT CGTATCCACC CAGCTTCGTC CGATCGAGGA GGCGACCGTC
GTTCGCGAGG AACTCCTGAA GGACGTCGGC TCGGTTGACT ACGTGGTCGA GGAGCCGCCG
CAGTTTGCGG TGCGCATGGA AGCCGAGCGT CAGGCCGGCA AGCACACCGT CAGCCTCGTC
GGTGCACTGC ATGGCGAGCT CTCGCCGCTC GCCGACAAGG ACACGCTCGA GCCGCTTGAC
GATCTCGCCA AGAAGTTGGC AGCGAGCGGC ATGCCGCAGT CATTGCTCGA CCTCGGCAAG
CTCGGCAAGT CGACCCAGCA GTACATCCCG TGGATGCAGG CGACCTATGT GATGGCCGCC
AAGAAGGAAG CGCTGCAATA CCTGCCTGCC GGCGCCGACG TGAACGCGCT GAACTACGAT
CAGTTGATCG AGTGGGGCAA GAATATGCAG GACGCAACCG GCCAGCCGCA GATCGGCTTT
CCCGCCGGCC CGAAAGGCCT GATGGCGCGT TATTTCCAGG GCTATTTCTA TCCCTCCTTC
ACAGGCGGCG TCGTGCGTAC TTTCCAGAGT GCCGATGCGG CCGCCGGCTG GGAGAAGCTG
AAGGTGCTGT GGGCCTATGT GACACCGAAC TCGACCAGCT ACGACTTCAT GCAGGAGCCG
CTCGCGGCCG GCGAAGTCAT GGTTGCCTGG GACCACATCG CCCGGCTGAA GAATGCTATT
TCCGCAGCAC CGGATGATTA TGTGGTCTTC CCCGCTCCCG CCGGTCCGAA GGGCCGCGGC
TATATGCCGG TCGTTGCCGG TCTCGCCATT CCGAAGGGCG CGCCTGACAA GGCCGGCGCA
GAAAAAATCA TCGAGCACCT GTCCATGCCG GACACGCAGC TCCTGACCGC CTCCAAGGTC
GGCTTCTTCC CGACCCTCAA CGTCAAGCTG CCGCCGGATC TCGATGCCGG CGTCGCCCTG
CTCGCCGGTG CCGTCACCGC CACCCAGGCC TCCAAGGACG CGGTCATCTC GCTGCTCCCG
GTCGGCCTCG GCGACAAGGG CGGCGAGTTC AACAAGGTCT ACATGGACAG TTTCCAGCGC
ATCGTACTGC AGAACGAGCC CGTCGCAGAC GTCCTGAAGG CCCAAGGCGC GACAATGGCC
AAGCTGATGG CCGATACGAA AGCTGCCTGC TGGGCGCCCG ATGCCAAGAG CGACGGCCCC
TGCCCAGTCG AATAA
 
Protein sequence
MKHIMSFGIL ASTVLAFASP VLAQTVFVST QLRPIEEATV VREELLKDVG SVDYVVEEPP 
QFAVRMEAER QAGKHTVSLV GALHGELSPL ADKDTLEPLD DLAKKLAASG MPQSLLDLGK
LGKSTQQYIP WMQATYVMAA KKEALQYLPA GADVNALNYD QLIEWGKNMQ DATGQPQIGF
PAGPKGLMAR YFQGYFYPSF TGGVVRTFQS ADAAAGWEKL KVLWAYVTPN STSYDFMQEP
LAAGEVMVAW DHIARLKNAI SAAPDDYVVF PAPAGPKGRG YMPVVAGLAI PKGAPDKAGA
EKIIEHLSMP DTQLLTASKV GFFPTLNVKL PPDLDAGVAL LAGAVTATQA SKDAVISLLP
VGLGDKGGEF NKVYMDSFQR IVLQNEPVAD VLKAQGATMA KLMADTKAAC WAPDAKSDGP
CPVE
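
The gene and protein sequences can be related by codon-wise translation. The following is a minimal sketch, not the database's own method: it assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and it uses the first 30 bases and the last 15 bases of the gene sequence as examples. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G with the third
# position varying fastest. Assumed here to match table 11's assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 15 bases of the gene sequence in this record.
first_block = "ATGAAACATA TCATGTCATT TGGAATTCTG"
last_block = "TGCCCAGTCG AATAA"

print(translate(first_block))  # MKHIMSFGIL
print(translate(last_block))   # CPVE
```

Both fragments match the corresponding ends of the protein sequence listed in this record; passing the full gene sequence to `translate` would be the natural next check.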