Gene Rleg2_6542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6542 
Symbol 
ID6983612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp215107 
End bp216453 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content61% 
IMG OID643399538 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002284294 
Protein GI209552379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.162087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTT TGCCGACACT GAAATCCCTC ACTATTGCTG CCGCCATCCT GGCCTCGAGC 
TCTGCAATCG TGCTTGCCAA GGACGTTCAC ATCAGCGTCT GGGCCGGCGG CACCGGCCCG
AACGACGCCT ATCGCCTCGA CGCCATCGAA ATTGCAGCCC AGCAGCTGCA GCGCGAAGCC
GCCCTCAAGG GCGAAGACCT GAAGATCACC GTCGAGAAGA AGCCCTATTC CGCCTGGGAA
GACTTCAAGC AGGCGCTGAC CCTTGCCGCG GAAGCCAAGA CCGCCCCGAA CATCGTCGTC
AGCGGCCACG AAGACATCGC CCCCTGGTCG CAGGCCGGCC TCATCGTCCC GATCGAGGAT
TACGTCGATC TCGACTCCTG GCCGCTCAGC GACATCTACG AAAACCTGCT GAAGATCGCC
TCCTACAACG GCACCGTCTA CGGCATTCCG CAGGATGCCG AATCCCGCCC GATGTTCTTC
TGGAAGCCTT ATATGAAGGC GATCGGCTAC AGCGACGCCG ATCTGGATGC GCTGCCGCAG
AGTGTCCAGG ACGGCAAGTA CACCATGAAA AACCTGCTCG AAGACGCCAA GAAGATGCAG
GACAAGGGCC TCGTTCAGCC CGGTTACGGT TTCTATCCGC GCACCAGCAA CGGTCCCGAT
TATTGGCAGT TCTACACCAG CTTCGGCGGT ACGATGGAAG AAGGCGGCAA GCTCGTCTTC
GACAAGGCGG CGATGGCCCG CACCTATCAG TTCTTCGCCG ACGCCGTTAA ATCAGGCGTC
ACCAAGAAGA ACCACATCGG CATGCCTGGT GATCAGTGGT GGAAGGAAGT CGCCACCGGC
AAGGCAGGCA TCTGGGACGG CGGCACCTGG CATTATGCCC GCCTCGTCAA CCAGGAAGGC
CTCAAGGACT TCTTCGGCAA CGTGATCTTC ACGCTGATCC CCGCCGGCGA AGGCGGCAAG
GCCAACACGC TGACCCATCC GCTCGTCTAC CTCTTGACCG CAGGTCACGA TCAGGAAGAC
ACCGAGATCG CCGCCCAGCT GGTCAAGATC GCCTCCGAGC CGCGCACCAA CGCGCTGCAT
GCGGTCAAAT CGGCCCATCT CGGCATCTCC AAGTCGGAAG CCACCGTCGA CTTCTACTCG
GCCGACCGCT GGACCCGCGA AGCCACCGAG CGCCTGCTGC CGCATGCCAA TGCAATGCCG
AACAATTCCG ATTTCGGCAC CTATTGGAAC ATCATGTGGA AGAACCTCGA AGCCTCCTGG
ACCGGCGCCA AGACCGTCGA CGCCGCCATC GGTGATGCCG AGAGCGAGCT GAAGAGCACG
CTCGGCGACA AGATCGTCAT CCGCTGA
 
Protein sequence
MTILPTLKSL TIAAAILASS SAIVLAKDVH ISVWAGGTGP NDAYRLDAIE IAAQQLQREA 
ALKGEDLKIT VEKKPYSAWE DFKQALTLAA EAKTAPNIVV SGHEDIAPWS QAGLIVPIED
YVDLDSWPLS DIYENLLKIA SYNGTVYGIP QDAESRPMFF WKPYMKAIGY SDADLDALPQ
SVQDGKYTMK NLLEDAKKMQ DKGLVQPGYG FYPRTSNGPD YWQFYTSFGG TMEEGGKLVF
DKAAMARTYQ FFADAVKSGV TKKNHIGMPG DQWWKEVATG KAGIWDGGTW HYARLVNQEG
LKDFFGNVIF TLIPAGEGGK ANTLTHPLVY LLTAGHDQED TEIAAQLVKI ASEPRTNALH
AVKSAHLGIS KSEATVDFYS ADRWTREATE RLLPHANAMP NNSDFGTYWN IMWKNLEASW
TGAKTVDAAI GDAESELKST LGDKIVIR