Gene Rleg_6532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6532 
Symbol 
ID8017055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp249924 
End bp251207 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content61% 
IMG OID644828319 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002979519 
Protein GI241554306 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0178193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACATC GGATTAAAGT CATTCTTGCG GGCGCGTCCG CGCTTTTGGC ATTGGTCGCG 
GCCGGCCCGT CGCATGCAGA AACCACGCTG TCATTCCTGA TCGACAACAA TCCGGACACG
GTCGCTGCCG CCGAGGCGCT GGTTGCCGCC TATCAGACAA AGGTGCCCGA CGTGACGATC
GAAATCGAGC CACGGGCCGG TGGCGGCGAG GGTGACAACA TCATCAAGAC GCGCCTAGCG
ACCGGCGAGA TGTCCGATGT GTTTCTCTAT AATTCCGGCT CGCTGCTCCA GGCGCTGAAA
CCCGCGCAGA CGCTTGTCGA TCTGAGCGGT CTTGCGTCGC AGGCGAAGGT GGACGAAGGT
TTCAAGTCGG TCGTCCGCGC CGACGGCAAG CTCTACGGTG TTCCCTTCGG CACGGCGATG
GCGGGCGGCA TCCTCTACAA CAGGAAAATC TACCAGGACC TCGGCCTGTC AGTCCCGAAG
ACATGGGCGG ACTTCATGGC GAACAACGCA AAGGTCAAGG CATCAGGCAA GGTCGCCGTG
GCGCAGACCT ATCGCGATAC CTGGACCTCG CAGTTATTCG TTCTGGCGGA CTATTACAAC
CTGCATGCCG CCGTGCCGAA CTTCGCCGCC GACTACACTG CCAACAAGGC GAAATACGCA
GAGACGCCGG CGGCCATGAA AGGCTTCGAA CGGCTGAAGG ACGTTCACGA CGCTGGCCTG
ATGAACGAGG ATTTCGGCGC AGCAAGCTAT GACGATGGCC TGAGAATGGT GGCAACGGGC
GAGGCAGCGC ACTATCCAAT GCTGAGCTTC GCAATCGGCG CGCTCAAGCA GAATTATCCC
GACAACCTCG CAGATGTCGG TTTCTTCGCG CAGCCGAGCG ACGACGCGGC GACGAATGGC
CTGACGGTCT GGATGCCGCC CGCCCTCTAC ATTCCCCTTA CCAGTCAGCA CGCAGAGGAA
GCGCAGAAAT TCGTGGATTT CGCAGGAAGC GTCGAAGGCT GCAAGATCAT GGTGGAAACC
AACACCGTGC AAGGGCCTCC CTTGATCGAC GGCTGCGGTC TGCCTGCCGA CGTGCCGCCT
GCGGTGAAGG ACATGCTTCC CTATTTCGAG GCCAAGGACA AGACGACCCC GGCGCTGGAA
TTTGTTTCGC CGGTCAAGGG GCCGGCTCTT GAGCAGATCA CCGTCGAAGT CGGCTCCGGT
ATTCGCCAGC CGGCCGACGC GGCAAAACTC TATGACGACG ATGTGCGCAA ACAGGCCAAG
CAACTCGGCC TGCCCAACTG GTAG
 
Protein sequence
MTHRIKVILA GASALLALVA AGPSHAETTL SFLIDNNPDT VAAAEALVAA YQTKVPDVTI 
EIEPRAGGGE GDNIIKTRLA TGEMSDVFLY NSGSLLQALK PAQTLVDLSG LASQAKVDEG
FKSVVRADGK LYGVPFGTAM AGGILYNRKI YQDLGLSVPK TWADFMANNA KVKASGKVAV
AQTYRDTWTS QLFVLADYYN LHAAVPNFAA DYTANKAKYA ETPAAMKGFE RLKDVHDAGL
MNEDFGAASY DDGLRMVATG EAAHYPMLSF AIGALKQNYP DNLADVGFFA QPSDDAATNG
LTVWMPPALY IPLTSQHAEE AQKFVDFAGS VEGCKIMVET NTVQGPPLID GCGLPADVPP
AVKDMLPYFE AKDKTTPALE FVSPVKGPAL EQITVEVGSG IRQPADAAKL YDDDVRKQAK
QLGLPNW