Gene Rleg_4863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4863 
Symbol 
ID8007251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp243972 
End bp245450 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content60% 
IMG OID644821793 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002973053 
Protein GI241113218 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.426159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGGA TTTCACTCGG CGGTGCCGCC GCGCGCGCAA CTGTATCCAC TGCTTTGATG 
CTATCGTTGA TGTCCGCGTC TGCACTGGGC GCGCCGGTCG ACCTGAGCAA GTGGTCGCCG
GAATATGTGC GCTCCATTGC CGGCACACAG GACTTTGACA CGGCGGGCGA TTGCGCCAAG
GTCACCCCCC TCGACTACAA GGGGCGACTG ACTTTCTGGT ATCAGGGCGT GTTCGAGGGT
GACCCCGACC TCCTGCGCCA GTATTACAAG GAGTTCTTCG AGACCTTCCG CAAGACCTAT
CCAAACATCC AGCTTGAGGA ACAAGCCCTC ACCTATAACG ACCTGCTGGA CAAGTTCCGC
ACCGCGCTCC TTGGCAATGC AGCGCCAATG GCGGTGCGTC TGCAAATCCT GGGCGGCACC
GAGTTCGCCT CGAAGGGCTA TCTGGAACCC CTCAAACCAG AGGACGTAGG GTATTCGACC
GACGACTTCT GGCCCGGTGC AATGAAGGCC GTAACCTGGG AGGGGGTGAC CTACGGCATC
CCGACCAACA ACGAGACGAT GGCGTTCATC TGGAACGCCG ACGTCTTCAA GCGTGCAGGC
CTCGATCCGG AAAAGGCTCC GGCAACCTGG GACGACGTCG TCAAATATTC CAAGCAGATC
CACGACAAGC TCGGCATTGC CGGTTACGGC CTCGTGGCGC GCAAGAACGC CGGCAATACG
CCGTATCGCT TCATGCCGCA GCTGTGGGCC TATGGCGGCG GCGTTTTCGA CGAAGCCACC
GCCAATCCGA CCTACAAGCA GGTCCAGCTC GACAGCCCGC AGAGCAAAGC GGCATTGCAA
GCCTCCTACG ATATGTATGT CCGCGACAAA TCGGTTCCGG TTTCGGCGCT CACCAACCAG
CAGGCGGATA ACCAGCCCCT CTTCCTCGCT GGCCAGCTCG GCATGATGGT CTCGCACCCC
TCCGACTACA ACGTCATGCT CGACCTGCAG AAAAAGACGA CGGGCGGCGA CAAGGACAAA
GCGCAGACCG TCATCGACAA TATGCGCTAC GGCCTGATTC CGACTGGCCC CGACGGCAAG
CGTGCCGTCG TGTTTGGCGG CTCGAACATT CACATCCTGA AGCCCGAATA TGTCGAGGGC
GGCAAGGTCG ACGAGCCGGC TGCAAAGGCT ATCAGCTGCA TGTGGGCAAG CCCCGAATGG
TCGCTGAAAA TGGCCTATGC CGGCTCGAAC CCGGGAAACC TTAACGGCTT CAAGACCAAA
TGGATGAAGG AACGCCTGGA CAGTATAAAG TTCCTTGATG TCACGACTTC GATGCTGCCA
TACGGCATCC CGTTTCCGGC GCTGCCCCAG TCCCCCGAGA TCATGAACAT CATCGTCCCG
GACATGCTGC AGAATGCCCT CACCGGAGCC ATGACTGTCG ACCAAGCAGC GGACGACGCA
GCCAAGAAGG TCAAAGACCT AACGGATGGC GGACTCTAG
 
Protein sequence
MTRISLGGAA ARATVSTALM LSLMSASALG APVDLSKWSP EYVRSIAGTQ DFDTAGDCAK 
VTPLDYKGRL TFWYQGVFEG DPDLLRQYYK EFFETFRKTY PNIQLEEQAL TYNDLLDKFR
TALLGNAAPM AVRLQILGGT EFASKGYLEP LKPEDVGYST DDFWPGAMKA VTWEGVTYGI
PTNNETMAFI WNADVFKRAG LDPEKAPATW DDVVKYSKQI HDKLGIAGYG LVARKNAGNT
PYRFMPQLWA YGGGVFDEAT ANPTYKQVQL DSPQSKAALQ ASYDMYVRDK SVPVSALTNQ
QADNQPLFLA GQLGMMVSHP SDYNVMLDLQ KKTTGGDKDK AQTVIDNMRY GLIPTGPDGK
RAVVFGGSNI HILKPEYVEG GKVDEPAAKA ISCMWASPEW SLKMAYAGSN PGNLNGFKTK
WMKERLDSIK FLDVTTSMLP YGIPFPALPQ SPEIMNIIVP DMLQNALTGA MTVDQAADDA
AKKVKDLTDG GL