Gene Rleg2_5687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5687 
Symbol 
ID6977078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp85758 
End bp86963 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content61% 
IMG OID643393144 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002277962 
Protein GI209546072 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.627079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCG CCTTGCTCGG TTCGACGGCA ATGACCACGG TCACTGCCCA TGCCGCCGAC 
AAGGAAATCA GCTGGATCTA TTGCGGCGAC ACGATCGACC CGGTCCACAC CAAATACATC
AAGCAGTGGG AAGAAAAGAA CACTGGCTGG AAGATCACCC CTGAGGTCGT CGGCTGGGCG
CAGTGCCAGG ACAAGGCGAC GACCCTTGCC GCCGCCGGCA CGCCGGTTGC CATGGCCTAT
GTCGGCTCGC GCACGCTGAA GGAATTCGCG CAGAACGATC TTATCGTTCC GGTGCCGATG
ACCGATGACG AAAAGAAGAC CTACTACCCC CACATCGTCG ACACGGTAAC CTTCGAGGGC
AACCAGTGGG GCGTTCCGAT CGCCTTCTCC ACCAAGGCGC TTTACTGGAA CAAGGATCTC
TTCAAGCAGG CGGGCCTCGA TCCCGAGAAG CCGCCGAAGA CCTGGGCCGA AGAAATCGAG
ATGGCAAAGA CCATCAAGGA AAAGACCGGC ATTCCGGGCT TCGGTCTCTC CGCCAAGACC
TTCGACAACA CCATGCACCA GTTCATGCAT TGGGTTTACA CCAACAACGG CAGCGTGATC
GGTGCCGACG GCAAGGTCAC GCTCGACAGC CCGCAGATTC TCGCCGCGCT GAAAGCCTAT
AAGGACATTG TCCCCTACTC CGAAGAAGGC CCGACGGCCT ATGAGCAGAA CGAAGTCCGC
GCCATCTTCC TCGACGGCAA GGTGGCGATG ATCCAGGCCG GCTCGGGTGC GGCCGACCGT
CTGAAGAAGA CGCAGATCAG CTGGGGCATC ACGACGCTGC CGCTCGGCCC CGACGCCAAG
GGTCCCGGCA CGCTGCTGAT CACCGACAGC CTGGCGATCT TCAAGGGTTC GGGCGTCGAG
GACAAGGCGA CGGAGTTCGC CAAGTTCATC ACCTCGCCGG ATGTGCAGTC GGAATATGAG
CTGCAGGGCG GCGCCGGCCT CACGCCGCTG CGCCCCTCTG CAAAGGTCGA CGAGTTCGTC
GCCAAGGATC CCTATTGGAA GCCGTTGATC GACGGCATCA GCTATGGTGG TCCGGAGCCG
CTCTTCACCG ATTATAAGGG CTTCCAGAAC TCGATGATCG AAATGATCCA ATCGGTGGTG
ACTGGAAAGG CCGAGCCGGA AGCCGCACTC AAGAAGGCTG CCGGCGAAGT CGAAGCCTTC
AAGTAA
 
Protein sequence
MALALLGSTA MTTVTAHAAD KEISWIYCGD TIDPVHTKYI KQWEEKNTGW KITPEVVGWA 
QCQDKATTLA AAGTPVAMAY VGSRTLKEFA QNDLIVPVPM TDDEKKTYYP HIVDTVTFEG
NQWGVPIAFS TKALYWNKDL FKQAGLDPEK PPKTWAEEIE MAKTIKEKTG IPGFGLSAKT
FDNTMHQFMH WVYTNNGSVI GADGKVTLDS PQILAALKAY KDIVPYSEEG PTAYEQNEVR
AIFLDGKVAM IQAGSGAADR LKKTQISWGI TTLPLGPDAK GPGTLLITDS LAIFKGSGVE
DKATEFAKFI TSPDVQSEYE LQGGAGLTPL RPSAKVDEFV AKDPYWKPLI DGISYGGPEP
LFTDYKGFQN SMIEMIQSVV TGKAEPEAAL KKAAGEVEAF K