Gene Rleg_0126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0126 
Symbol 
ID8011364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp120320 
End bp121339 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content61% 
IMG OID644822717 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002973976 
Protein GI241202880 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.986111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT CTGTTCTCGC TTTCGGCGCG CTCGCGCTTG GTGTCACCTT TTCCGCTCCT 
GTTATGGCGG CTGACGTTGC TGCCTGCCTC ATCACCAAGA CCGACACCAA CCCCTTCTTC
GTCAAGATGA AGGAAGGTGC GACGGCCAAG GCCAAGGAAC TCGGCGTCTC GCTGAAGTCC
TATGCCGGCA AGGTCGACGG TGACAGCGAA AGCCAGGTGG CCGCGATCGA AAGCTGCATT
GCCGACGGCG CAAAGGGCAT CCTGATTGCT GCTTCCGACA CCAAGGGCAT CGTGTCTTCG
GTCAAGAAGG CCCGTGATGC CGGCCTGCTG GTCATCGCGC TCGACACGCC GCTCGAGCCG
GCCGATGCCG CCGACGCCAC CTTCGCCACC GACAACCTGC TCGCCGGCAA GCTGATCGGC
CAGTGGGCCA AAGAAACGAT GGGCGACAAG GCCAAGGATG CCAAGGTCGG CTTCCTCGAC
CTGACGCCGT CACAGCCGAC GGTCGACGTT CTGCGCGACC AGGGCTTCAT GATGGGCTTC
GGCATCGATC CGAAGGACCC GAACAAGATC GGCGACGAGG ACGATGCTCG TATCGTCGGT
CATGACGTGA CCAACGGCAA TGAAGAAGGC GGCCGCAAGG CCATGGAAAA CCTTCTGCAG
AAGGATCCGA GCATCAACGT CATCCACACG ATTAACGAGC CGGCCGCTGT CGGCGCCTAT
CAGGCGCTGA AGGCCGTCGG CATGGAAAAG AACGTGCTGA TCGTCTCGGT CGACGGCGGT
TGCCCGGGCG TGAAGTCGGT CAAGGAAGGC GTCATCGGCG CTACCTCGCA GCAATATCCG
CTGATGATGG CAGCCCTTGG CGTCGAAGCG ATCAAGAAGT TCGCCGACAG CGGTGAAAAG
CCGAAGCCGA CCGAAGGCAA GTCCTTCTAC GACACCGGCG TCTCGCTCGT CACCGACAAG
CCGGTTTCCG GCGTCAAGTC GATCGACACC AAGGAAGGCA CGGACAAGTG CTGGGGCTGA
 
Protein sequence
MKKSVLAFGA LALGVTFSAP VMAADVAACL ITKTDTNPFF VKMKEGATAK AKELGVSLKS 
YAGKVDGDSE SQVAAIESCI ADGAKGILIA ASDTKGIVSS VKKARDAGLL VIALDTPLEP
ADAADATFAT DNLLAGKLIG QWAKETMGDK AKDAKVGFLD LTPSQPTVDV LRDQGFMMGF
GIDPKDPNKI GDEDDARIVG HDVTNGNEEG GRKAMENLLQ KDPSINVIHT INEPAAVGAY
QALKAVGMEK NVLIVSVDGG CPGVKSVKEG VIGATSQQYP LMMAALGVEA IKKFADSGEK
PKPTEGKSFY DTGVSLVTDK PVSGVKSIDT KEGTDKCWG