Gene Rleg_3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3765 
Symbol 
ID8014595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3819751 
End bp3820734 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content62% 
IMG OID644826328 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002977547 
Protein GI241206451 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.216228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.775581 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTGT TCAAAGCAGC CATTCTGGCT GGCACTTTCG CCATTCTGGC GGCCGGCTCG 
GCATTTTCCG CGGACGTCAA GATAGGGTTC ATCGTCAAGC AGCCCGAGGA GCCTTGGTTC
CAGGACGAAT GGAAATTCGC CGACCAGGCC GCCAAGGAAA AAGGCTTCAC CGTCGTCAAG
ATCGGCGCCG AAGACGGCGA GAAGGTCCAG TCGGCGATCG ATAATCTCGG TGCCCAGGGT
GCGCAGGGCT TCATCATCTG CACGCCCGAT GTCAAGCTCG GCCCCGGCAT TGTCGCCAAG
GCCGAAGCCA ACCAGCTGAA GCTGATGACG GTCGACGACC GCCTCGTCAG CGCCGACGGC
AAGCCGCTGG AAGACGTGCC GCATATGGGC ATTTCAGCCA CCAAGATCGG CGAGACCGTC
GGCCAGGCGA TCGTCGACGA AATCAAGAAG CGCGGCTGGG ACATGAAGAA TGTCGGCGCC
GTGCGGGTCT CCTATGACCA GCTGCCGACC GCCGTCGACC GCGTCGAAGG CGCGATGTCG
GTGCTGAAGG CCGCCGGCTT CCCGGCCGAA AACATCTATG ATGCGCCACA GGCGAAGACG
GATACGGAAG CGGCGCTCAA CGCGGCAACC ACCGTTTTCA ACAAACATGC CGACGTGAAA
TACTGGGTCG CCTTCGGCCT CAACGACGAA GCCGTCCTCG GCGCCGTCCG TGCTTCCGAA
TCGGTCGGCA TTCCGGCGGC CAACGTCATC GGTGTCGGCA TCGGCGGTGC GGAATCGGCG
ATCAACGAGT TCAAGAAGCC GGCGGCGACG GGCTTCTTCG GCACCGTCAT CATCTCGCCG
AAGCGTCACG GCTATGAGAC GGCGCTCAAC ATGTATGACT GGATCGCCAA CGACAAGGAG
CCGGCAAAGC TGACGTTGAC CTCCGGTTCC CTGGCACTGC GCGGCGATTT TGAAAAGGTC
CGCAAGGATC TTGGCATCGA GTGA
 
Protein sequence
MRLFKAAILA GTFAILAAGS AFSADVKIGF IVKQPEEPWF QDEWKFADQA AKEKGFTVVK 
IGAEDGEKVQ SAIDNLGAQG AQGFIICTPD VKLGPGIVAK AEANQLKLMT VDDRLVSADG
KPLEDVPHMG ISATKIGETV GQAIVDEIKK RGWDMKNVGA VRVSYDQLPT AVDRVEGAMS
VLKAAGFPAE NIYDAPQAKT DTEAALNAAT TVFNKHADVK YWVAFGLNDE AVLGAVRASE
SVGIPAANVI GVGIGGAESA INEFKKPAAT GFFGTVIISP KRHGYETALN MYDWIANDKE
PAKLTLTSGS LALRGDFEKV RKDLGIE