Gene Rleg_3771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3771 
Symbol 
ID8014601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3825643 
End bp3826668 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID644826334 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002977553 
Protein GI241206457 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.613992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.844233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACACC TCTTTCTCGT CAAGGATATT GCCTTCCAGG CAGGGCTCAG CACTGCGACC 
GTCGACCGGG TGCTGAACGG CAGGCCGGGC GTGCGCCGGC AGACCGAGAT GCGGGTGAAG
GCGGCGATCG CGGAGCTGGA GAAGCAGCAG GCCGGGGCGA TGGGCAGCGG GCGGGTGCTG
GCGATCGATA TCGTCATGGA GACACCGCAG CGTTTCAGCG ATGCGGTGCG CGCCGCCTTC
GAGGCGGAGA TGGCGACCTT TCTGCCGGGC GTCTTCCGCT GTCGCTTCCA TTTTGCCGAG
GTGATGAAGC CGGCCGAGCT GGCGCAGCTT CTCGATCGCA TCCGGCTGCG TGGCACGCAC
GGCATCGTAC TCAAGGCGCC TGACGTCACC GAGGTGGCTG CCGCCGTCGC GCGGGCGGAT
GCGACCGGCA TTCCGGTCGT GACGCTGGTG ACCGACCTGC CGAATTCGGC CCGCATCGCC
TATGCCGGCG CCGACAACCG GGCAGCGGGA GAGACCGCCG CCTATCTGAT CGGCGAATTT
CTCGGGGCTG GCGGCGGCAA GGTGCTGGTG ACCCTGTCGA GCGGACGCTT CCGTGGCGAG
GAGGAGCGCG AAATCGGCTT TCGCCGCGTC ATCCGCGCCC GTTATCCCGA CATCGGCATT
ACCGAAATCA GCGAGGGGCA CGGCACCGAT GCTGCGACGG GAACGCTTGC CGCGGCAGCG
CTTGCCGCCG ATCCCACCAT CAACGCCGTC TATTCGATCG GCGGCGGCAA CCGGGCGGTG
CTTGCGGCCT TCGACGCGGC CAAACGCCCC GTCCGCGTTT TCGTCGCCCA TGATCTCGAC
GCGGATAATC GCGCCCTGCT TGCGGCGCGT CGGATCGGCT TCGTGCTGCA TCATGATCTC
AGAACCGATG CGCGTTCGGC CTTCCGGGCG ATCATGAGCC GCGCGACTGC CTTGCCGCGC
GCGGTTACGC CCTCGCTTTC ATCAGTCGAA ATCGTCACGC CCTACAATAT GCCCGCGGCC
GATTGA
 
Protein sequence
MAHLFLVKDI AFQAGLSTAT VDRVLNGRPG VRRQTEMRVK AAIAELEKQQ AGAMGSGRVL 
AIDIVMETPQ RFSDAVRAAF EAEMATFLPG VFRCRFHFAE VMKPAELAQL LDRIRLRGTH
GIVLKAPDVT EVAAAVARAD ATGIPVVTLV TDLPNSARIA YAGADNRAAG ETAAYLIGEF
LGAGGGKVLV TLSSGRFRGE EEREIGFRRV IRARYPDIGI TEISEGHGTD AATGTLAAAA
LAADPTINAV YSIGGGNRAV LAAFDAAKRP VRVFVAHDLD ADNRALLAAR RIGFVLHHDL
RTDARSAFRA IMSRATALPR AVTPSLSSVE IVTPYNMPAA D