Gene Rleg_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1050 
Symbol 
ID8012179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1025457 
End bp1026497 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content63% 
IMG OID644823633 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002974884 
Protein GI241203788 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.190777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTGCT GTGTCAAGCT TTGGGAGTGG GTGAGCCTGA TGAAGGTGAC TTTGAAGGAT 
GTCGCAAGCC AAGCCGGCGT TGGCACAGCG ACCGTCGAGC GCGTCCTCAA CGGCCGCGGC
GGCGTGCGGC CGGGAACGGT GGAGAAAGTC TTCCTGGCGG CAAGGCGGCT TGAATACCGG
CAAAGCCTGC CGGTCGCCCA TCGCGGTCTG ATCCGGATAG AGGTGATCCT CGTTCGCCCG
GAGACCAGCT TCTATTCGCG TCTGAACCGG GCGTTCGAGC GCATCGCCGC CTCGCTCGAC
GACAGCATCA CCGTTCACCG CACCTTCGTC CGCGAGAACG AGCCGGCGCA ATTCGCCCGC
TACATCGCCA ACCCGACGGC ACGCCGGTCG GCGCTGATCG TGGTCGCGCC CGACCATGCC
GATGTGGTCA CAAGCGTGCG CAAGGCGGCC GGTCTCGGCA TTCCCGTTGT ACAGATCATG
ACCCGTCCGG CCCCCGAACT GCCCTATGTC GGCATCGACA ACTATGCTGC AGGACGCACC
GCAGCTCACT ACATGTCGGG CATGCTGGCG CAGCGCACCG GTTCGTTCGT CGCTCTCTGC
CACAGCGGCG CCTATGAGAA CCATAAGGAG CGGATCCGCG GCTTCTCCAG CTATCTCTCG
GAGAAGGGTT CAAACGACCA TCGTTTCATC GAGGTCATGT TCGACCTCGA TGACGAGCAC
AATGCCATGG AGCTGCTGCA AGCGGCACTC CGGCGCGAGC CCGGCATCAT CGGGGTCTAT
AGTGCAGGCG GCGACAACAA GGGTGTTGCC AGGGTGCTCG CGGCCAACAA GGCTGGGCGG
CCCTTCTGGG TCGGCCACGA ATTGACGCGA GAGACGCAGG ACTATCTCAG CCGTGGCATC
ATGTCGATCG TGCTCGACCA GGCGCCGGAG GTCCAGGCGA GGCGATCGAT AGACCTTGCC
CTGAACAGGC TCGGTCTCAT CGAGATGGAG GTCAGCGCCG AGCCGGTGCG CTTTCTGACG
ATCACACCGG AAAATCTTTG A
 
Protein sequence
MLCCVKLWEW VSLMKVTLKD VASQAGVGTA TVERVLNGRG GVRPGTVEKV FLAARRLEYR 
QSLPVAHRGL IRIEVILVRP ETSFYSRLNR AFERIAASLD DSITVHRTFV RENEPAQFAR
YIANPTARRS ALIVVAPDHA DVVTSVRKAA GLGIPVVQIM TRPAPELPYV GIDNYAAGRT
AAHYMSGMLA QRTGSFVALC HSGAYENHKE RIRGFSSYLS EKGSNDHRFI EVMFDLDDEH
NAMELLQAAL RREPGIIGVY SAGGDNKGVA RVLAANKAGR PFWVGHELTR ETQDYLSRGI
MSIVLDQAPE VQARRSIDLA LNRLGLIEME VSAEPVRFLT ITPENL