Gene Rleg_5122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5122 
Symbol 
ID8007393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp520669 
End bp521739 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID644822035 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002973295 
Protein GI241113460 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.958101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.135885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGATT CCGCTCATCC GAAACGTGCA GTCACGGTCG CCGACGTCGC GAAGGCCTCA 
AAGGTCTCGA AGGCGACGGC CGCCCGCGTT CTCGGCGGCT ACGGCGTGGT CAGCGCCAAG
ATCACAGATC AGGTCATGGC GGCCGCCGCC GCCCTCGAAT ACCGCCCGAA CGAACTCGCC
CGGAGCATGA GCACCGGAAG ATCCGGTATC ATCGGGGTCG TGGTCGGCGA TATCGAGAAC
GCTTTCTTCA GCCTGGCGGT GCGCGGCATC AGCGATGCCG CCCGCCTCGC GGGATTCAAC
GTCATCATCG CGAATTCGGG CGAAGAGCTC GATGCGGAAA AATCGGCCGT CGACCTCCTG
ATCGGCAAAC GCGTCGATGG CCTGATCGTC ACTCCGGCCC GCTGCGACAG CATCGATCAT
CTCCACCACG TCCGCCGCGC CGGCGTGCCG CTCGTCCTGT TCGACCGGGC GATCCCGGAA
CTCGATGTCG ACGCCGTGAC CGGCGACGAC CGGGATGCGG CCATCGCGGC GACCCGGTAT
CTGATCGAGC AAGGACATCG CCGCCTCGCC TACGTCTCTG CTATGGATGC CGAGGGCGGC
GGGCCCACAG ATATCGGGCG GATCTCGAAT TCCGCCGTGC GCGAACGCGT AGAAGGTTTT
GTCAGCGTCC TGACCGAGGC GGGTTTGCCG AACCCTCTTC ATTATGTCAG GCTCGGAGCC
ACGGACCAGC GCCAGACAGA CGGCGTGATC AAAAGCCTGC TCGCCGACAG TGCCGCGCCG
ACGGCGCTGC TGGCATCGGA CAGCCTCGTC GGCTTGCGCA TCTTCAAGTC GCTACAATCG
CTCGGCCTGT CGATCCCCAA GGACGTGTCG ATGATTTCGT TTCTCGACGC CGACTGGACC
AGCGTCACCG TTCCGCCGAT CACCATCGTA GACCAGCGCG TCTACGAGAT GGGCAAGCTC
GCCGGCGAGC GGCTCATCGC CCGTATCGAA CGCACCCCTC TTGCCGTCGA ACGTCTGCGC
GTCCGCACGA GCCTCGTCCT GCGCGGCTCC GTCGCCACGA TCGGCCGGTG A
 
Protein sequence
MEDSAHPKRA VTVADVAKAS KVSKATAARV LGGYGVVSAK ITDQVMAAAA ALEYRPNELA 
RSMSTGRSGI IGVVVGDIEN AFFSLAVRGI SDAARLAGFN VIIANSGEEL DAEKSAVDLL
IGKRVDGLIV TPARCDSIDH LHHVRRAGVP LVLFDRAIPE LDVDAVTGDD RDAAIAATRY
LIEQGHRRLA YVSAMDAEGG GPTDIGRISN SAVRERVEGF VSVLTEAGLP NPLHYVRLGA
TDQRQTDGVI KSLLADSAAP TALLASDSLV GLRIFKSLQS LGLSIPKDVS MISFLDADWT
SVTVPPITIV DQRVYEMGKL AGERLIARIE RTPLAVERLR VRTSLVLRGS VATIGR