Gene Rleg_5013 details


Gene Information

Locus tag: Rleg_5013
Symbol:
ID: 8007604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 398956
End bp: 399954
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 63%
IMG OID: 644821928
Product: transcriptional regulator, LacI family
Protein accession: YP_002973188
Protein GI: 241113353
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
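
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 398956
end_bp = 399954
gene_length_bp = 999
protein_length_aa = 332

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is read as whole codons and the last codon is a stop codon.
codons = span_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # span matches the reported gene length
print(span_bp % 3 == 0)               # length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop matches protein length
```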


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0771857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.165296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTCA AACATCAGCC GACGGTAACG CTTGCCGAAG TCGCGGAAAC CGCCGGTGTC 
GGCGAGAGCA CGGTGTCGCG CGTGCTGCGC AACCATGGTT CGTTTTCCGA CAAGACGCGC
GAGCGGGTGA TGGCCGCCGT CGAGCAACTG GGCTATGTGC CGAACCGGAT CGCGGGAACG
CTTGCCTCCA CCGGTTCGCG TCTTGTCGCC TTCGTCATTC CCTCTTTGTC CAACATCGTC
TTTCCCGATG TGCTGCGCGG TGCCAGTGCC GTTCTGGAGG AAAACCGGTA TCAGGCGGTT
TTCTCGGTGA CCGACTATGA TCCGGGCAAG GAAGAAGCGC TCGCCGCTGC GATGCTTGCC
TGGCGGCCGG CGGCGGTCAT GCTGGCGGGG TGCGAGCATA GCGAGGGCAC GGTGAAGATG
CTGCGCGCCA GCGGGTGCCG GGTCGTCGAA CTGCTGGATC TGGACGGTGA TGCTCTCGAT
ATCGCCGTCG GCTTCTCGAA CCGCGCGGCC GGGCGGGAGA GCGCTGCCTT CCTGCTCAAG
CGGGGGTATC GCCGGATCGG CTATGTCGGT CACGACCTGA ACCGCGATAC CCGTGCCGGC
AAGCGTTTTT CCAGCTTTTG TGAAACGCTC GGCGCGCATG ACGCCCCGCT CGTCGCCCGT
GAAATTCTCG CCGGCGCTTC ATCCGTGGAA AACGGCAGGC TGGGGCTGGA GCGGCTGCTT
GCCCGGACGA GGGATCTCGA TGCGGTTTAT TTCTCCAACG ACGACATGGC GCTGGGCGGC
TATTTTCATT GTCTGGCCGA GGGGATAGCG ATCCCCTCGA AGCTCGCCAT TTTCGGCTAT
AACGGCCTCG ATATCGGCCG GGCAACACCG CAACCCTTAT CGACCATCCG AACGCCGCGT
GTCGCGACCG GACAGATGGC CGCGCAGTTG GTCGTCACGA ATGCGCCGCC GCAGGCCGTC
GATCTCGGCT TTGAACTGAT CGAAGGGGCA ACCGCGTAA
 
Protein sequence
MEFKHQPTVT LAEVAETAGV GESTVSRVLR NHGSFSDKTR ERVMAAVEQL GYVPNRIAGT 
LASTGSRLVA FVIPSLSNIV FPDVLRGASA VLEENRYQAV FSVTDYDPGK EEALAAAMLA
WRPAAVMLAG CEHSEGTVKM LRASGCRVVE LLDLDGDALD IAVGFSNRAA GRESAAFLLK
RGYRRIGYVG HDLNRDTRAG KRFSSFCETL GAHDAPLVAR EILAGASSVE NGRLGLERLL
ARTRDLDAVY FSNDDMALGG YFHCLAEGIA IPSKLAIFGY NGLDIGRATP QPLSTIRTPR
VATGQMAAQL VVTNAPPQAV DLGFELIEGA TA
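
The gene and protein sequences above can be cross-checked with a small script. This is a minimal sketch, not the method used to produce this record: the helper names `clean`, `gc_content` and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). It accepts sequence text with the space-separated 10-base blocks used on this page.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments are
# the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGAATTCA AACATCAGCC GACGGTAACG"
print(translate(first_block))  # MEFKHQPTVT

# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACCGCGTAA"))  # TA
```

Pasting the full gene sequence into `translate` and `gc_content` allows a comparison with the protein sequence and the GC content reported in this record.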