Gene Rleg_5214 details


Gene Information

Locus tag: Rleg_5214
Symbol:
ID: 8007109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 625573
End bp: 626607
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 60%
IMG OID: 644822123
Product: transcriptional regulator, LacI family
Protein accession: YP_002973383
Protein GI: 241113548
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
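
The start, end, and length values above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic follows; the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(625573, 626607)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```

Both results match the Gene Length and Protein Length fields of the record.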


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.657938
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGA AAAATTATAA ATCGATGGAC GACTTCGCGA CCGCCGCCGG CGTATCGCGT 
CCGACCTTGT CGAAGTATTT CGACGATCCC TCACAGATCA AGGAGGTGAC CCGCAAGCGT
ATCGAGGCGG CTCTCAAGAA GTCGCATTTC GAGCCGAACC TTTTCGCCCG GCATCTCAAT
CGCAAGCGGA CCAGGAATAT CGGGATTCTG GTCCCGACCA TGTCGGATCC CTTTTATGTG
CAGGTCGTGT CGCTGATCGA GCTGGAGTTG CGCGAAAAGG GATTCTGGCC CGTCCAGATC
TCTTCGCACA CGCGGCCGGA GCTGGAGGCG GAGGCCGTGC GGACCTTGCT GTCGCTGAAG
GTTGCAGGCG CGATCGTGGC GCCGCTCGGC GCCGGTTCAC GCCGTTCCGC GCTGGAAAGA
CTAAGCGGCG CAATCCCCGT CGTCTGCTTC GACAACCCTG CCGGCGCTGA CCTTCCCTAT
GTCGGAAACG ATAACGCGCA AAGCATGGCA GCCATTGTGC AATATCTCTG CCGGTCCGGT
GAGCCACCGA TATTCCTGGA GATTCCACAT TTGACCGAAA ATGCTGGCGA GCGGCAGGAG
AGCTACCGCG CGACCATGGC GAGGGAGGGG CATACTCCGC TGGTCATCGA TTGCGACGCA
CCACCTTCCT GGGAATTCGA GCGTCTCGGA TACGAGCAGA TGCAGAGAAT TCTTGCCAAA
GACGGTCTTC CCGGCAAAAC GCTGCTATGC GCCAATGACC GCTTCGCTTT CGGAGCAATG
GCTGCCGCAT TCGCTTCGGG ACTGAAAATC GGTCGTAGCG ACGCGTGCGA CGTACGCATT
GCCGGGCACG ACGACCACCC GCTTAGCCGA TACACCTGCC CGTCGCTGAC GACCATGGCT
CAGAACGCGC CGGAGATCGC CGCGAAATCC GTGGAGCTTT TGCTGGCACA TATCGAGAGC
GAGGATGATG GAACGACCCC TGCGACGAAT AAGGTGATCC TCGAGGCAAC GCTGGTTATG
CGCGAATCGG CTTGA
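
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any calculation. The sketch below shows one way to compute a GC fraction from such text, for comparison with the GC content field in the record. The helper name `gc_fraction` is hypothetical, and treating the reported percentage as a rounded value is an assumption.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input in the same space-separated layout as the record.
print(gc_fraction("ATGC ATGC"))  # 0.5
```

Passing the full gene sequence text to `gc_fraction` and multiplying by 100 gives a percentage that can be checked against the record.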
 
Protein sequence
MAKKNYKSMD DFATAAGVSR PTLSKYFDDP SQIKEVTRKR IEAALKKSHF EPNLFARHLN 
RKRTRNIGIL VPTMSDPFYV QVVSLIELEL REKGFWPVQI SSHTRPELEA EAVRTLLSLK
VAGAIVAPLG AGSRRSALER LSGAIPVVCF DNPAGADLPY VGNDNAQSMA AIVQYLCRSG
EPPIFLEIPH LTENAGERQE SYRATMAREG HTPLVIDCDA PPSWEFERLG YEQMQRILAK
DGLPGKTLLC ANDRFAFGAM AAAFASGLKI GRSDACDVRI AGHDDHPLSR YTCPSLTTMA
QNAPEIAAKS VELLLAHIES EDDGTTPATN KVILEATLVM RESA
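
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the sense-codon assignments of translation table 11 are the same as in the standard code, and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and the last 15.
print(translate("ATGGCCAAGA AAAATTATAA ATCGATGGAC"))  # MAKKNYKSMD
print(translate("CGCGAATCGG CTTGA"))                   # RESA
```

The two outputs match the first ten and last four residues of the protein sequence above.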