Gene Rleg2_5610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5610 
Symbol 
ID6978704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1258274 
End bp1259332 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content64% 
IMG OID643394708 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002279526 
Protein GI209547608 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.339749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTCG AAGACAAACG AAAGCCGCAA CGAAACGATC GCGTGACGAT CCGGACAGTG 
GCAACCCATG CAGGCGTCTC GGTCGCAGCG GTTTCCAAGG TGATGCGAAA CGCCTACGGC
GTCAGTGAAG CGCTGCGCGC AAGGGTGACC GACGCTATCG AGGCGCTTGG CTATCGCCCC
TCGCGGGCTG CGCGGGGGCT GCGCGGCCGC AGTTTCACCA TCGGCGTGCT GTTGATCGAC
ATCCGCAATC CTTTCCTTCC GGAGGTGATC GCCGGCGTCA ACGTGGTACT GGCGCCTTCG
CATTATCAGG CGATGATCGG CGTCAGCGAT GCGCGCGTGC AACTGGAGAC GTCGCTGATC
GAATCGATGA TCGATTACAA GATGGACGGC CTGATCCTCG TTGCGCCGCG CTTGCCCTCG
GAGATCATCG CAAGGTTCGC AGTGCAGATC CCGATTGTCG CGGTCGGCTA TCACGATGCC
GGCGCCACGG CCTTCGACAC CGTCAATGCC GACGATCAGC GCGGGGCGGA GATCGCGGTG
GAAGCGCTGC TTGCCTGCGG CTATCGCGAC ATCGAAATGC TCAGTCTCGG CGAGCGCGAG
GGGCATGCGG TTTCCGTCGT CCGCCAGCGT GAGATCGGCT ACCGCCGGGC GATGCAGCGC
GGCGGGCTCG GCGCCTCCGC ACCGATCGGC AAAATTCCGA TCGCCTCGCC GAAGCGGGAA
GCCGCGATGC GAAAGTTCCT GTCGCGGAAG GACAGGCCGC GTGCTGTGTT CTGCTGGAGC
GATCTCGATG CGATCACGCT GCTGAGCCTG GCGATGGAGA TGGGCGTGCG CGTGCCCGAA
GACCTCGCGG TCATCGGATA TGACAATTCC TCGACCGCAG CCCTTGGCCT CGTCAATCTC
GCAAGCATCG ATCAGTCGGG CAGGGAGCTC GGTCAGGTCG CAACCCGGGC CCTCATTTCC
AGAATAGAAG GCCGCACCGC CTCCGAGCAT ATTCTCCAGA TACCGTCGCT CGTCAGCCGC
AACAGCCTTG AACGCTCCGA GGGCGTGGAC CGGGTCTGA
 
Protein sequence
MSVEDKRKPQ RNDRVTIRTV ATHAGVSVAA VSKVMRNAYG VSEALRARVT DAIEALGYRP 
SRAARGLRGR SFTIGVLLID IRNPFLPEVI AGVNVVLAPS HYQAMIGVSD ARVQLETSLI
ESMIDYKMDG LILVAPRLPS EIIARFAVQI PIVAVGYHDA GATAFDTVNA DDQRGAEIAV
EALLACGYRD IEMLSLGERE GHAVSVVRQR EIGYRRAMQR GGLGASAPIG KIPIASPKRE
AAMRKFLSRK DRPRAVFCWS DLDAITLLSL AMEMGVRVPE DLAVIGYDNS STAALGLVNL
ASIDQSGREL GQVATRALIS RIEGRTASEH ILQIPSLVSR NSLERSEGVD RV