Gene Rleg2_5636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5636 
Symbol 
ID6977027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp24298 
End bp25257 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content66% 
IMG OID643393093 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002277911 
Protein GI209546021 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCGC CCGATAGTAA AAATGTCCAG GAAATCGGCT TCATCCTGAT CCCGGGATTC 
GCGCTGATGT CCTATGCCTC GGCGACCGAG CCGCTCAGGG CGGCAAACCT TCTGGCCGGA
CGAGAAATCT ATCGGCTGTC GATCTTTTCG CCGGACGGAG GGCCGGCGCG CTCCTCTTCA
GGCGTCAGCG TGCCCGCCGA ACCCCTTCCG GCCAGAGGTT CCGGCCTCGG CACGGCCTTC
GTCTGCGCCG GCGGCTTGCC GCGCGACTGG CGTTATCCCG GCGTGCTTGC CTGCCTCAGG
CAACTGTCGC GCGAGGGTGT GAGGATCGGC GGCATTTCGG GCGGCCCCTA TCTGATGGCT
GCCGCCGGAC TGCTGGCCGG CCGCGATTTT ACCATCCACT GGGAACATGC GGCCGCCCTG
CTCGAGGCCT TTCCGGAGCT TACGCCGCGC CAGGCGCGCT TCATGATCGA CGGCAACCGG
ATCACCTGCG GCGGCGGCAT CGCCCCGCTC GATATGATGC ATGTGCTGAT CGCCGAGCGC
ATGGGACCGG ATTTTGCCCG CCGCGTCAGC GACTGGTATC TTCACACCGA GGTCAATGAG
CCCGCCGCCC CCCAGCGCGC CTCGCTCGCC GAGCGCTATG GCGTCCACCA TCCAGGGCTG
CTCAGCGTTC TCGAACGGAT GGAGGAGACG ATCGAAATGC CGCTCGACCG CGCCGCCATG
GCGCGCATCG CCGGCGTCAC CGTCCGCCAT CTCGACCGGC TCTTTTCCGC CCATCTTAAG
ACCAGCTTCC TCGATCAGTA CCACAGGATC AGGCTGCAGC ACGCCCATCG CCTGCTGAAG
CAGAGCCCGC TTTCCGTCTC GGAGATCGCC GTTGCCACCG GCTTTTCCAG TCTTAGCCAC
TTTTCCCGGA TGTTCCGCGC CGTCTACGGC ATCGCTCCGC GTGAGGCGCG CCGGGAATAG
 
Protein sequence
MSSPDSKNVQ EIGFILIPGF ALMSYASATE PLRAANLLAG REIYRLSIFS PDGGPARSSS 
GVSVPAEPLP ARGSGLGTAF VCAGGLPRDW RYPGVLACLR QLSREGVRIG GISGGPYLMA
AAGLLAGRDF TIHWEHAAAL LEAFPELTPR QARFMIDGNR ITCGGGIAPL DMMHVLIAER
MGPDFARRVS DWYLHTEVNE PAAPQRASLA ERYGVHHPGL LSVLERMEET IEMPLDRAAM
ARIAGVTVRH LDRLFSAHLK TSFLDQYHRI RLQHAHRLLK QSPLSVSEIA VATGFSSLSH
FSRMFRAVYG IAPREARRE