Gene Rleg2_6036 details

Gene Information

Locus tag: Rleg2_6036
Symbol:
ID: 6977422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 464859
End bp: 466811
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 64%
IMG OID: 643393488
Product: transcriptional regulator, SARP family
Protein accession: YP_002278306
Protein GI: 209546416
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
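
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(464859, 466811)
print(length)                     # 1953
print(protein_length_aa(length))  # 650
```

Both values agree with the Gene Length and Protein Length fields of this record.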


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0951775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.443759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGGA CATCCGGACA GATGTCGAAG AACAGGCTCT GCCTGCTCGG AAGACCGCGG 
CTGCTGGCAG CGGGGAGGGA ACTTCCCTTG CCGGAGAAGT CTTATTTTCT CCTCGCCATG
CTGACCGCCG AAGCCAATCT CGAACTCGAC CGCGAAACCG TCAGGCGGCA GCTCTGGCAA
TCGGAACTGC CGGAAAAGCG TGCGGGCAGC CTGCGCCAGC TGCTGTCGCG TATCGAGCAG
AGCATTCCCG CCGATCTTCC ACCGCTGCTT GCCGCGACCC GAACCCATAT CGCGCTTGCG
GATGGCTGGG AGGTCGATGT TCATATCCTG AAACAGAAAG GGCCGCTCGC CCCCGAAGAC
AGCGATATGC TGAACGGCGA ACTGCTGGAA GGCGCCAAAT CGCCGACGCA GGGCGCCGAG
GACTGGCTGA CCTTTGAGCG CCAGCGTGTC GACGAATTGC GCTCGGCCCA TCTCACCCGG
CTGATCGAAA CATCGGAAGA CGGATCGGAT GACGAGCAGG TGGCGTTTGC CCGGCGCCTG
CTGGAACTCG ACCCTGCCAG CGAGACGGCC TATCGGGCCC TGATGCGCAC CTATGTCAGG
CTCAACGATG CGGCCGCGGC CCGTCAGGCC TATCTGAAGT GCAAGAGCCA GCTGAAGGAC
GACTTCGATA CCGAGCCGGA GGAAAGCACC ACCGCGCTTG CCCGCGAACT CGGCCTGATC
CCGGCAGCGC AGGCAGCCGC GCCGGAGCGC CCGCCTGCAT CGGGCATGTT CGCCAATCTG
CTCGGCCAGC CGCGCATCAT CATCCTGCCG CCCGAAAGCA TCTTCACCGA TCCGCTGATG
GAGCGTGTCG GCCGGGCGCT GCTCGAAGAC GTCACCATCG GCCTCAGCCA GCAGCGCGGC
TTCAAGGTGA TCGCCGCGCA TACGAGCCTC GAAATCCTCA GCCGCTCGAT CGATCCGGCG
CGGGCCGTGC CCGGCCCGCT CGACCTCAGC TTCGATTATG CGGTCTACGT CACCATCCAG
GGCCGCGACG AGGATGTCTT TGCCACCTGC CGGCTGACGC GGACGACGAC GTCGGAGGTG
ATCTGGGCGC TGGAACTGCC GCTGGTGATG CAAAAGATCA GCGAGTCCTT CGCGCATCTG
ACGCGGCGGA TCGTCTCCTC GCTCGCCGAC ACGATCGAGC GCCACGAACT GGCAATGCCG
ATCGGCGATG CGCCGGCCTC CGCCTATCGC CTTTATCTCG AAGGCAAGCG GCTGATCGCC
CAGACCGACC TGCAGCATCT GCGCCAGGCG CGCAAATGGT TCAAATCTTC GCTCAATCGT
TACGAGCATT TCTCGGCCGC CCATGCCGGC GTGTCGCGGG CGCTCGGCAT GGAATGGCTG
ATCCGCGGCA TGCGCGACAA GGAACTGCTC GACGAGGCGA ATGGTGCCGC CCGGCAGGCG
CAGCAGTCCG ACCCGAACAG CGGCCGGGCC TATCGCGAGC TCGGTTTTGT GGCGCTTTAT
CGCCGCCGCT TCGACGAAAG CCTGGAATAT TTCCAGCAGG CCCAGGATCT CAATCCCAAC
GATGCCGACA TCCTCGCCGA TTTCGCCGAC GCGCTTTCCC ATGACGGCGA TTTCGATCGG
GCGCTGGAGC TCAGCCGCGC GGCCTTCAAA CTCAATCCAC TGCCGCCGGA TTATTATTAC
TGGAACCTCG GCGGCATCCA CTTCATGCGC GAAGAGTACG AAAAGGCGAT CGACGCGCTG
GAACCGGTGA AGACCAAACA GGCGACGGCG CGTCTGCTTG CCGCCTCGCA TGCGATGGCG
GGCGAGACCG GCAAGGCTCA GAACTATGCC CGGACGGTGC TGGAAAACTT CCCCGATTTC
CGCAGCGAGG ACATTCGTCA TTTCGTCCCC GATCGCGATC CCGCCTTCAC AGAACCGCTG
ATAAAAGGCC TGCAACTCGC CGGTCTTCCC TGA
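
The gene sequence is printed in blocks of ten bases separated by spaces and line breaks. A minimal sketch of how such a listing could be joined into one string and its GC percentage computed is shown here. The helper names are hypothetical, and nothing is assumed about how the database derived its own GC content figure.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The last line of the listing above, used as a short input.
last_line = "ATAAAAGGCC TGCAACTCGC CGGTCTTCCC TGA"
seq = clean_sequence(last_line)
print(len(seq))  # 33
print(round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field of the record.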
 
Protein sequence
MQRTSGQMSK NRLCLLGRPR LLAAGRELPL PEKSYFLLAM LTAEANLELD RETVRRQLWQ 
SELPEKRAGS LRQLLSRIEQ SIPADLPPLL AATRTHIALA DGWEVDVHIL KQKGPLAPED
SDMLNGELLE GAKSPTQGAE DWLTFERQRV DELRSAHLTR LIETSEDGSD DEQVAFARRL
LELDPASETA YRALMRTYVR LNDAAAARQA YLKCKSQLKD DFDTEPEEST TALARELGLI
PAAQAAAPER PPASGMFANL LGQPRIIILP PESIFTDPLM ERVGRALLED VTIGLSQQRG
FKVIAAHTSL EILSRSIDPA RAVPGPLDLS FDYAVYVTIQ GRDEDVFATC RLTRTTTSEV
IWALELPLVM QKISESFAHL TRRIVSSLAD TIERHELAMP IGDAPASAYR LYLEGKRLIA
QTDLQHLRQA RKWFKSSLNR YEHFSAAHAG VSRALGMEWL IRGMRDKELL DEANGAARQA
QQSDPNSGRA YRELGFVALY RRRFDESLEY FQQAQDLNPN DADILADFAD ALSHDGDFDR
ALELSRAAFK LNPLPPDYYY WNLGGIHFMR EEYEKAIDAL EPVKTKQATA RLLAASHAMA
GETGKAQNYA RTVLENFPDF RSEDIRHFVP DRDPAFTEPL IKGLQLAGLP
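
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below uses the standard codon-to-amino-acid assignments, which table 11 shares with the standard genetic code. Table 11 differs in which codons may act as start codons, and that case is not handled here because this gene begins with ATG. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon if present."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAACGGA CATCCGGACA GATGTCGAAG"
print(translate_cds(first_blocks))  # MQRTSGQMSK
```

The output matches the first block of the protein sequence in this record.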