Gene Rleg_5023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5023 
Symbol 
ID8007614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp408908 
End bp410779 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content60% 
IMG OID644821938 
Producttranscriptional regulator, SARP family 
Protein accessionYP_002973198 
Protein GI241113363 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.494336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.650955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGACT GGCCCCTGGA TGCCAGGTTT TCCGGCCGCC AAACGAGCGT ATCGGAGGCA 
GCGGGGGCGG GGCCGGTGGA TCGCATGAAC AAGGAGCAAT ATCCGTTTCG GATGTTCCTG
CTCGGGCCCT TTGCCCTTGT GGACGCCGGG GGGCGGTCGG TTGCTCCGAA ATCCAAAAAG
GCACAGGCTC TTTTGGCAAT GCTTGCATTG TCCACCCGGG GCTCGCGCTC GAGAATCTGG
CTTAGGGACA AGTTATGGAG CGATCGCTCC GACGACCAGG CGGCAGCCAG TCTACGCCAG
GCGCTTTTGG ACATTCATAA GAGTCTGGGG CCGGCACGTG ATCTCTTGAT TGCGGATAAG
AATACCGTTT GGCTGGATAT GGACCGACTG GCGCTCGATA CCGACCTGGT GGTTCGGACG
GAGCGGTCTG CGGATCAAGT CACCGACGAA TTGCTCGAAG GTATCGACAT CCGCGATCCT
GAATTCGAGG ACTGGTTGGC GCTGGAAAGG CAAAACTGGT ATCGCCGTCT CGATGAAGGA
CAAGTCCACG ACGTCTTCGA GCCGCGACAG CAGCCGAGCC GCGATATCGC CAAACATTCC
GCCCTGCTGC CGTTAACTGG CGCCCCGGAT ACGTCGAGAA CAGGCAAACC ACCCGTGGAT
ATCGCCAGCA GCGGTCCCTC TGGGCGGCGG GTTGGCGGTG ACTGGCGATG GATGATGGCT
CTTCTGTCCC CCATCGTGGT GGGTGCCGGC GAGGGCGGGC AAATTGCTGC GACACGGTTC
CAAAACCTCA TTGCAAAAGC CATCATCGAT GGGCTGGGCT TTGGCGTCAC CGACCTCTCG
TTCACCTCGC CGCATATTGA AGAGAGTGAA CAGCAAATCA GCCTTCCCCT ATGCCTTCAG
CTTCGCCTGA CGTTTGATGG TGACATGGTG CTGATCGAAC TGGTGATGAA GCACCTGATC
AACAACCGCA TTCATTGGCT GGGAAGTCAG GCAATCAACC GCACGCAGTT CGAGCGCGGC
GAGTTCGGCA TCGCCGCTGC GCTGATCAGC CAGGCAGTCG ATCAACTGGC CTATTTCGAG
GAGATCCAGG CAACCGACAG CAGATTGTCG CAAGACGGTC TCCTGATCGA CGCCGTCAAT
GCGATCTTTC GGCTGTCGCG CGACGACCTC GACAACGCGG AACGGCGCCT GGAAGAACAG
ATCCAGTATC AGCCGCGATC ATCGACTTTT GCCTGGCTGT CATTCATTCG GACTTTCCAG
GTCGGCCAGC GTTTCAACGC GCTGGATGCC CATCTGATCG AGGAAGCCCA GGCCTATGCA
CGCAAGGCGC TGGAACTCGA TCCGCAGAAT TCCGTGTCGC TCGCGCTCGT CGGCCACGTC
CATTCGTTCC TGTTCGGCGA ATACGACTAT GCGGCCAACC TGTTCGAAAA ATCGATCCGC
CTGAATCCGG CCCTGCCGCT CGGTTGGGAC CTCTACGCGA TGCTGCACTG CTATGCAGGC
CAGCCCGACA AGGCGGTGGC GATGGCGCGT TGGGTACAAG AGCTCGGCGT CTACAGCCCG
CATAAATATT ACTTCGATAC GACCAAATGC ATTGCGGCAG CGCTTGCAGG CGATCATGCC
GCAGCCATAT CTGCAGGCGA AGAAGCCCTG CGGGCACGGC CGAACTTCAA CAGCCTGCTG
CGCTATCTTG CCTCCAGTCA TGCCCATTCC AACGATCTCG GCGGCGCGCG GCATTACCTG
CAGCGTCTTG AGGCAGTCGA GGGCGGTTTC TCCATCGACG CCTTCCGCGG CAGCGGCTAT
CCGCTGCTTG ACACAGGCGG CGGGCAGATC CTGATCGACG GCCTGCTCAA GGCCGGCGCC
AAGCTGCGCT GA
 
Protein sequence
MADWPLDARF SGRQTSVSEA AGAGPVDRMN KEQYPFRMFL LGPFALVDAG GRSVAPKSKK 
AQALLAMLAL STRGSRSRIW LRDKLWSDRS DDQAAASLRQ ALLDIHKSLG PARDLLIADK
NTVWLDMDRL ALDTDLVVRT ERSADQVTDE LLEGIDIRDP EFEDWLALER QNWYRRLDEG
QVHDVFEPRQ QPSRDIAKHS ALLPLTGAPD TSRTGKPPVD IASSGPSGRR VGGDWRWMMA
LLSPIVVGAG EGGQIAATRF QNLIAKAIID GLGFGVTDLS FTSPHIEESE QQISLPLCLQ
LRLTFDGDMV LIELVMKHLI NNRIHWLGSQ AINRTQFERG EFGIAAALIS QAVDQLAYFE
EIQATDSRLS QDGLLIDAVN AIFRLSRDDL DNAERRLEEQ IQYQPRSSTF AWLSFIRTFQ
VGQRFNALDA HLIEEAQAYA RKALELDPQN SVSLALVGHV HSFLFGEYDY AANLFEKSIR
LNPALPLGWD LYAMLHCYAG QPDKAVAMAR WVQELGVYSP HKYYFDTTKC IAAALAGDHA
AAISAGEEAL RARPNFNSLL RYLASSHAHS NDLGGARHYL QRLEAVEGGF SIDAFRGSGY
PLLDTGGGQI LIDGLLKAGA KLR