Gene Rleg_0584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0584 
Symbol 
ID8011771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp611051 
End bp613360 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content65% 
IMG OID644823174 
ProductRNA binding S1 domain protein 
Protein accessionYP_002974427 
Protein GI241203331 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.717148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCAG ACCTCCGTTT TCTCGCAGCC CGCATCTCCG CCGAAATCAA CGCCCGGCCC 
GAACAGGCCA AAGCTGCAAT CGAGCTGCTC GACGAAGGCT CTACCGTGCC CTTCATCGCC
CGCTACCGCA AGGAAGTGAC GGGCGGGCTG GATGATACGC AGCTGCGCAA TCTCGCCGAG
CGGCTGGTCT ATCTGCGCGA ACTAGAAGCT CGGCGCGACG CGATCGTCGA ATCGATCACC
GGCCAGGGCA AGATGACCGA CGAGCTGATG ACCAAGGTCG CAGGCGCCGA AACCAAGGCC
GAGCTTGAGG ATCTCTATCT GCCCTACAAG CCGAAGCGCC GCACGCGCGC CGAAATCGCC
CGTGAACGTG GCCTCGGCCC GCTCGCCGAG ACGATCCTCG CCGACCGCGC CAGGGAACCG
GCGGTATTGG CCGAAGGCTT CGTCACCGCG GATGTGCCCG ATGTGAAGAC GGCGCTCGAA
GGCGCGCGCG ACATCATCGC CGAAGGCATT GCCGAAAATG CCGACCTGCT CGGCAGGTTG
CGCGCCCATA TGCGCCAGGC CGCGCTGCTG AAGGCGAAAG TCGTTGATGG CAAGCAGGCG
ACCGGCGAGA AGTTTTCCGA TTATTTCGAC CATTCCGAAC GCTGGGCGAC CGCCCCCGGC
CACCGCGCGC TCGCCATGCT GCGCGGCTGG AACGAGGAGG TGCTGACGCT GACGATCGAG
GCCGACGCCG AGACGACCTC TCCGAACAAG CCGGTCGAGC GCATGATCGT GGCCGCCTAC
GAGATCGGTA CCAGCCGTCC CGGCGACCGC TGGCTGATGG AGGTCGCAAG CTGGACCTGG
CGCGTCAAGC TTTCCATGTC GCTCTCGCTC GACCTGATGC GGGAACTGCG CGAAAGGGCC
GAAGAGGAGG CGATCCATGT CTTCGCCCGC AATCTCAAGG ATCTGCTTTT GGCTGCACCC
GCCGGCTCGC GCGCGACGAT GGGTCTTGAT CCCGGCATCC GCACCGGCGT CAAGGTCGCC
GTCGTCGACG GCACCGGCAA GGTGGTGGCG ACATCGACCG TCTATCCCTT CCAGCCGAGG
AACGATGTGC GCGGCGCCCA GGTCGAGCTC GCATCGCTGA TCCGCAAGCA CAATGTCGAG
CTGATCTCGA TCGGCAACGG CACCGGCAGC CGCGAAACCG AAAAGCTGGT GGCCGACATG
CTGGCCGAGT TGCCGGCGCC GAAGCCGACC AAGGTCATCG TGTCGGAAGC CGGCGCCTCG
GTCTATTCCG CCTCGGCGAC CGCAGCGGCT GAATTCCCCG ATCTCGACGT ATCACTGCGC
GGCGCCGTCT CCATCGCACG CCGCCTGCAG GATCCGCTGG CCGAACTCGT CAAGATCGAG
CCGAAGTCGA TCGGCGTCGG CCAATATCAG CACGACGTCG ACCAGCAGAA GCTGTCGCGT
TCGCTCGATG CCGTGGTGGA AGACGCAGTG AATGCCGTCG GGGTCGATCT CAATACCGCG
TCTGCGCCGC TGCTGTCGCG CGTCTCCGGC CTCGGCCCGT CGATCGCCGA CGCCATTGTC
CGCCACCGCG ACAGCGAAGG CCGTTTCGAG ACGCGAAAGG ATCTTCTGAA GGTCGCCCGC
CTCGGCGGCC GCACTTTCGA GCAGTGCGCC GGCTTCCTGC GCATTCCGAA CGGCAAGGAG
CCGCTCGATT CCTCCTCGGT CCACCCGGAG GCCTATGGCG TCGCAAAGAA GATCGTCGCT
GCCTGCGGCC GCGATCTGCG CGCGTTGATG GGCGATAGCG CGGTGTTGAA ATCGGTTGAT
CCGCGCCAGT TCATCGACGA GAAATTCGGC CTGCCGACGG TCAGGGACAT CATTGCGGAG
CTGGAAAAGC CCGGCCGCGA CCCGCGTCCG AGCTTCAAGA CCGCGACCTT CGCCGAGGGC
GTCAACGAAA TTTCCGACCT CAAGCCCGGC ATGGTGCTCG AAGGCACGGT GACCAATGTC
GCGGCCTTCG GCGCCTTCGT CGATATCGGC GTGCACCAGG ATGGCCTGGT GCATGTGTCC
CAGCTTGCCG ATCGCTTCGT CAAGGATCCC CACGAGGTCG TCAAGGCGGG TGATGTCGTC
AAGGTGCGGG TTGTAGAAGT CGACGCCAAG CGCAAGCGCA TCGCTCTTTC TATGAAACGC
GATGACGGTT CTTCAGCGCC GCCGCCGCGG GGTGATTCTC GCGCGAACCA GGGTTCCCGG
CCGCAGAACG AGAGCCGGCC CGCGGCCGCG AAACCCGAGA GCCAGGGCGC TTTCGGCGCG
GCACTTGCCG AAGCGATGAA GCGAAAATAA
 
Protein sequence
MAADLRFLAA RISAEINARP EQAKAAIELL DEGSTVPFIA RYRKEVTGGL DDTQLRNLAE 
RLVYLRELEA RRDAIVESIT GQGKMTDELM TKVAGAETKA ELEDLYLPYK PKRRTRAEIA
RERGLGPLAE TILADRAREP AVLAEGFVTA DVPDVKTALE GARDIIAEGI AENADLLGRL
RAHMRQAALL KAKVVDGKQA TGEKFSDYFD HSERWATAPG HRALAMLRGW NEEVLTLTIE
ADAETTSPNK PVERMIVAAY EIGTSRPGDR WLMEVASWTW RVKLSMSLSL DLMRELRERA
EEEAIHVFAR NLKDLLLAAP AGSRATMGLD PGIRTGVKVA VVDGTGKVVA TSTVYPFQPR
NDVRGAQVEL ASLIRKHNVE LISIGNGTGS RETEKLVADM LAELPAPKPT KVIVSEAGAS
VYSASATAAA EFPDLDVSLR GAVSIARRLQ DPLAELVKIE PKSIGVGQYQ HDVDQQKLSR
SLDAVVEDAV NAVGVDLNTA SAPLLSRVSG LGPSIADAIV RHRDSEGRFE TRKDLLKVAR
LGGRTFEQCA GFLRIPNGKE PLDSSSVHPE AYGVAKKIVA ACGRDLRALM GDSAVLKSVD
PRQFIDEKFG LPTVRDIIAE LEKPGRDPRP SFKTATFAEG VNEISDLKPG MVLEGTVTNV
AAFGAFVDIG VHQDGLVHVS QLADRFVKDP HEVVKAGDVV KVRVVEVDAK RKRIALSMKR
DDGSSAPPPR GDSRANQGSR PQNESRPAAA KPESQGAFGA ALAEAMKRK