Gene Rleg2_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0539 
Symbol 
ID6979255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp556695 
End bp559004 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content67% 
IMG OID643395251 
ProductRNA binding S1 domain protein 
Protein accessionYP_002280062 
Protein GI209548145 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.500263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.212991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCTG ACCTCCGTTT TCTCGCAGCC CGCATCTCGG CCGAAATCAA CGCCCGTCCG 
GAACAGGCCA AAGCCGCCAT CGAGCTGCTC GACGAGGGCG CCACCGTGCC CTTCATTGCG
CGCTACCGCA AGGAGGTGAC GGGCGGGCTC GACGATACGC AGCTGCGCAA TCTCGCCGAG
CGGCTGGTCT ATCTGCGCGA ACTCGAAGCC CGGCGCGCCG CGATCGTCGA ATCGATCACC
GGCCAGGGCA AGATGACCGA CGAGCTGATG GGGAAGGTGG CAGGCGCCGA AACCAAGGCC
GAGCTCGAAG ACCTCTACCT GCCCTATAAG CCGAAGCGGC GCACCCGCGC GGAAATTGCC
CGCGAACGCG GCCTCGGCCC GCTTGCCGAG ACGATCCTTG CCGACCGCGG CCGGGAACCA
GCGGCGCTCG CCGAAGGCTT CGTCACCGCT GACGTGCCCG ATGTGAAGAC GGCGCTCGAA
GGCGCGCGCG ACATCATCGC CGAAGGCATT GCCGAAAATG CCGATCTGCT CGGCCGATTG
CGCGCCCATA TGCGCCAGGC GTCACTGCTG AAGGCCAAGG TCGTCGACGG CAAGCAGGCG
ACGGGCGAGA AGTTCTCCGA TTATTTCGAC CATTCCGAAC GCTGGGCGAC CGCCCCCGGC
CACCGCGCGC TCGCCATGCT GCGCGGCTGG AACGAAGAGG TGCTGACGCT GACGATCGAA
GCCGACGCCG AGACCGCCTC GCCGAACAAG CCGGTCGAAC GGATGATCGC TGTGGCCTAT
GAGATCGGCG CGAGCCGGCC CGGCGACCGC TGGCTGATGG AGGTCGCCAG CTGGACCTGG
CGCGTCAAGC TTTCCATGTC GCTGTCGCTC GACCTGATGC GGGAGCTGCG CGAGAGGGCC
GAAGAGGAGG CGATCCATGT CTTCGCCCGC AATCTCAAGG ATCTGCTGCT GGCCGCCCCC
GCCGGCTCGC GCGCCACCAT GGGCCTCGAT CCCGGCATCC GCACCGGCGT CAAGGTCGCC
GTCGTCGACG GCACCGGCAA GGTCGTGGCC ACCTCGACCG TCTATCCCTT CCAGCCGAGG
AACGACGTGC GCGGCGCCCA GATCGAGCTC GCATCGCTGA TCCGCAAGCA CAATGTCGAG
CTGATCTCGA TCGGCAACGG CACCGGCAGC CGCGAAACCG AAAAGCTGGT GGCCGACATG
CTGGCCGAGT TGCCGGCGCC GAAGCCGACC AAGGTCATCG TCTCGGAAGC GGGCGCCTCG
GTCTATTCCG CCTCGGCGAC CGCAGCGGCT GAGTTTCCCG ATCTCGACGT GTCGCTGCGT
GGCGCCGTCT CCATTGCCCG CCGCCTGCAG GATCCGCTCG CCGAACTCGT CAAGATCGAG
CCGAAGTCGA TCGGCGTCGG CCAGTATCAG CACGACGTCG ACCAACAGAA GCTGTCGCGC
TCGCTCGATG CGGTGGTCGA AGACGCGGTC AATGCTGTCG GTGTCGATCT CAACACCGCC
TCTGCGCCGC TGCTTTCCCG CGTCTCCGGC CTTGGGCCCT CGATTGCCGA TGCCATCGTC
CGCCACCGCG ACAGCGAGGG CCGTTTCGAG ACGCGCCGCG ATCTGTTGAA GGTTGCCCGC
CTCGGCGGCC GCACCTTCGA GCAATGCGCC GGCTTCCTGC GCATCCCGAA CGGCAAGGAG
CCGCTCGACG CCTCCTCCGT CCACCCGGAG GCCTATGGCG TCGCCAAGAA GATCGTCGCC
GCCTGCGGCC GTGATCTGCG CGCATTGATG GGCGACAGCG CCATGCTGAA ATCGGTCGAT
CCGCGCCAGT TTATCGACGA GAAATTCGGT CTGCCGACCG TCAGGGACAT CATCTCGGAA
CTGGAAAAGC CCGGTCGCGA CCCGCGCCCA AGCTTCAAGA CCGCCGCCTT CGCCGAGGGC
GTCAACGAAA TTTCCGACCT CAAGCCCGGC ATGATGCTGG AGGGCACGGT GACCAACGTC
GCCGCCTTCG GCGCCTTCGT CGATATCGGC GTGCACCAGG ATGGTCTGGT GCATGTCTCC
CAGCTTGCCG ACCGCTTCGT CAAGGATCCG CACGAGGTGG TCAAGGCGGG CGATGTCGTC
AAGGTGCGGG TGGTCGAGGT CGACGCCAAG CGCAAGCGCA TCGCTCTTTC GATGAAGCGC
GACGACGGTT CTGCAGCGCC GCCGCCGCGT GGTGATTCTC GCGGAAACCA GGGCTCGCGA
CCGCAGAACG AGCGCCGGCC TGCCGCTCCC AAGCCGGAGA GCCAGGGTGC TTTTGGCGCA
GCGCTGGCTG AGGCGATGAA GCGAAAATAA
 
Protein sequence
MAADLRFLAA RISAEINARP EQAKAAIELL DEGATVPFIA RYRKEVTGGL DDTQLRNLAE 
RLVYLRELEA RRAAIVESIT GQGKMTDELM GKVAGAETKA ELEDLYLPYK PKRRTRAEIA
RERGLGPLAE TILADRGREP AALAEGFVTA DVPDVKTALE GARDIIAEGI AENADLLGRL
RAHMRQASLL KAKVVDGKQA TGEKFSDYFD HSERWATAPG HRALAMLRGW NEEVLTLTIE
ADAETASPNK PVERMIAVAY EIGASRPGDR WLMEVASWTW RVKLSMSLSL DLMRELRERA
EEEAIHVFAR NLKDLLLAAP AGSRATMGLD PGIRTGVKVA VVDGTGKVVA TSTVYPFQPR
NDVRGAQIEL ASLIRKHNVE LISIGNGTGS RETEKLVADM LAELPAPKPT KVIVSEAGAS
VYSASATAAA EFPDLDVSLR GAVSIARRLQ DPLAELVKIE PKSIGVGQYQ HDVDQQKLSR
SLDAVVEDAV NAVGVDLNTA SAPLLSRVSG LGPSIADAIV RHRDSEGRFE TRRDLLKVAR
LGGRTFEQCA GFLRIPNGKE PLDASSVHPE AYGVAKKIVA ACGRDLRALM GDSAMLKSVD
PRQFIDEKFG LPTVRDIISE LEKPGRDPRP SFKTAAFAEG VNEISDLKPG MMLEGTVTNV
AAFGAFVDIG VHQDGLVHVS QLADRFVKDP HEVVKAGDVV KVRVVEVDAK RKRIALSMKR
DDGSAAPPPR GDSRGNQGSR PQNERRPAAP KPESQGAFGA ALAEAMKRK