Gene Rleg_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0522 
Symbol 
ID8015451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp544092 
End bp546188 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content69% 
IMG OID644823113 
ProductRNA-binding S4 domain protein 
Protein accessionYP_002974366 
Protein GI241203270 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.575781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCCA AAGACAAGCC AAAACGCCCG GGCGCAAAGC CCCTTTCACG GGATATCAGG 
TCGAAAGCCG GCCCGAAGGC GGACGGCGAC AAGCCGGCAA AGCCTGCGGT CGCACGCGCA
ATCGCCGCCG AAACCGATAG TGACGCCAAG GCCGAACGCA TTTCCAAGGT GATGGCGCGT
GCCGGCGTCG CCTCCCGCCG TGACATCGAA CGCATGATCA TGGAAGGCCG TGTGACGCTG
AACGGCAGGG TGCTCGACAC CCCCGTCGTC AACGTCACGC TCGCCGATCG CATCGAAGTC
GACGGCGTGC CGATCCGCGG CATCGAGCGC ACCAGACTGT GGCTCTATCA TAAGCCCACC
GGCCTGGTGA CCACCAATGC CGATCCGGAA GGCCGCTCGA CCGTCTTCGA CAACCTGCCG
GAAGAATTGC CGCGCGTCAT GTCGATCGGC CGCCTCGATA TCAACACCGA GGGCCTGCTG
CTCTTGACCA ATGACGGCGG TCTCGCCCGC GCGCTCGAGC TGCCGGCGAC CGGCTGGCTG
CGGCGTTACC GTGTCCGCGC CCATGGCGAG ATCGATCAGG ACGCGCTCGA CAAGCTGAAG
GACGGCATTG CCGTCGACGG CGTGCTCTAC GGCTCGATCG AGGCGACGCT CGACCGCACG
CAGGGCTCGA ACGTCTGGAT CACCATGGGT CTTCGCGAAG GCAAGAACCG CGAAATCAAG
AACGTTCTCG GTGCGCTCGG CCTCGACGTC AATCGCCTGA TCCGCATTTC CTATGGCCCG
TTCCAGCTTG GCGACCTGCC GGAAAGCCAT GTCGTCGAAG TGCGCGGCCG CACACTGCGC
GACCAGCTCG GCCCGCGCCT CATCGAGGAA GCCAAGGCGA ATTTCGACGC GCCGATCTAC
AATACGACGG CGGTCGCTGC CGAGGAAGAG GCAGAGCCGG CAGCACCGGA AAAGCGCGAG
CGTCCGCGTC GCGACGAGGA CAAGCGCGAA CGGGCGCTGA GCCGTCTCGA TACGAAGCGC
GACGACCGCC ACGGTGGGGC GCGCAAAGAG GACGATCGCC GCGACGGCGG CCGCAGGGAC
GACGAGAAGC CGAAGCGTCC CCAGCCGCTC GGCCAGCGCC GCAGCGCCAA TGTCTGGATG
GCGCCGGGCG CCCGGCCGCT CGGCGAAAAG GCAGCGGCGA AAGCCGCCAA GAATGCGCAG
ACTGCGCGCC GGCGCGGCGA GCAGGCGCCG GCAAAGAATG ATCGCATCGA GGATCGCCCG
CGCACGCAGG TGAACCGCGT GCGCGAAGAG GATGGCGAAT GGATCCGCTC GAGCGAGCAG
CCCCGCGGCA AGGATGAGGG CGAAGGCTTC GGCCGCAAGC GCGGCTTTGG TGATCGCCCC
GCCCGCGAAG ACCGCGGTTC TGGCGACCGC CCGGCGCGCG GCGACCGGCC GTTCGGTGAT
CGTCCCCCGC GTGGTGATCG GCCCTTCGGC GACAAGCCGC GCGGCGATCG CAAGCCGCGT
GCGGATGGCG ACGAGCGTCC TCGTGCCGCG AGAAGCCCTG CCGGTGAGGG CCGTTCTGAG
CGTCCGCGCG GCGACCGCCC CTTCGGCGAC CGTCCTTCGC GTGGAGACCG CCCCTTCGGC
GACAAGCCGC GCGGTGATCG CAAGCCGCGT GCGGATGGCG ACGAGCGTCC TCGTGCCGCC
AGAACCTCCC CCGGTGAGGG CCGTTCTGAG CGTCCGCGCA GCGACCGCCC CTCCGGTGAT
CGTCCTTCAC GTGGAGACCG GCCCTTCGGT GATAAGCCGC GTGGCGATCG CAGGCCGCGC
GAGGACGGTG ATGAGCGTCC GCGGGCAGCC AGAAGCTTTG CCGGCGAGGG CCGCTCCGAG
CGTCCGCGCG GTGACAAGCC TTCGGGTGAC AGGCCTTCTG GAGACAGGCC GCGCGGCAAG
GGCTTTGCGG CCAAGCCCAG CGGGGCGAAA CCTGGTGGGG CCAAGAGTTT TTCCGGCAAG
CCGAAAGGCA CCAAGCCGGG CGGGGACAGA CCGGGCGGCG ACAGGCCTGC GGGCGGGCCG
TCCAGAGGTG GTGCTAAAGG AAAAGGAATG ACGCGCGGTG CGGATCGTCG GCGGTGA
 
Protein sequence
MTPKDKPKRP GAKPLSRDIR SKAGPKADGD KPAKPAVARA IAAETDSDAK AERISKVMAR 
AGVASRRDIE RMIMEGRVTL NGRVLDTPVV NVTLADRIEV DGVPIRGIER TRLWLYHKPT
GLVTTNADPE GRSTVFDNLP EELPRVMSIG RLDINTEGLL LLTNDGGLAR ALELPATGWL
RRYRVRAHGE IDQDALDKLK DGIAVDGVLY GSIEATLDRT QGSNVWITMG LREGKNREIK
NVLGALGLDV NRLIRISYGP FQLGDLPESH VVEVRGRTLR DQLGPRLIEE AKANFDAPIY
NTTAVAAEEE AEPAAPEKRE RPRRDEDKRE RALSRLDTKR DDRHGGARKE DDRRDGGRRD
DEKPKRPQPL GQRRSANVWM APGARPLGEK AAAKAAKNAQ TARRRGEQAP AKNDRIEDRP
RTQVNRVREE DGEWIRSSEQ PRGKDEGEGF GRKRGFGDRP AREDRGSGDR PARGDRPFGD
RPPRGDRPFG DKPRGDRKPR ADGDERPRAA RSPAGEGRSE RPRGDRPFGD RPSRGDRPFG
DKPRGDRKPR ADGDERPRAA RTSPGEGRSE RPRSDRPSGD RPSRGDRPFG DKPRGDRRPR
EDGDERPRAA RSFAGEGRSE RPRGDKPSGD RPSGDRPRGK GFAAKPSGAK PGGAKSFSGK
PKGTKPGGDR PGGDRPAGGP SRGGAKGKGM TRGADRRR