Gene Smed_0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0416 
Symbol 
ID5321250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp448238 
End bp449968 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content67% 
IMG OID640789351 
Productpseudouridine synthase 
Protein accessionYP_001326108 
Protein GI150395641 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.561279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCA AAGACAAGCC CACGAGGCCC GGCGGCAAGG CCAAGGGTCG CGACACGAAA 
CCTCATTCCG GCGGCGAGAA GGCTGCAGTG CGGGCAGCCG GTAGCGCGCC GGCCGCGTCG
GCTGCGGGTG CGGAACAGCC ACAGCGCATC TCGAAGATAC TCTCGCGCGC AGGCGTTGCA
TCGCGCCGCG ACGTCGAGCG GATGATCATG GAAGGGCGCG TCAGTCTCAA TGGCCTCGTG
CTCGACACAC CGGTGGTCAA CGCGACGCTC GCCGACAGGA TCGAAGTGGA CGGCCATCCG
ATCCGTGGCA TCGAACGGAC GCGGCTTTGG CTCTATCACA AGCCGGGCGG GCTCGTGACG
ACCAATGCCG ATCCGGAAGG CCGGCCGACG GTCTTCGAAA ACCTGCCCGA GGATCTGCCG
CGCGTTCTGT CGATCGGCCG GCTGGACATC AACACCGAAG GCCTGCTGCT CCTGACGAAT
GACGGCGGGC TTGCCCGCGT GCTTGAGCTG CCGTCGACAG GCTGGCTGCG CCGCTACCGC
GTGCGCGCAC ACGGGGAAAT CGATCAGGCG GCGCTCGACC GCCTGAAAGA GGGTATCGCG
GTCGAAGGTG TCCTTTACGG GGCGATCGAG GCTACTCTGG ATCGGGTACA GGGCTCGAAC
GTCTGGATCA CTATGGGCCT GCGCGAAGGC AAGAATCGTG AGATCAAGAA CGTCCTTGGC
GCACTTGGCC TCGACGTAAA CCGACTGATC CGCATCTCCT ATGGTCCGTT CCAGCTCGGC
GAGTTGCCGA TTGGTCAGGT TCAGGAGATT CGCGGCCGAA CGCTGCGCGA ACAGCTGGGC
CCCCGGCTGA TCGCCGACGC CAAAGCGAAT TTCGACGCGC CGATCTATAA CGACCAGTCG
ACTGCTGCCG AGCCTGAAGC AGAGCCTGTC GCCGATCACG GCAAGGCGGA ATGGGGCAGC
AAGCGCGAGA AGGCCGGAGA CAAGCGCGAG CGTGCTCTTG CCCGCCTCGA CACCCGCCGC
GACGATGGCC GACCGAACGA CCGAGGCCGC AGTCGTGGAA AGGCGGAGGA ACGTCCTGCT
CGCCCGCCTC TGAGACGCAA CCGCTCTTCC AACGTCTGGA TGGCGCCGGG TGCGCGTCCG
ACGGTCGAGA AAAAGCCGAA GGCTGCGGAA GACGAGCTTT CTCCGAAATC CACGCGGCGC
GCGCCGGCGG ACGTTAAACG CCGCGACCGT GCTAAGGACC CCGCCACGGC GAAGGCAGCC
GGCTTTGATG AGGAGCGTAA GGGCAAGGCG GGTAAATCCG CCGGCCGCAA GGACGCCGCC
GGTGGCTTCG AAAGGCGTCG CCCGCTCGAA GGCGCAGAGC GGAAATCACG CGACCATGGC
GATCGTCCGC CGCGCCGTCC GGAGGGGGAG CGGGCTGCAC GCGACCCGGC CGATGACCGC
GCGTCTTCCG GCGGCCGCAA GGAGACCGCT CGGCCACCGC GCGGCGACGG CAAGAAGCGC
TTTGCCGACG ACCGGCCACG GGCGAACGCT GGCGGCAAGC CGGCGGCGGC AAACCCCTCC
GCCAATCGCT CGTCCGAATC GCGCGCCGGA AAACCCGCAG GCACCAAACC CTCCGGCGGC
AGGCGGCCCG GTGGAGGCAA GCCCGGTAAT CGCCCGGGTG GCGGCAAACC CGGTAATCGC
CCTGGTGGCG GCAAATCCTC GGGCAAGGGA CCGCAGGGCA GGGGGAAGTA G
 
Protein sequence
MTSKDKPTRP GGKAKGRDTK PHSGGEKAAV RAAGSAPAAS AAGAEQPQRI SKILSRAGVA 
SRRDVERMIM EGRVSLNGLV LDTPVVNATL ADRIEVDGHP IRGIERTRLW LYHKPGGLVT
TNADPEGRPT VFENLPEDLP RVLSIGRLDI NTEGLLLLTN DGGLARVLEL PSTGWLRRYR
VRAHGEIDQA ALDRLKEGIA VEGVLYGAIE ATLDRVQGSN VWITMGLREG KNREIKNVLG
ALGLDVNRLI RISYGPFQLG ELPIGQVQEI RGRTLREQLG PRLIADAKAN FDAPIYNDQS
TAAEPEAEPV ADHGKAEWGS KREKAGDKRE RALARLDTRR DDGRPNDRGR SRGKAEERPA
RPPLRRNRSS NVWMAPGARP TVEKKPKAAE DELSPKSTRR APADVKRRDR AKDPATAKAA
GFDEERKGKA GKSAGRKDAA GGFERRRPLE GAERKSRDHG DRPPRRPEGE RAARDPADDR
ASSGGRKETA RPPRGDGKKR FADDRPRANA GGKPAAANPS ANRSSESRAG KPAGTKPSGG
RRPGGGKPGN RPGGGKPGNR PGGGKSSGKG PQGRGK