Gene Information |
Locus tag | Smed_0416 |
Symbol | |
ID | 5321250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 448238 |
End bp | 449968 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640789351 |
Product | pseudouridine synthase |
Protein accession | YP_001326108 |
Protein GI | 150395641 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
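The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 448238
end_bp = 449968

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1731
print(protein_length)  # 576
```

Both results agree with the Gene Length (1731 bp) and Protein Length (576 aa) fields reported in the record.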
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.561279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCCA AAGACAAGCC CACGAGGCCC GGCGGCAAGG CCAAGGGTCG CGACACGAAA CCTCATTCCG GCGGCGAGAA GGCTGCAGTG CGGGCAGCCG GTAGCGCGCC GGCCGCGTCG GCTGCGGGTG CGGAACAGCC ACAGCGCATC TCGAAGATAC TCTCGCGCGC AGGCGTTGCA TCGCGCCGCG ACGTCGAGCG GATGATCATG GAAGGGCGCG TCAGTCTCAA TGGCCTCGTG CTCGACACAC CGGTGGTCAA CGCGACGCTC GCCGACAGGA TCGAAGTGGA CGGCCATCCG ATCCGTGGCA TCGAACGGAC GCGGCTTTGG CTCTATCACA AGCCGGGCGG GCTCGTGACG ACCAATGCCG ATCCGGAAGG CCGGCCGACG GTCTTCGAAA ACCTGCCCGA GGATCTGCCG CGCGTTCTGT CGATCGGCCG GCTGGACATC AACACCGAAG GCCTGCTGCT CCTGACGAAT GACGGCGGGC TTGCCCGCGT GCTTGAGCTG CCGTCGACAG GCTGGCTGCG CCGCTACCGC GTGCGCGCAC ACGGGGAAAT CGATCAGGCG GCGCTCGACC GCCTGAAAGA GGGTATCGCG GTCGAAGGTG TCCTTTACGG GGCGATCGAG GCTACTCTGG ATCGGGTACA GGGCTCGAAC GTCTGGATCA CTATGGGCCT GCGCGAAGGC AAGAATCGTG AGATCAAGAA CGTCCTTGGC GCACTTGGCC TCGACGTAAA CCGACTGATC CGCATCTCCT ATGGTCCGTT CCAGCTCGGC GAGTTGCCGA TTGGTCAGGT TCAGGAGATT CGCGGCCGAA CGCTGCGCGA ACAGCTGGGC CCCCGGCTGA TCGCCGACGC CAAAGCGAAT TTCGACGCGC CGATCTATAA CGACCAGTCG ACTGCTGCCG AGCCTGAAGC AGAGCCTGTC GCCGATCACG GCAAGGCGGA ATGGGGCAGC AAGCGCGAGA AGGCCGGAGA CAAGCGCGAG CGTGCTCTTG CCCGCCTCGA CACCCGCCGC GACGATGGCC GACCGAACGA CCGAGGCCGC AGTCGTGGAA AGGCGGAGGA ACGTCCTGCT CGCCCGCCTC TGAGACGCAA CCGCTCTTCC AACGTCTGGA TGGCGCCGGG TGCGCGTCCG ACGGTCGAGA AAAAGCCGAA GGCTGCGGAA GACGAGCTTT CTCCGAAATC CACGCGGCGC GCGCCGGCGG ACGTTAAACG CCGCGACCGT GCTAAGGACC CCGCCACGGC GAAGGCAGCC GGCTTTGATG AGGAGCGTAA GGGCAAGGCG GGTAAATCCG CCGGCCGCAA GGACGCCGCC GGTGGCTTCG AAAGGCGTCG CCCGCTCGAA GGCGCAGAGC GGAAATCACG CGACCATGGC GATCGTCCGC CGCGCCGTCC GGAGGGGGAG CGGGCTGCAC GCGACCCGGC CGATGACCGC GCGTCTTCCG GCGGCCGCAA GGAGACCGCT CGGCCACCGC GCGGCGACGG CAAGAAGCGC TTTGCCGACG ACCGGCCACG GGCGAACGCT GGCGGCAAGC CGGCGGCGGC AAACCCCTCC GCCAATCGCT CGTCCGAATC GCGCGCCGGA AAACCCGCAG GCACCAAACC CTCCGGCGGC AGGCGGCCCG GTGGAGGCAA GCCCGGTAAT CGCCCGGGTG GCGGCAAACC CGGTAATCGC CCTGGTGGCG GCAAATCCTC GGGCAAGGGA CCGCAGGGCA GGGGGAAGTA G
|
Protein sequence | MTSKDKPTRP GGKAKGRDTK PHSGGEKAAV RAAGSAPAAS AAGAEQPQRI SKILSRAGVA SRRDVERMIM EGRVSLNGLV LDTPVVNATL ADRIEVDGHP IRGIERTRLW LYHKPGGLVT TNADPEGRPT VFENLPEDLP RVLSIGRLDI NTEGLLLLTN DGGLARVLEL PSTGWLRRYR VRAHGEIDQA ALDRLKEGIA VEGVLYGAIE ATLDRVQGSN VWITMGLREG KNREIKNVLG ALGLDVNRLI RISYGPFQLG ELPIGQVQEI RGRTLREQLG PRLIADAKAN FDAPIYNDQS TAAEPEAEPV ADHGKAEWGS KREKAGDKRE RALARLDTRR DDGRPNDRGR SRGKAEERPA RPPLRRNRSS NVWMAPGARP TVEKKPKAAE DELSPKSTRR APADVKRRDR AKDPATAKAA GFDEERKGKA GKSAGRKDAA GGFERRRPLE GAERKSRDHG DRPPRRPEGE RAARDPADDR ASSGGRKETA RPPRGDGKKR FADDRPRANA GGKPAAANPS ANRSSESRAG KPAGTKPSGG RRPGGGKPGN RPGGGKPGNR PGGGKSSGKG PQGRGK
|
| |
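The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate the DNA. It uses only the first three groups of the gene sequence as sample input. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; table 11 differs mainly in the alternative start codons it allows, which this sketch does not handle.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
# Assumption: standard amino acid assignments, as used by table 11
# for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(text):
    """Remove the whitespace between ten-character groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence from the record above.
sample = "ATGACATCCA AAGACAAGCC CACGAGGCCC"
dna = ungroup(sample)

print(len(dna))        # 30
print(translate(dna))  # MTSKDKPTRP
```

The translated sample matches the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full joined gene sequence should give a value comparable to the GC content field, although that field is rounded to a whole percent.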