Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1072 |
Symbol | |
ID | 4895888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1104948 |
End bp | 1105997 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640111659 |
Product | RluA family pseudouridine synthase |
Protein accession | YP_001042955 |
Protein GI | 126461841 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0863187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.714428 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATCC TTCCGCCCCA CGGGGGCAAA CTGAGCATCA CGATCGGAGA GGATCCTCCC GACCGTCTTG ATAAGGCGCT CGTGCGCGAG GCGCCAGAGG AGGCCGCGCT GTCGCGCTCG CGGCTGATGA AGCTGATCGG CGAGGGCGCG GTGCGGCTCG AGGGCGCTCC CGTGACGGAT CCGAAGGCGA AGGTCGCCGA GGGGCAGGTC TACGAGATCG CGCTCGATGC GCCGGCCGAG GTGGAGGCCC GCCCCGAGGC GATCCCGCTC TCGGTCGTCT GGGAGGACGA AGACCTCATC GTCATCGACA AGCCGGTGGG GATGGTGGTC CATCCCGCGC CCGGTCAGTG GACGGGGACG CTGGTCAATG CGCTTCTCCA CCATTGCGGC GAGAGCCTTT CGGGCATCGG CGGGGAGAAG CGCCCGGGCA TCGTCCACCG GATCGACAAG GACACGTCGG GGCTTCTCGT GGTGGCGAAG ACCGACCGGG CGCATCAGGG CCTTGCGGCG CAGTTCGAGG CGCATACGGT CGAGCGGCGC TATCTCGCGC TGGTGCATGG CGTGCCCGAG GTCTCGGACC CGCGGCTGCG CGGCGTGCGC GGCACGAGCT TCGAGCCGGG CGGCGTGCTG CGGATCGCCA CCGGCCTCGC CCGCCACCGC ACCGACCGGC AGCGGCAGGC GGTCACCTTC GAGGGCGGGC GTCATGCCGT GACCCGGGCG CGGCTGCTCG AGCGGTTCGG CACGCCGCCG GTGCTGGCGC TCGTCGAATG CCGGCTCGAG ACGGGGCGCA CGCACCAGAT CCGCGTGCAT ATGGCCCATG CGGGCCACGG GCTGATCGGC GACCAGACCT ATGGCGGCAG GCGCAAGCTC TCGCCGAAGG CACTGGGGCC CGAGGCCGCG GCGGCGGCGG AAGCCTTCCC GCGGCAGGCG CTCCATGCGG CGAGCCTCGG CTTCCGCCAT CCGGTGAGCG GCGAGGAACT GAGCTTCGAG AGCCCCTTGC CCGCGGATAT GGCGGGCCTC CTGTCCCTCC TGCCGCGGAT GCAAGGGTAA
|
Protein sequence | MAILPPHGGK LSITIGEDPP DRLDKALVRE APEEAALSRS RLMKLIGEGA VRLEGAPVTD PKAKVAEGQV YEIALDAPAE VEARPEAIPL SVVWEDEDLI VIDKPVGMVV HPAPGQWTGT LVNALLHHCG ESLSGIGGEK RPGIVHRIDK DTSGLLVVAK TDRAHQGLAA QFEAHTVERR YLALVHGVPE VSDPRLRGVR GTSFEPGGVL RIATGLARHR TDRQRQAVTF EGGRHAVTRA RLLERFGTPP VLALVECRLE TGRTHQIRVH MAHAGHGLIG DQTYGGRRKL SPKALGPEAA AAAEAFPRQA LHAASLGFRH PVSGEELSFE SPLPADMAGL LSLLPRMQG
|
| |