Gene Information |
Locus tag | Rsph17029_0940 |
Symbol | |
ID | 4895386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 967378 |
End bp | 968847 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640111525 |
Product | RNA-binding S4 domain-containing protein |
Protein accession | YP_001042823 |
Protein GI | 126461709 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
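
The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(967378, 968847)
print(length)                     # 1470
print(protein_length_aa(length))  # 489
```

Under these assumptions the computed values agree with the listed Gene Length (1470 bp) and Protein Length (489 aa).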
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.811015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.643203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA GCAAAGACGG CGAGCGCATC GCCAAGGTTC TCTCTCGTGC GGGCATCGCC TCGCGCCGCG ACGCCGAGCG GATGATCGAG CTTGGCCGCA TCGCGGTGAA CGGGCGCACC ATCGACAGCC CGGCGCTGAA TGTGGGCCCG AAGGACCGCA TCACCGTCGA CGGCCAGCCC CTGGCGCCGC CCGAGCCCGC CCGCCTGTGG CTCTATTACA AGCCCGAGGG ACTCGTCACC TCGGCCTCGG ACGAGAAGGG CCGCGAGACG GTCTTCGACC ACCTGCCCGA GGGGATGCCC CGCGTCATGT CGGTGGGACG GCTCGATCTC AATTCCGAGG GTCTCCTCCT CCTGACCAAT GACGGCGAAC TGAAGCGCCG GCTCGAGCTG CCCTCCACCG GCTGGCTGCG CAAGTACCGC GTGCGGGTGA AGGGCAACCC GACCGACGCC GACCTCGAGC CGCTGCGCAA GGGGATCACG GTCGAGGGCG AGAGCTTCCA GCCGATGACC GTCTCGCTCG ACCGGGTGCA GGGCGCCAAC GCCTGGCTCA CGGTGGGCCT CCGCGAGGGC AAGAACCGCG AGATCCGCCG CGCCATGTCC GCCCTGGGCC TCACCGTGAA CCGGCTGATC CGCGTGAGCT ACGGTCCCTT CCGGCTGAAC GAGCTCGAGC CCGGCATGGT CGAGGAGGTG CGCCCCAAGA TCCTGCGCGA CCAGCTCGGC CTCGATCCGC ATGCCGACGG CGAGGCGCGC CCGGCCAGGG GACGGAAGCC CGAAGGTGCC GCACGAGCCG AAGGCGCCGC ACGCTCCGAA AGAACCGCGC GTCCGCCGCG CGGGGCCGCG GCGGAGGGAC CCACCCGCCC GGCTCGGGCA GCGGGGTCCG AAGGCTCCGC AGGCGCCACC AGAGGGCGCT CGCCGCAGGG GGCTGCGCCC AAGGCAGGGG CGCCCAAGGA CGGGGCACGC TTCGCAGGCA AAGGCCCGGC CGAGGGTGCG CGCCCGGCGC GGAGCGCCGC AACGGACCCC GCGCGCGCCT CCCGACCAGC GAGCACCGAG ACCACCGGAC GCACCCGAAC GGCGGAGGGC CGAGGCAAAC CCGCCGCCGC AGGACGGTCC GGCAAGACCG AGGCGGCCGA CGCGCCGCGT GCGTTCCGCA GCCGCAGCGC CGAGGCGCCC GCCAAGGGTC CGCGCGGTCC GGCCAAGGCC GCGGGTCCGG CCCGCAGCCG GCGAGAAGCC CTGCCCGACA CGCCGATCGA CGCCCGTGCG CCGCGGGGTC GCGCCGCGGC GGGTCCGAAG GCCAAGGGCG GCAGTCCGGC CAAGGCCGCC CCCACTCCGC GCGCCAAGGG AGCGGGCGAG ACCGCCGGCG GACGGGGCCC CCGGACGGGC GGGCCGTCTG CCCAGGGGGA CCGGAAACCG CAGCCCACAG GCCCCCGCCC GCCCTCGCGC GGACCGAAAG GCCGGCCCGG CGGGCGCTGA
|
Protein sequence | MTDSKDGERI AKVLSRAGIA SRRDAERMIE LGRIAVNGRT IDSPALNVGP KDRITVDGQP LAPPEPARLW LYYKPEGLVT SASDEKGRET VFDHLPEGMP RVMSVGRLDL NSEGLLLLTN DGELKRRLEL PSTGWLRKYR VRVKGNPTDA DLEPLRKGIT VEGESFQPMT VSLDRVQGAN AWLTVGLREG KNREIRRAMS ALGLTVNRLI RVSYGPFRLN ELEPGMVEEV RPKILRDQLG LDPHADGEAR PARGRKPEGA ARAEGAARSE RTARPPRGAA AEGPTRPARA AGSEGSAGAT RGRSPQGAAP KAGAPKDGAR FAGKGPAEGA RPARSAATDP ARASRPASTE TTGRTRTAEG RGKPAAAGRS GKTEAADAPR AFRSRSAEAP AKGPRGPAKA AGPARSRREA LPDTPIDARA PRGRAAAGPK AKGGSPAKAA PTPRAKGAGE TAGGRGPRTG GPSAQGDRKP QPTGPRPPSR GPKGRPGGR
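
The sequences above are printed in space-separated blocks of 10 letters, so the spaces have to be removed before any computation. The following is a minimal sketch of stripping that grouping and computing a GC fraction. The helper names are hypothetical, and whether the GC content field above was derived in exactly this way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo input.
demo = "ATGACCGACA GCAAAGACGG"
print(clean_sequence(demo))  # ATGACCGACAGCAAAGACGG
print(gc_fraction(demo))
```

Passing the full Gene sequence field to `gc_fraction` would give a value that can be compared with the GC content listed in the gene information.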