Gene Rsph17029_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1072 
Symbol 
ID4895888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1104948 
End bp1105997 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content72% 
IMG OID640111659 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001042955 
Protein GI126461841 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0863187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.714428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCC TTCCGCCCCA CGGGGGCAAA CTGAGCATCA CGATCGGAGA GGATCCTCCC 
GACCGTCTTG ATAAGGCGCT CGTGCGCGAG GCGCCAGAGG AGGCCGCGCT GTCGCGCTCG
CGGCTGATGA AGCTGATCGG CGAGGGCGCG GTGCGGCTCG AGGGCGCTCC CGTGACGGAT
CCGAAGGCGA AGGTCGCCGA GGGGCAGGTC TACGAGATCG CGCTCGATGC GCCGGCCGAG
GTGGAGGCCC GCCCCGAGGC GATCCCGCTC TCGGTCGTCT GGGAGGACGA AGACCTCATC
GTCATCGACA AGCCGGTGGG GATGGTGGTC CATCCCGCGC CCGGTCAGTG GACGGGGACG
CTGGTCAATG CGCTTCTCCA CCATTGCGGC GAGAGCCTTT CGGGCATCGG CGGGGAGAAG
CGCCCGGGCA TCGTCCACCG GATCGACAAG GACACGTCGG GGCTTCTCGT GGTGGCGAAG
ACCGACCGGG CGCATCAGGG CCTTGCGGCG CAGTTCGAGG CGCATACGGT CGAGCGGCGC
TATCTCGCGC TGGTGCATGG CGTGCCCGAG GTCTCGGACC CGCGGCTGCG CGGCGTGCGC
GGCACGAGCT TCGAGCCGGG CGGCGTGCTG CGGATCGCCA CCGGCCTCGC CCGCCACCGC
ACCGACCGGC AGCGGCAGGC GGTCACCTTC GAGGGCGGGC GTCATGCCGT GACCCGGGCG
CGGCTGCTCG AGCGGTTCGG CACGCCGCCG GTGCTGGCGC TCGTCGAATG CCGGCTCGAG
ACGGGGCGCA CGCACCAGAT CCGCGTGCAT ATGGCCCATG CGGGCCACGG GCTGATCGGC
GACCAGACCT ATGGCGGCAG GCGCAAGCTC TCGCCGAAGG CACTGGGGCC CGAGGCCGCG
GCGGCGGCGG AAGCCTTCCC GCGGCAGGCG CTCCATGCGG CGAGCCTCGG CTTCCGCCAT
CCGGTGAGCG GCGAGGAACT GAGCTTCGAG AGCCCCTTGC CCGCGGATAT GGCGGGCCTC
CTGTCCCTCC TGCCGCGGAT GCAAGGGTAA
 
Protein sequence
MAILPPHGGK LSITIGEDPP DRLDKALVRE APEEAALSRS RLMKLIGEGA VRLEGAPVTD 
PKAKVAEGQV YEIALDAPAE VEARPEAIPL SVVWEDEDLI VIDKPVGMVV HPAPGQWTGT
LVNALLHHCG ESLSGIGGEK RPGIVHRIDK DTSGLLVVAK TDRAHQGLAA QFEAHTVERR
YLALVHGVPE VSDPRLRGVR GTSFEPGGVL RIATGLARHR TDRQRQAVTF EGGRHAVTRA
RLLERFGTPP VLALVECRLE TGRTHQIRVH MAHAGHGLIG DQTYGGRRKL SPKALGPEAA
AAAEAFPRQA LHAASLGFRH PVSGEELSFE SPLPADMAGL LSLLPRMQG