Gene Rcas_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0166 
Symbol 
ID5537627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp203138 
End bp204808 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content65% 
IMG OID640892330 
Productpseudouridine synthase 
Protein accessionYP_001430318 
Protein GI156740189 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGCAG AACGATTGCA GAAAGTGCTG GCAAGCGCCG GCATTGCGTC GCGGCGTGAC 
TGTGAAGAGT ACATTGCCGC AGGGCGCGTC ATGGTCAACG GCAAAGTGGT CCGCATCCCC
GGTACGCGCG TGGACCCGGA ACACGATGAG ATTCTCGTTG ATGGAAAGCC GATCGGTAAG
ATTCACCACC GCACCTATGT CATGCTCCAC AAACCGGCAG GTGTGGTCTC GACAACCGAT
GATCCGCACG GTCGTCCAAC AGTGGTGGAT TTGGTCAATC TGCCACAGCG GTTGTTTCCG
GTCGGACGGC TCGATTACGA CTCAGAAGGC TTGCTGCTGT TGACCGACGA TGGTGAACTA
ACTCAGAAGC TGACCCATCC CAGCTATCAG GTCGAGAAAG AGTATCAGGT CCTGTTGAAT
GACGCGCCAT CGCCGAATGC GCTGCGCGCC TGGCGCACCG GTGTGGAACT GGACGGCGTG
AAAACAGCGC CTGCCTGGGT CGAACTGATC GAGCGCACAC CCGAAGGCGC ATGGGTGCGC
GTGATTTTAC ACGAGGGACG CAAGCGCCAG ATTCGCGAGG TGGCGCGATT GCTGGGGTAC
GAGGTGCGGC GGTTGATCCG TGTGCGCGAA GGTCCGCTGG CTCTGGGAGA CCTCCCAAGC
GGCACGTGGC GCTTTCTTAC CGATGAAGAG GTTGACATGC TGCGAGAGCA CGCCGAACGA
AACGCGGCGG TCGCCGATGC GGAACGGCCG CGTCGGCGTG AACAGGACGA GATGAAAGCA
GTTGGCGGTC GACGGTTGCG GCGTATCAAT CCGTCGGCGC GATTGCTACA GAAAGGTGAA
AAAGTGACAG AGACAACTCT GCCCGAACTC GACGAAGCGA TTGGTAGCGA GAAGGGTATT
CCGTCGATTG GCGAGGAGGG GCGTAAGGTG CGTGATAAAG AGGCGCCCCA TCGCAACGCC
TCTCCAGCGC GCGACTTCCG CCGCGATGAC CGTGGCAGTG GCTTCCGCAG TGACCGCCGC
AACGACCGTG GCAGTGGACC GCGCGACTTC CGCCGCGACG ACCGTGGCAA TGGACCGCGC
GACTTCCGCC GCGACGACCG TGGCAATGGA CCGCGCGACT TCCGCCGCGA TGACCGTGGC
AATGGACCGC GCGACTTCCG CCGCGATGAC CGCAGCAGTG GCTTCCGCAG CGACCGCCGC
GATGACCGTG GCAGTGGCTT CCGCAGTGAC CGCCGCGACG ACCGCAGCAG CGGCTTCCGC
AGTGACCGCC GCGACGACCG CAGCAGCGGC TTCCGCAGTG ACCGCCGCGA CGACCGCAGC
AGCGGCTTCC GCAGTGACCG CCGCGACGAC CGCAGCAGCG GCTTCCGCAG TGACCGCCGC
GACGACCGCA GCAGCGGCTT CCGCAGTGAC CGCCGCGACG ACCGCAGCAG CGGCTTCCGC
AGTGACCGCC GCGATGACCG CAGCAGCGGC TTCCGCAGTG ACCGCCGCGA TGACCGCAGC
AGCGGCTTCC GCAGTGACCG CCGCGACGAC CGCAGCAGCG GCTTCCGCAG TGACCGCCGC
GACGACCGCA GCAGTGGCTT CCGCAGTGAC CGCCGCGATA ACCGTGGGGC TAGAGGCTTT
TCGCGCAGTG ATCGCCCTCC ACCGCGCTAC TCCGAAGATG ATGAGGAATA G
 
Protein sequence
MSAERLQKVL ASAGIASRRD CEEYIAAGRV MVNGKVVRIP GTRVDPEHDE ILVDGKPIGK 
IHHRTYVMLH KPAGVVSTTD DPHGRPTVVD LVNLPQRLFP VGRLDYDSEG LLLLTDDGEL
TQKLTHPSYQ VEKEYQVLLN DAPSPNALRA WRTGVELDGV KTAPAWVELI ERTPEGAWVR
VILHEGRKRQ IREVARLLGY EVRRLIRVRE GPLALGDLPS GTWRFLTDEE VDMLREHAER
NAAVADAERP RRREQDEMKA VGGRRLRRIN PSARLLQKGE KVTETTLPEL DEAIGSEKGI
PSIGEEGRKV RDKEAPHRNA SPARDFRRDD RGSGFRSDRR NDRGSGPRDF RRDDRGNGPR
DFRRDDRGNG PRDFRRDDRG NGPRDFRRDD RSSGFRSDRR DDRGSGFRSD RRDDRSSGFR
SDRRDDRSSG FRSDRRDDRS SGFRSDRRDD RSSGFRSDRR DDRSSGFRSD RRDDRSSGFR
SDRRDDRSSG FRSDRRDDRS SGFRSDRRDD RSSGFRSDRR DDRSSGFRSD RRDNRGARGF
SRSDRPPPRY SEDDEE