Gene TM1040_1927 details

Gene Information

Locus tag: TM1040_1927
Symbol:
ID: 4076878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2026097
End bp: 2027365
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 65%
IMG OID: 638007243
Product: pseudouridine synthase, Rsu
Protein accession: YP_613922
Protein GI: 99081768
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
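
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2026097, 2027365)
print(length)                           # 1269
print(expected_protein_length(length))  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields.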


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.975227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.86306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCGC CGCTCTCGCG AATAGGCTAT GGCAGCGCCA TGAGCAAGAA AACCCCATCC 
AAACCGCCCG CGCGCCGCGC CGCCAACGCT GCCCGGCCAA AGGCCACCCC CTCCGCGACG
CCGCGCGCAC ACAAGGGCCT GCAGGCAGGC GCACCCGCAG ACGCGCGGCC CTCGCAAGAC
GGTGACCGCA TCGCCAAGGT GCTGTCGCGC GCAGGCGTGG CCTCTCGTCG CGAGGCGGAA
CGGATGATTG CGGAGGGTCG CGTTGCGGTC AATGGCAAGG TCATCGACAG CCCCGCGCTC
AATGTGACCA ATCTGGATCG CATCGTGGTG GATGGCAATC CGCTGCCCGA GGCCGAGGCC
CCGCGGCTTT GGCTCTATTA CAAGCCCAAC GGGTTGGTCA CCACGACCTC CGACGAGAAA
GGGCGCAAGA CGATCTTTGA CGCATTGCCC GAGGACCTGC CGCGGGTGAT GACAGTGGGC
CGGCTTGACC TCAATTCCGA AGGCCTGCTG CTGCTGACCA ACGACGGCGG CGTCAAACGC
CAGTTGGAGC TGCCCTCGAC AGGCTGGCTA CGCCGCTACC GCGTGCGGAT CAATGGCCGC
CCGCAGGATC ATGAATTTGA CGTGCTGCGC AAAGGCGTGG TGATCGACGG CGAGCGTTTT
CAGCCGATGA CCATCTCGCT CGATCGCCAG CAAGGCGCCA ACGCATGGCT CACCATCGGG
CTGCGCGAAG GCAAGAACCG CGAAATTCGC CGCGCCATGG AAGAGGTTGG CTATCCGGTA
AACCGGCTTC TGCGGATCTC TTATGGGCCG TTCCAGCTCG GCACGCTCAA GGAAGGCGAA
GTAGAGGAGT TGCGCCCACG CGTGGTGCGG GATCAACTGG GTCTCGCCGC ACCCGAAGGG
GATGGCGAAG CCAAGAAGAA ACCCACCCGC CCCGGACGCG GCCCGCGCAA TCCTGGCGCA
AAGACTGCTG GAAAAGGCAT CGGCGCGCCC AACGCTCGCC CGACAGGTAA AGGCGCAGTC
AAAGCCCCCG AAGGCAGAGT TTTTTCTGGC AAGACCTTCT CGGGCAAAAC CACCGGCAAA
ACCGCTGGCA AACCTGCGGC TCATAAGGGC CAGGGCGGCA AACCCTCGGG TCCCAAGAGT
TTTGGGACCA AACAGGGCGG CGGCTCGGGC GCGCGAGGAT CCGGTAAACC CAATGGCGCA
TCCGGTGGGG CATCCGGCGG CAAATTCAGC CCCGGCGGGC GCCGTTTTGG CAAAAAACCG
CAGGACTGA
 
Protein sequence
MLSPLSRIGY GSAMSKKTPS KPPARRAANA ARPKATPSAT PRAHKGLQAG APADARPSQD 
GDRIAKVLSR AGVASRREAE RMIAEGRVAV NGKVIDSPAL NVTNLDRIVV DGNPLPEAEA
PRLWLYYKPN GLVTTTSDEK GRKTIFDALP EDLPRVMTVG RLDLNSEGLL LLTNDGGVKR
QLELPSTGWL RRYRVRINGR PQDHEFDVLR KGVVIDGERF QPMTISLDRQ QGANAWLTIG
LREGKNREIR RAMEEVGYPV NRLLRISYGP FQLGTLKEGE VEELRPRVVR DQLGLAAPEG
DGEAKKKPTR PGRGPRNPGA KTAGKGIGAP NARPTGKGAV KAPEGRVFSG KTFSGKTTGK
TAGKPAAHKG QGGKPSGPKS FGTKQGGGSG ARGSGKPNGA SGGASGGKFS PGGRRFGKKP
QD
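
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below is illustrative, not the method the database used. It builds the codon table from the standard NCBI amino acid string, which translation table 11 shares for all codon assignments (table 11 differs only in its permitted start codons; this gene starts with ATG, so no special start handling is included). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGCTTTCGC CGCTCTCGCG AATAGGCTAT"))  # MLSPLSRIGY
print(translate("CAGGACTGA"))                         # QD
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.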