Gene TM1040_0291 details

Gene Information

Locus tag: TM1040_0291
Symbol:
ID: 4077426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 297539
End bp: 298585
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 64%
IMG OID: 638005585
Product: ribosomal large subunit pseudouridine synthase C
Protein accession: YP_612286
Protein GI: 99080132
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
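
The coordinates and lengths in this record can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 297539
end_bp = 298585
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span_bp // 3
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.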


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.503835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG TGCAAATGAT CACCGTGACC GAAGACGACG GCGGCCAGCG GATCGACCGC 
TGGTTGCGGC GTCTGTTCCC GCATGTGAAC CAGGGCCGCA TCGAGAAGAT GTGTCGCAAG
GGCGAGCTGC GTCTGGATGG CGGTCGCGTC AAGGCCAACA CCCGTGTTGA GGCGGGACAG
GTTGTGCGGG TGCCCCCGCT GGCCGAGAGC GACATGAAAC CGGCCGAGGC GCGCCCGGTG
AAGATCTCGG ATGCTGACGC CAAGATGATT CGCGATTGCG TGATCTACAA GGACGACGAT
GTTCTGGTGA TCAACAAACC GGCCGGACTG GCGGTGCAGG GCGGCTCTGG CACCACCAAA
CACGTAGATG GCCTCTCGGA AGCACTGCGC TTTGACGCCG AGGACAAGCC GCGGCTGGTG
CATCGTCTCG ACAAGGACAC ATCCGGGCTC TTGGTGCTGG CGCGCAACCG CAAGGCGGCG
CAGGGGCTGA CCGCAGCCTT TCGCCACAAG AACACCCGCA AGATCTACTG GGCCTTGGTG
GCAGGCGTGC CGACGCCCTA CCTTGGCGAG ATCAAGACCG GGCTCGTAAA GGCGCCGGGA
CATGGCAAAT CCGGCGAGGG CGAAAAGATG ATCCCCGTTG ATCCGCGCGA TGTGGATGCC
ACGCCCGGGG CAAAGCGCGC GCATACCTAT TATGCCACGC TCTACCGCGT TGCGAGCCGT
GCAAGCTGGG TCGCGATGGA GCCGGTGACG GGCCGCACCC ACCAGCTGCG TGCGCATATG
GCGGGCATGG GGCATCCGAT CATTGGCGAT GGCAAATATG GCGGCTCGGG TCAGGAGAAC
CTCGGCGATG GCTGGGGCGC GCAAATCGGC GGTCTGATCT CGAAGAAACT GCACCTGCAT
GCGCGCCGTT TGCAGTTCGA ACACCCCGTC ACCGGCAAAG TGGTGACAGT GACTGCCGCG
CTGCCCGACC ACATGAAAGA GAGCTGGGAC ACCTTTGGCT GGACCGAGGA TCTGGCCGCC
GACGACCCGT TTGAGACGCT GTTTTGA
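
The gene sequence above is printed in groups of 10 bases, so the spaces and line breaks need to be removed before computing a length or GC content. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only for demonstration.
first_line = "ATGAGCGGCG TGCAAATGAT CACCGTGACC GAAGACGACG GCGGCCAGCG GATCGACCGC"
seq = clean_sequence(first_line)

print("bases:", len(seq))
print("GC percent:", round(100 * gc_fraction(seq)))
```

Passing the full gene sequence block to `clean_sequence` gives a string whose length and GC percentage can be compared with the Gene Length and GC content fields of the record.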
 
Protein sequence
MSGVQMITVT EDDGGQRIDR WLRRLFPHVN QGRIEKMCRK GELRLDGGRV KANTRVEAGQ 
VVRVPPLAES DMKPAEARPV KISDADAKMI RDCVIYKDDD VLVINKPAGL AVQGGSGTTK
HVDGLSEALR FDAEDKPRLV HRLDKDTSGL LVLARNRKAA QGLTAAFRHK NTRKIYWALV
AGVPTPYLGE IKTGLVKAPG HGKSGEGEKM IPVDPRDVDA TPGAKRAHTY YATLYRVASR
ASWVAMEPVT GRTHQLRAHM AGMGHPIIGD GKYGGSGQEN LGDGWGAQIG GLISKKLHLH
ARRLQFEHPV TGKVVTVTAA LPDHMKESWD TFGWTEDLAA DDPFETLF