Gene YPK_2039 details


Gene Information

Locus tag: YPK_2039
Symbol:
ID: 6087452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2269988
End bp: 2271022
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 51%
IMG OID: 641597106
Product: 23S rRNA pseudouridylate synthase B
Protein accession: YP_001720779
Protein GI: 170024274
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
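
The start, end, gene length and protein length listed above are mutually consistent. The sketch below reproduces the arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 2269988
END_BP = 2271022


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1035
    print(protein_length(n_bp))  # 344
```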


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGA AGTCAGAAAA GCAAAAGTTA CCAACCCCAA ATCCACACAC CCCTGCTAAG 
CCACAGGGCG CTAAAATTGT GGGTGAAAAA TTACAGAAAA TCCTGGCGCG CGCTGGCCAT
GGTTCTCGTC GTGAAATTGA AGCCATTATC CAGCAAGGCC GCGTCAGTGT TGACGGTAAA
GTGTCAAAGC TAGGTGATCG TGTAGAAGTG ACCCAGTCCA CTAAAATTCG TCTGGATGGT
CATCTTCTTT CAATTAAAGA ATCTGAAGAA AATGTGTGCC GCGTTCTGGC GTATTACAAG
CCAGAAGGTG AGCTTTGTAC GCGTAATGAT CCGGAAGGTC GTCCAACGGT ATTTGACCGT
TTACCTAAAC TGCGTGGTTC ACGTTGGGTT GCGGTTGGTC GTTTGGATGT GAACACCTCT
GGTTTATTAC TGTTCACTAC CGATGGCGAA TTGGCTAACC GCCTGATGCA CCCAAGCCGT
GAAGTTGAGC GTGAATATGC GGTGCGGGTA TTTGGTCAAA TCGATGATGA TAAAATCAAA
CAGTTGAGCC GTGGTGTACA ACTGGAAGAT GGCCCTGCGG CATTTCGTAC TATCAGCTAC
CAGGGTGGTG AAGGGATAAA CCAGTGGTAT AACGTTACGT TGACCGAAGG GCGTAATCGC
GAAGTTCGCC GTCTATGGGA AGCCGTAGGC GTGCAGGTTA GCCGCCTGAT TCGTGTGCGT
TATGGCGACA TTAATTTGCC GAAAGGCCTG CCGCGGGGTG GCTGGACTGA ACTGGATCTT
AAAGCCACTA ACTACCTGCG TGAATTGGTC GAACTGGATG TTGAAACCGT CAGTAAATTA
CCCGTTGAGA AAGATCGTCG CCGGGTTAAA GCCAACCAAA TCCGTCGTGC AGTTAAGCGC
CATACCGAAG TCTCGGGCCG CCAGGTTGCT GGCCGTCAGG GTTCAGCCCG TAAAGGCTCT
ACGCGTCAAA ACGTGGGTAA TGCGGTACCC GCAGCCACGG CAAGTCGCCG TAGTGGCCCG
AAAAAACGCG GTTAA
 
Protein sequence
MSEKSEKQKL PTPNPHTPAK PQGAKIVGEK LQKILARAGH GSRREIEAII QQGRVSVDGK 
VSKLGDRVEV TQSTKIRLDG HLLSIKESEE NVCRVLAYYK PEGELCTRND PEGRPTVFDR
LPKLRGSRWV AVGRLDVNTS GLLLFTTDGE LANRLMHPSR EVEREYAVRV FGQIDDDKIK
QLSRGVQLED GPAAFRTISY QGGEGINQWY NVTLTEGRNR EVRRLWEAVG VQVSRLIRVR
YGDINLPKGL PRGGWTELDL KATNYLRELV ELDVETVSKL PVEKDRRRVK ANQIRRAVKR
HTEVSGRQVA GRQGSARKGS TRQNVGNAVP AATASRRSGP KKRG
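
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the NCBI base order (TCAG) and relies on the fact that translation table 11 assigns the same amino acids as the standard code, differing only in the permitted start codons. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; identical for
# NCBI translation tables 1 and 11 ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence above.
    print(translate("ATGAGCGAGA AGTCAGAAAA GCAAAAGTTA"))  # MSEKSEKQKL
    print(translate("AAAAAACGCG GTTAA"))                  # KKRG
```

Passing the full gene sequence to `translate` should reproduce the listed protein, and `gc_percent` on the same input gives a value to compare with the GC content field.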