Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1737 |
Symbol | |
ID | 4883944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1710127 |
End bp | 1711791 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640127665 |
Product | RNA pseudouridine synthase family protein |
Protein accession | YP_001058776 |
Protein GI | 126440981 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0207122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTGATA CCCACGACAT CGATTCGTCC GAATCCGCGC ATGCCGTTGC GACGGCGCGC GCCGACGACG CACCCGAGCA GTCCGCAGCG GACGCGGGCG GCGAAGACCG CCCGCGCCGC GGTTTGCGGC GCGGGCCGCG CAGCCTGATC GCGCGCCGCC GAGCGGCCGC GAAATCGAAG CATTCCGATG CGCCCGAAAG CGCCGACGCG GCGCCGGCGG CCGATGCCGG CGCGGGCGCC GACGTCGCGA AAGCGCCCGC TCGCGCGCCG CGCGGCAAGG ACGCCGCGGC GAAGCCGCCG CGCAAGACGG CGGGCAAGCG CGAAGGCGCC GCGCGGCAGG GCGCTCAGCC GAAGCGAGGC GCGCAGCAGG CTGCCGCGGC GGTTGCGCCG TCCGCGGAGG CCGGCCAGGA CGACGTGTTC GCCTACGTGA TTTCGCCGGC GTTCGACGCC GACAACAACG CGCCGGGCGG CGGCGTGCGC GCGCCGATGC TGCGCCGGGG CCGCCAGACT CAGCCGAAGC GCGTGCTGTC GCCGGACGAC GACGCGCCGA AGCTGCACAA GGTACTCGCG GAAGCCGGCA TGGGCTCGCG CCGCGAGATG GAAGAGCTCA TCATTGCCGG CCGGGTGTCG GTGAACGGCG AGCCGGCGCA CATCGGCCAA CGGATCATGC CGACCGATCA GGTGCGGATC AACGGCAAGC CGGTCAAGCG CAAGCTGCCG AGCAAGCCGC CGCGCGTGCT GCTGTATCAC AAGCCGACGG GCGAGATCGT GAGCCACGCG GATCCGGAGG GCCGCCCGTC CGTGTTCGAT CGGCTGCCGC CGATGAAGAC CGCGAAATGG CTCGCGGTCG GCCGCCTCGA CTTCAACACC GAAGGCTTGC TGATGCTGAC GACGTCGGGC GATCTCGCGA ACCGCTTCAT GCATCCGCGC TATAGCGTCG AGCGCGAGTA CGCGGTGCGC GTCGTCGGCG AGCTGTCCGA GGCGTCGCGT CAGAAGCTGC TGCACGGCGT CGAGCTCGAC GACGGCCCGG CGAATTTCCT GCGCATTCGC GACGGCGGCG GCGAAGGCAC GAATCACTGG TATCACGTCG CGCTTGCCGA AGGGCGCAAC CGCGAGGTGC GGCGGATGTT CGAGGCGGTC GGCCTGATGG TGAGCCGCCT GATCCGCACG CGCCACGGCC CGATCCCGCT GCCGCGCGGG TTGAAGCGCG GCCGCTGGGA GGAACTCGAC GAGGCGCAGG TGCGGCGCCT GATGTCGACG GTCGGCCTGA AGGCGCCGAC CGAGGATAAG GGCGGCAAGC GCGGCGGCCC GGCCGAGCGC CGCCAGCCCG ATCCGATGCA GACGTCGATG GGCTTCATCA ATCGCGAGCC CGTGCTGACG ACTCACGGCC AGCTCGACCA GCCGCGGCGC GGCCGCCGCG GGCCGGCGGG CGGCGGCTTC GGCGCGGGCC TCGGCGGCGG CTACGCCGGC CTGCCGGGCT ACGGCGGCGC GTCGCGCCAG GGCGGCCGCG ATGTCGACGG CAACCGCGCG TCCTACGGCG GCGCGGGCGC GAACAAGCGC GGCGCCGGCA AGGGCGGCCG CAATCCGAAC GGCAATCGCG CCGAAGGCGG TGCGCGCGGC GGCCCGCGTA CGCCGCAGCA GCGCAATCGT TCGCGTAGCC GCTGA
|
Protein sequence | MTDTHDIDSS ESAHAVATAR ADDAPEQSAA DAGGEDRPRR GLRRGPRSLI ARRRAAAKSK HSDAPESADA APAADAGAGA DVAKAPARAP RGKDAAAKPP RKTAGKREGA ARQGAQPKRG AQQAAAAVAP SAEAGQDDVF AYVISPAFDA DNNAPGGGVR APMLRRGRQT QPKRVLSPDD DAPKLHKVLA EAGMGSRREM EELIIAGRVS VNGEPAHIGQ RIMPTDQVRI NGKPVKRKLP SKPPRVLLYH KPTGEIVSHA DPEGRPSVFD RLPPMKTAKW LAVGRLDFNT EGLLMLTTSG DLANRFMHPR YSVEREYAVR VVGELSEASR QKLLHGVELD DGPANFLRIR DGGGEGTNHW YHVALAEGRN REVRRMFEAV GLMVSRLIRT RHGPIPLPRG LKRGRWEELD EAQVRRLMST VGLKAPTEDK GGKRGGPAER RQPDPMQTSM GFINREPVLT THGQLDQPRR GRRGPAGGGF GAGLGGGYAG LPGYGGASRQ GGRDVDGNRA SYGGAGANKR GAGKGGRNPN GNRAEGGARG GPRTPQQRNR SRSR
|
| |