Gene Information |
Locus tag | BURPS1106A_1759 |
Symbol | |
ID | 4901666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1717398 |
End bp | 1719062 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640134989 |
Product | RNA pseudouridine synthase family protein |
Protein accession | YP_001066028 |
Protein GI | 126455204 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
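The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1717398, 1719062)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both results match the Gene Length and Protein Length fields listed in the record.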
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.282053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTGATA TCCACGACAT CGATTCGTCC GAATCCGCGC ATGCCGTTGC GACGGCGCGC GCCGACGACG CACCCGAGCA GTCCGCAGCG GACGCGGGCG GCGAAGACCG CCCGCGCCGC GGTTTGCGGC GCGGGCCGCG CAGCCTGATC GCGCGCCGCC GAGCGGCCGC GAAATCGAAG CATTCCGATG CGCCCGAAAG CGCCGACGCG GCGCCGGCGG CCGATGCCGG CGCGGGCGCC GACGTCGCGA AAGCGCCCGC TCGCGCGCCG CGCGGCAAGG ACGCCGCAGC GAAGCCGCCG CGCAAGACGG CGGGCAAGCG CGAAGGCGCG GCGCGGCAGG GCGCTCAGCC GAAGCGAGGC GCGCAGCAGG CTGCCGCGGC GGTTGCGCCG TCCGCGGAGT CTGGCCAGGA CGACGTGTTC GCCTACGTGA TTTCGCCGGC GTTCGACGCC GACAACAACG CGCCGGGCGG CGGCGTGCGC GCGCCGATGC TGCGCCGGGG CCGCCAGACT CAGCCGAAGC GCGTGCTGTC GCCGGACGAC GACGCGCCGA AGCTGCACAA GGTGCTCGCG GAAGCCGGCA TGGGCTCGCG CCGCGAGATG GAAGAGCTCA TCATTGCCGG CCGGGTGTCG GTGAACGGCG AGCCGGCGCA CATCGGCCAA CGGATCATGC CGACCGATCA GGTGCGGATC AACGGCAAGC CGGTCAAGCG CAAGCTGCCG AGCAAGCCGC CGCGCGTGCT GCTGTATCAC AAGCCGACGG GCGAGATCGT GAGCCACGCG GATCCGGAGG GCCGCCCGTC CGTGTTCGAT CGGCTGCCGC CGATGAAGAC CGCGAAATGG CTCGCGGTCG GCCGCCTCGA CTTCAACACC GAAGGCCTGC TGATGCTGAC GACGTCGGGC GATCTCGCGA ACCGCTTCAT GCATCCGCGC TATAGCGTCG AGCGCGAGTA CGCGGTGCGC GTCGTCGGCG AGCTGTCCGA GGCGTCGCGT CAGAGGCTGC TGCACGGCGT CGAGCTCGAC GACGGCCCGG CGAATTTCCT GCGCATTCGC GACGGCGGCG GCGAAGGCAC GAATCACTGG TATCACGTCG CGCTTGCCGA AGGGCGCAAC CGCGAGGTGC GGCGGATGTT CGAGGCGGTC GGCCTGATGG TGAGCCGCCT GATCCGCACG CGCCACGGCC CGATCCCGCT GCCGCGCGGG TTGAAGCGCG GCCGCTGGGA GGAACTCGAC GAGGCGCAGG TGCGGCGCCT GATGTCGACG GTCGGCCTGA AGGCGCCGAC CGAGGATAAG GGCGGCAAGC GCGGCGGCCC GGCCGAGCGC CGCCAGCCCG ATCCGATGCA GACGTCGATG GGCTTCATCA ATCGCGAGCC CGTGCTGACG ACTCACGGCC AGCTCGACCA GCCGCGGCGC GGCCGCCGCG GGCCGGCGGG CGGCGGCTTC GGCGCGGGCC TCGGCGGCGG CTACGCCGGC CTGCCGGGCT ACGGCGGCGC GTCGCGCCAG GGCGGCCGCG ATGTCGACGG CAACCGCGCG TCCTACGGCG GCGCGGGTGC GAACAAGCGC GGCGCCGGCA AGGGCGGCCG CAATCCGAAC GGCAATCGCG CCGAAGGCGG GGCGCGCGGC GGCCCGCGTA CGCCGCAGCA GCGCAATCGT TCGCGTAGCC GCTGA
|
Protein sequence | MTDIHDIDSS ESAHAVATAR ADDAPEQSAA DAGGEDRPRR GLRRGPRSLI ARRRAAAKSK HSDAPESADA APAADAGAGA DVAKAPARAP RGKDAAAKPP RKTAGKREGA ARQGAQPKRG AQQAAAAVAP SAESGQDDVF AYVISPAFDA DNNAPGGGVR APMLRRGRQT QPKRVLSPDD DAPKLHKVLA EAGMGSRREM EELIIAGRVS VNGEPAHIGQ RIMPTDQVRI NGKPVKRKLP SKPPRVLLYH KPTGEIVSHA DPEGRPSVFD RLPPMKTAKW LAVGRLDFNT EGLLMLTTSG DLANRFMHPR YSVEREYAVR VVGELSEASR QRLLHGVELD DGPANFLRIR DGGGEGTNHW YHVALAEGRN REVRRMFEAV GLMVSRLIRT RHGPIPLPRG LKRGRWEELD EAQVRRLMST VGLKAPTEDK GGKRGGPAER RQPDPMQTSM GFINREPVLT THGQLDQPRR GRRGPAGGGF GAGLGGGYAG LPGYGGASRQ GGRDVDGNRA SYGGAGANKR GAGKGGRNPN GNRAEGGARG GPRTPQQRNR SRSR
|
| |
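The gene sequence starts with TTG while the protein starts with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and an initiator codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. The helper names are hypothetical, and the input is assumed to be an unspliced, plus-strand CDS written in blocks of ten bases separated by spaces, as in the record above.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same assignments as the standard code; it differs
# only in which codons may act as start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table-11 start codon as M and stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
head = clean_sequence("TTGACTGATA TCCACGACAT CGATTCGTCC")
print(translate_cds(head))  # MTDIHDIDSS
```

The printed residues match the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `translate_cds` should give the complete protein under the same assumptions.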