Gene Information |
Locus tag | BURPS1106A_2497 |
Symbol | |
ID | 4902696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 2450209 |
End bp | 2451924 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640135724 |
Product | RNA pseudouridine synthase family protein |
Protein accession | YP_001066756 |
Protein GI | 126451482 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
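
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the record lists the gene as not spliced, and the gene sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2450209, 2451924)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1716 571
```

These values match the Gene Length (1716 bp) and Protein Length (571 aa) fields reported for this locus.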
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.142989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC CGCTCCGGCA GCCTCACCGC GCGCAAGGTC GCGCGGCCCG ACCCGAAAGC GGCGGGCGCG AAACCCGCCG CGGCGAAGCC TGCTGCGAAG TCCGCATCGG CTGCCAAGCC GGCGGCGCCG CGCGGCGCGG CGAACGCTGC GCCGAAGCGC GCGCCGGGGC CGTCGCACCC GGCCGCGGCA TCCGAAGGCA AGCGCGTCGC GAAGCCGCGC GCCGCGCACG ACGCCGGCCG CACGGGCGGC GAGCGTGCGC CGGCCAAGCG CGCCACCGCG CCCGGTGCGC CCGGCGCGGC GTCCGCGCCG CGCACGCGCC GCACCGACGC GAAGCCGGCG CGCCGCACCG ACGAACGCCC TGCCGGCCGC GCCGGCAATC GCCCTGCCGG CCGCGACGAG CGCGCACCGC GCGACTCGGA TGCGCGCGCG TTCGATGCGG GCACGCGCGG TAAGGACCGC GCGCCCCGCG AGGGCGCAAG GCCCGGCGCA CGGGGCGCGA CGGGCGCGAA GTTCGGCGGC GCGGCGCGCC GATCGGACGA CGCCGACCGT CGAACGCCCC GCGCGACGCG TGCGGACAGC CGCGCGCGCG ATGCCGCGCC GTCGTCGTTC GCGGGCAAGA CCACGACAGC CGGCAAGCGT GCGCCGCAGC GCGCCGACGA TCGCTACGGC GCAGCCGGGA AGCGCACATC GCCGCGCCCC GAGCGAACCG AGCGTACCGC CCGCTTCGGC GAACGGCCGG CCACCCGCGC GAGCGCATCC GGCGAGCGCC GCCCCACGGC CCGCGCGGCG ACGGGTTCGC GCCTCAAGCT CGCGCAGCCG ATCAAGCGCG GCAGCGGCGA ACTGGGCGAA TCCGCTCGCG GCGGTGAGCA CGGCGAACGC GGCAAGCGTA TCGAGCGCGG CGACGAAACC GGCCTCGTGC GCCTGTCGAA GCGCATGTCG GAGCTGGGTC TCTGCTCGCG CCGCGAAGCA GACGAATGGA TCGAGAAAGG CTGGGTGCTC GTCGACGGCG AGCGCATCGA CACGCTCGGC ACGAAGGTGC GCGCCGACCA GCGCATCGAG ATCGATTCGA ACGCGCGTGC CGCGCAGGCC GCGCAAGTGA CGATCCTGCT GCACAAGCCG GTGGGCTACG TGTCGGGCCA GGCGGAGGAC GGCTACGCCC CCGCCGCGAC GCTCGTCACG CGCGAGAACC ACTGGAGCGG CGACCGCTCG CCGCTGCGCT TCTCGCCGCA GCACCTGCGC GCGCTCGCGC CCGCGGGCCG GCTCGACATC GATTCGACGG GCCTTCTCGT GCTGACGCAG AACGGGCGCG TCGCGAAACA GCTGATCGGC GAACAATCGG ACATCGACAA GGAATACCTG GTGCGCGTGC GCTTCGGCGA GCGCACGGCC GACATCGAAC GCCACTTCCC CGCCGAGTCG CTCGCGAAGC TGCGCCACGG CCTCGAACTC GACGGCGTGC CGCTCAAGCC CGCGATGGTC AGTTGGCAGA ACGGCGAGCA ACTGCGCTTC GTGCTGCGCG AAGGCAAGAA GCGCCAGATT CGCCGGATGT GCGAACTCGT CGGCCTCGAG GTGATCGGCC TGAAGCGCGT GCGGATGGGC CGCGTGATGC TGGGCGCGCT GCCGCAAGGC GAGTGGCGCT ATCTCGGGCC GGACGAATCG TTCTGA
|
Protein sequence | MRTKLTVKNP RPATPGRAPV RSGSLTARKV ARPDPKAAGA KPAAAKPAAK SASAAKPAAP RGAANAAPKR APGPSHPAAA SEGKRVAKPR AAHDAGRTGG ERAPAKRATA PGAPGAASAP RTRRTDAKPA RRTDERPAGR AGNRPAGRDE RAPRDSDARA FDAGTRGKDR APREGARPGA RGATGAKFGG AARRSDDADR RTPRATRADS RARDAAPSSF AGKTTTAGKR APQRADDRYG AAGKRTSPRP ERTERTARFG ERPATRASAS GERRPTARAA TGSRLKLAQP IKRGSGELGE SARGGEHGER GKRIERGDET GLVRLSKRMS ELGLCSRREA DEWIEKGWVL VDGERIDTLG TKVRADQRIE IDSNARAAQA AQVTILLHKP VGYVSGQAED GYAPAATLVT RENHWSGDRS PLRFSPQHLR ALAPAGRLDI DSTGLLVLTQ NGRVAKQLIG EQSDIDKEYL VRVRFGERTA DIERHFPAES LAKLRHGLEL DGVPLKPAMV SWQNGEQLRF VLREGKKRQI RRMCELVGLE VIGLKRVRMG RVMLGALPQG EWRYLGPDES F