Gene BURPS1106A_1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1759 
Symbol 
ID4901666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1717398 
End bp1719062 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content73% 
IMG OID640134989 
ProductRNA pseudouridine synthase family protein 
Protein accessionYP_001066028 
Protein GI126455204 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACTGATA TCCACGACAT CGATTCGTCC GAATCCGCGC ATGCCGTTGC GACGGCGCGC 
GCCGACGACG CACCCGAGCA GTCCGCAGCG GACGCGGGCG GCGAAGACCG CCCGCGCCGC
GGTTTGCGGC GCGGGCCGCG CAGCCTGATC GCGCGCCGCC GAGCGGCCGC GAAATCGAAG
CATTCCGATG CGCCCGAAAG CGCCGACGCG GCGCCGGCGG CCGATGCCGG CGCGGGCGCC
GACGTCGCGA AAGCGCCCGC TCGCGCGCCG CGCGGCAAGG ACGCCGCAGC GAAGCCGCCG
CGCAAGACGG CGGGCAAGCG CGAAGGCGCG GCGCGGCAGG GCGCTCAGCC GAAGCGAGGC
GCGCAGCAGG CTGCCGCGGC GGTTGCGCCG TCCGCGGAGT CTGGCCAGGA CGACGTGTTC
GCCTACGTGA TTTCGCCGGC GTTCGACGCC GACAACAACG CGCCGGGCGG CGGCGTGCGC
GCGCCGATGC TGCGCCGGGG CCGCCAGACT CAGCCGAAGC GCGTGCTGTC GCCGGACGAC
GACGCGCCGA AGCTGCACAA GGTGCTCGCG GAAGCCGGCA TGGGCTCGCG CCGCGAGATG
GAAGAGCTCA TCATTGCCGG CCGGGTGTCG GTGAACGGCG AGCCGGCGCA CATCGGCCAA
CGGATCATGC CGACCGATCA GGTGCGGATC AACGGCAAGC CGGTCAAGCG CAAGCTGCCG
AGCAAGCCGC CGCGCGTGCT GCTGTATCAC AAGCCGACGG GCGAGATCGT GAGCCACGCG
GATCCGGAGG GCCGCCCGTC CGTGTTCGAT CGGCTGCCGC CGATGAAGAC CGCGAAATGG
CTCGCGGTCG GCCGCCTCGA CTTCAACACC GAAGGCCTGC TGATGCTGAC GACGTCGGGC
GATCTCGCGA ACCGCTTCAT GCATCCGCGC TATAGCGTCG AGCGCGAGTA CGCGGTGCGC
GTCGTCGGCG AGCTGTCCGA GGCGTCGCGT CAGAGGCTGC TGCACGGCGT CGAGCTCGAC
GACGGCCCGG CGAATTTCCT GCGCATTCGC GACGGCGGCG GCGAAGGCAC GAATCACTGG
TATCACGTCG CGCTTGCCGA AGGGCGCAAC CGCGAGGTGC GGCGGATGTT CGAGGCGGTC
GGCCTGATGG TGAGCCGCCT GATCCGCACG CGCCACGGCC CGATCCCGCT GCCGCGCGGG
TTGAAGCGCG GCCGCTGGGA GGAACTCGAC GAGGCGCAGG TGCGGCGCCT GATGTCGACG
GTCGGCCTGA AGGCGCCGAC CGAGGATAAG GGCGGCAAGC GCGGCGGCCC GGCCGAGCGC
CGCCAGCCCG ATCCGATGCA GACGTCGATG GGCTTCATCA ATCGCGAGCC CGTGCTGACG
ACTCACGGCC AGCTCGACCA GCCGCGGCGC GGCCGCCGCG GGCCGGCGGG CGGCGGCTTC
GGCGCGGGCC TCGGCGGCGG CTACGCCGGC CTGCCGGGCT ACGGCGGCGC GTCGCGCCAG
GGCGGCCGCG ATGTCGACGG CAACCGCGCG TCCTACGGCG GCGCGGGTGC GAACAAGCGC
GGCGCCGGCA AGGGCGGCCG CAATCCGAAC GGCAATCGCG CCGAAGGCGG GGCGCGCGGC
GGCCCGCGTA CGCCGCAGCA GCGCAATCGT TCGCGTAGCC GCTGA
 
Protein sequence
MTDIHDIDSS ESAHAVATAR ADDAPEQSAA DAGGEDRPRR GLRRGPRSLI ARRRAAAKSK 
HSDAPESADA APAADAGAGA DVAKAPARAP RGKDAAAKPP RKTAGKREGA ARQGAQPKRG
AQQAAAAVAP SAESGQDDVF AYVISPAFDA DNNAPGGGVR APMLRRGRQT QPKRVLSPDD
DAPKLHKVLA EAGMGSRREM EELIIAGRVS VNGEPAHIGQ RIMPTDQVRI NGKPVKRKLP
SKPPRVLLYH KPTGEIVSHA DPEGRPSVFD RLPPMKTAKW LAVGRLDFNT EGLLMLTTSG
DLANRFMHPR YSVEREYAVR VVGELSEASR QRLLHGVELD DGPANFLRIR DGGGEGTNHW
YHVALAEGRN REVRRMFEAV GLMVSRLIRT RHGPIPLPRG LKRGRWEELD EAQVRRLMST
VGLKAPTEDK GGKRGGPAER RQPDPMQTSM GFINREPVLT THGQLDQPRR GRRGPAGGGF
GAGLGGGYAG LPGYGGASRQ GGRDVDGNRA SYGGAGANKR GAGKGGRNPN GNRAEGGARG
GPRTPQQRNR SRSR