Gene BURPS668_1737 details

Gene Information

Locus tag: BURPS668_1737
Symbol:
ID: 4883944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1710127
End bp: 1711791
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 74%
IMG OID: 640127665
Product: RNA pseudouridine synthase family protein
Protein accession: YP_001058776
Protein GI: 126440981
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
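
The coordinate and length fields above are related by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1710127, 1711791)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both results agree with the Gene Length and Protein Length fields of the record.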


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0207122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACTGATA CCCACGACAT CGATTCGTCC GAATCCGCGC ATGCCGTTGC GACGGCGCGC 
GCCGACGACG CACCCGAGCA GTCCGCAGCG GACGCGGGCG GCGAAGACCG CCCGCGCCGC
GGTTTGCGGC GCGGGCCGCG CAGCCTGATC GCGCGCCGCC GAGCGGCCGC GAAATCGAAG
CATTCCGATG CGCCCGAAAG CGCCGACGCG GCGCCGGCGG CCGATGCCGG CGCGGGCGCC
GACGTCGCGA AAGCGCCCGC TCGCGCGCCG CGCGGCAAGG ACGCCGCGGC GAAGCCGCCG
CGCAAGACGG CGGGCAAGCG CGAAGGCGCC GCGCGGCAGG GCGCTCAGCC GAAGCGAGGC
GCGCAGCAGG CTGCCGCGGC GGTTGCGCCG TCCGCGGAGG CCGGCCAGGA CGACGTGTTC
GCCTACGTGA TTTCGCCGGC GTTCGACGCC GACAACAACG CGCCGGGCGG CGGCGTGCGC
GCGCCGATGC TGCGCCGGGG CCGCCAGACT CAGCCGAAGC GCGTGCTGTC GCCGGACGAC
GACGCGCCGA AGCTGCACAA GGTACTCGCG GAAGCCGGCA TGGGCTCGCG CCGCGAGATG
GAAGAGCTCA TCATTGCCGG CCGGGTGTCG GTGAACGGCG AGCCGGCGCA CATCGGCCAA
CGGATCATGC CGACCGATCA GGTGCGGATC AACGGCAAGC CGGTCAAGCG CAAGCTGCCG
AGCAAGCCGC CGCGCGTGCT GCTGTATCAC AAGCCGACGG GCGAGATCGT GAGCCACGCG
GATCCGGAGG GCCGCCCGTC CGTGTTCGAT CGGCTGCCGC CGATGAAGAC CGCGAAATGG
CTCGCGGTCG GCCGCCTCGA CTTCAACACC GAAGGCTTGC TGATGCTGAC GACGTCGGGC
GATCTCGCGA ACCGCTTCAT GCATCCGCGC TATAGCGTCG AGCGCGAGTA CGCGGTGCGC
GTCGTCGGCG AGCTGTCCGA GGCGTCGCGT CAGAAGCTGC TGCACGGCGT CGAGCTCGAC
GACGGCCCGG CGAATTTCCT GCGCATTCGC GACGGCGGCG GCGAAGGCAC GAATCACTGG
TATCACGTCG CGCTTGCCGA AGGGCGCAAC CGCGAGGTGC GGCGGATGTT CGAGGCGGTC
GGCCTGATGG TGAGCCGCCT GATCCGCACG CGCCACGGCC CGATCCCGCT GCCGCGCGGG
TTGAAGCGCG GCCGCTGGGA GGAACTCGAC GAGGCGCAGG TGCGGCGCCT GATGTCGACG
GTCGGCCTGA AGGCGCCGAC CGAGGATAAG GGCGGCAAGC GCGGCGGCCC GGCCGAGCGC
CGCCAGCCCG ATCCGATGCA GACGTCGATG GGCTTCATCA ATCGCGAGCC CGTGCTGACG
ACTCACGGCC AGCTCGACCA GCCGCGGCGC GGCCGCCGCG GGCCGGCGGG CGGCGGCTTC
GGCGCGGGCC TCGGCGGCGG CTACGCCGGC CTGCCGGGCT ACGGCGGCGC GTCGCGCCAG
GGCGGCCGCG ATGTCGACGG CAACCGCGCG TCCTACGGCG GCGCGGGCGC GAACAAGCGC
GGCGCCGGCA AGGGCGGCCG CAATCCGAAC GGCAATCGCG CCGAAGGCGG TGCGCGCGGC
GGCCCGCGTA CGCCGCAGCA GCGCAATCGT TCGCGTAGCC GCTGA
 
Protein sequence
MTDTHDIDSS ESAHAVATAR ADDAPEQSAA DAGGEDRPRR GLRRGPRSLI ARRRAAAKSK 
HSDAPESADA APAADAGAGA DVAKAPARAP RGKDAAAKPP RKTAGKREGA ARQGAQPKRG
AQQAAAAVAP SAEAGQDDVF AYVISPAFDA DNNAPGGGVR APMLRRGRQT QPKRVLSPDD
DAPKLHKVLA EAGMGSRREM EELIIAGRVS VNGEPAHIGQ RIMPTDQVRI NGKPVKRKLP
SKPPRVLLYH KPTGEIVSHA DPEGRPSVFD RLPPMKTAKW LAVGRLDFNT EGLLMLTTSG
DLANRFMHPR YSVEREYAVR VVGELSEASR QKLLHGVELD DGPANFLRIR DGGGEGTNHW
YHVALAEGRN REVRRMFEAV GLMVSRLIRT RHGPIPLPRG LKRGRWEELD EAQVRRLMST
VGLKAPTEDK GGKRGGPAER RQPDPMQTSM GFINREPVLT THGQLDQPRR GRRGPAGGGF
GAGLGGGYAG LPGYGGASRQ GGRDVDGNRA SYGGAGANKR GAGKGGRNPN GNRAEGGARG
GPRTPQQRNR SRSR
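
The gene sequence starts with TTG while the protein sequence starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons, and a start codon is read as methionine. The following is a minimal sketch of how the two sequence blocks could be cross-checked, not the method used to produce this record. The helper names `clean`, `gc_fraction` and `translate_cds` are hypothetical, and the sequences are assumed to be pasted in the space-grouped layout shown above.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base fastest).
# Table 11 uses the same assignments as the standard code; it differs in the
# set of codons accepted as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any accepted start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACTGATA CCCACGACAT CGATTCGTCC"))  # MTDTHDIDSS
```

Passing the full gene sequence to `translate_cds` and comparing the result with `clean()` of the protein sequence would check the whole record in the same way, and `gc_fraction` could be compared against the GC content field.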