Gene BURPS668_2440 details

Gene Information

Locus tag: BURPS668_2440
Symbol:
ID: 4883561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2405236
End bp: 2406960
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 75%
IMG OID: 640128368
Product: RNA pseudouridine synthase family protein
Protein accession: YP_001059472
Protein GI: 126439492
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
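
The length fields in this record are consistent with the coordinates: the gene length equals End bp minus Start bp plus one, and the protein length equals the number of codons minus the stop codon. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive (the record does not state the convention, but this one reproduces the listed 1725 bp); the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2405236, 2406960)
print(length_bp, protein_length(length_bp))  # 1725 574
```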


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC 
CGCTCCGGCA GCCTCACCGC GCGCAAGGTC GCGCGGCCCG ACCCGAAAGC GGCGGGCGCG
AAACCCGCCG CGGCGAAGCC TGCTGCGAAG TCCGCATCGG CTGCCAAGCC GGCGGCGCCG
CGCGGCGCGG CGAACGCTGC GCCGAAGCGC GCGCCGGGGC CGTCGCGCCC GGCCGCGGCG
TCCGAAGGCA AGCGCGTCGC GAAGCCGCGC ACCGCGCACG ACGCCGGCCG CACGGGCGGC
GAGCGTGCGC CGGCCAAGCG CGCCACCGCG CCCGGTGCGC CCGGCGCGGC GTCCGCGCCG
CGCACGCGCC GCACCGACGC GAAGCCGGCG CGGCGCACCG ACGAACGCCC TGCCGGCCGC
GCCGGCAATC GCCCTGCCGG CCGCGACGAG CGCGCACCGC GCGACTCGGA TGCGCGCGCG
TTCGATGCGG GCACGCGCGG TAAGGACCGC GCGCCCCGCG AGGGCGCAAG GCCCGGCGCA
CGGGGCGCGA CGGGCGCGAA GTTCGGCGGC GCGGCGCGCC GATCGGACGA CGCCGACCGT
CGAACGCCCC GCGCGACGCG TGCGGACAGC CGCGCGCGCG ATGCCGCGCC GTCGTCGTTC
GCGGGCAAGA CCGCGACAGC CGGCAAGCGT GCGCCGCAGC GCGCCGACGA TCGCTACGGC
GCAGCCGGGA AGCGCACATC GCCGCGCCCC GAGCGAACCG AGCGTACCGC CCGCCTCGGC
GAACGGCCGG CCACCCGCGC GAGCACATCC GGCGAGCGCC GCCCCACGGC CCGCGCGGCG
ACGGGTTCGC GCCTCAAGCT CGCGCAGCCG ATCAAGCGCG GCAGCGGCGA ACTGGGCGAA
TCCGCTCGCG GCGGTGAGCA CGGTGAGCAC GGCGAACGCG GCAAGCGTAT CGAGCGCGGC
GACGAAACCG GCCTCGTGCG CCTGTCGAAG CGCATGTCGG AGCTGGGTCT CTGCTCGCGC
CGCGAAGCAG ACGAATGGAT CGAGAAAGGC TGGGTGCTCG TCGACGGCGA GCGCATCGAC
ACGCTCGGCA CGAAGGTGCG CGCCGACCAG CGCATCGAGA TCGATTCGAA CGCGCGCGCC
GCGCAGGCCG CGCAAGTGAC GATCCTGCTG CACAAGCCGG TGGGCTACGT GTCGGGCCAG
GCGGAGGACG GCTACGCCCC CGCCGCGACG CTCGTCACGC GCGAGAACCA CTGGAGCGGC
GACCGCTCGC CGCTGCGCTT CTCGCCGCAG CACCTGCGCG CGCTCGCGCC CGCGGGCCGG
CTCGACATCG ATTCGACGGG CCTTCTCGTG CTGACGCAGA ACGGGCGCGT CGCGAAACAG
CTGATCGGCG AACAATCGGA CATCGACAAG GAATACCTGG TGCGCGTGCG CTTCGGCGAG
CGCACGGCCG ACATCGAACG CCACTTCCCC GCCGAGTCGC TCGCGAAGCT GCGCCACGGC
CTCGAGCTCG ACGGCGTGCC GCTCAAGCCC GCGATGGTCA GTTGGCAGAA CGGCGAGCAA
CTGCGCTTCG TGCTGCGCGA AGGCAAGAAG CGCCAGATTC GCCGGATGTG CGAACTCGTC
GGCCTCGAGG TGATCGGCCT GAAGCGCGTG CGGATGGGCC GCGTGATGCT GGGCGCGCTG
CCGCAAGGCG AGTGGCGCTA TCTCGGGCCG GACGAATCGT TCTGA
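
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as a GC fraction. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the demo string is only the first two blocks of the sequence, so its GC fraction is not the value reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGCGCACAA AATTGACCGT")
print(len(demo), gc_fraction(demo))  # 20 0.45
```

Passing the full sequence block to clean_sequence and then to gc_fraction would give the figure to compare against the GC content field.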
 
Protein sequence
MRTKLTVKNP RPATPGRAPV RSGSLTARKV ARPDPKAAGA KPAAAKPAAK SASAAKPAAP 
RGAANAAPKR APGPSRPAAA SEGKRVAKPR TAHDAGRTGG ERAPAKRATA PGAPGAASAP
RTRRTDAKPA RRTDERPAGR AGNRPAGRDE RAPRDSDARA FDAGTRGKDR APREGARPGA
RGATGAKFGG AARRSDDADR RTPRATRADS RARDAAPSSF AGKTATAGKR APQRADDRYG
AAGKRTSPRP ERTERTARLG ERPATRASTS GERRPTARAA TGSRLKLAQP IKRGSGELGE
SARGGEHGEH GERGKRIERG DETGLVRLSK RMSELGLCSR READEWIEKG WVLVDGERID
TLGTKVRADQ RIEIDSNARA AQAAQVTILL HKPVGYVSGQ AEDGYAPAAT LVTRENHWSG
DRSPLRFSPQ HLRALAPAGR LDIDSTGLLV LTQNGRVAKQ LIGEQSDIDK EYLVRVRFGE
RTADIERHFP AESLAKLRHG LELDGVPLKP AMVSWQNGEQ LRFVLREGKK RQIRRMCELV
GLEVIGLKRV RMGRVMLGAL PQGEWRYLGP DESF
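
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first three and the last one and a half display blocks of the gene. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA string, dropping one trailing stop codon."""
    seq = "".join(raw.split()).upper()  # strip display spaces and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCGCACAA AATTGACCGT CAAGAATCCG"))  # MRTKLTVKNP
print(translate("GACGAATCGT TCTGA"))  # DESF
```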