Gene BURPS1106A_2497 details


Gene Information

Locus tag: BURPS1106A_2497
Symbol:
ID: 4902696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2450209
End bp: 2451924
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 75%
IMG OID: 640135724
Product: RNA pseudouridine synthase family protein
Protein accession: YP_001066756
Protein GI: 126451482
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
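
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2450209, 2451924)
print(length_bp)                           # 1716
print(expected_protein_length(length_bp))  # 571
```

Both results agree with the Gene Length and Protein Length fields listed for this locus.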


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.142989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC 
CGCTCCGGCA GCCTCACCGC GCGCAAGGTC GCGCGGCCCG ACCCGAAAGC GGCGGGCGCG
AAACCCGCCG CGGCGAAGCC TGCTGCGAAG TCCGCATCGG CTGCCAAGCC GGCGGCGCCG
CGCGGCGCGG CGAACGCTGC GCCGAAGCGC GCGCCGGGGC CGTCGCACCC GGCCGCGGCA
TCCGAAGGCA AGCGCGTCGC GAAGCCGCGC GCCGCGCACG ACGCCGGCCG CACGGGCGGC
GAGCGTGCGC CGGCCAAGCG CGCCACCGCG CCCGGTGCGC CCGGCGCGGC GTCCGCGCCG
CGCACGCGCC GCACCGACGC GAAGCCGGCG CGCCGCACCG ACGAACGCCC TGCCGGCCGC
GCCGGCAATC GCCCTGCCGG CCGCGACGAG CGCGCACCGC GCGACTCGGA TGCGCGCGCG
TTCGATGCGG GCACGCGCGG TAAGGACCGC GCGCCCCGCG AGGGCGCAAG GCCCGGCGCA
CGGGGCGCGA CGGGCGCGAA GTTCGGCGGC GCGGCGCGCC GATCGGACGA CGCCGACCGT
CGAACGCCCC GCGCGACGCG TGCGGACAGC CGCGCGCGCG ATGCCGCGCC GTCGTCGTTC
GCGGGCAAGA CCACGACAGC CGGCAAGCGT GCGCCGCAGC GCGCCGACGA TCGCTACGGC
GCAGCCGGGA AGCGCACATC GCCGCGCCCC GAGCGAACCG AGCGTACCGC CCGCTTCGGC
GAACGGCCGG CCACCCGCGC GAGCGCATCC GGCGAGCGCC GCCCCACGGC CCGCGCGGCG
ACGGGTTCGC GCCTCAAGCT CGCGCAGCCG ATCAAGCGCG GCAGCGGCGA ACTGGGCGAA
TCCGCTCGCG GCGGTGAGCA CGGCGAACGC GGCAAGCGTA TCGAGCGCGG CGACGAAACC
GGCCTCGTGC GCCTGTCGAA GCGCATGTCG GAGCTGGGTC TCTGCTCGCG CCGCGAAGCA
GACGAATGGA TCGAGAAAGG CTGGGTGCTC GTCGACGGCG AGCGCATCGA CACGCTCGGC
ACGAAGGTGC GCGCCGACCA GCGCATCGAG ATCGATTCGA ACGCGCGTGC CGCGCAGGCC
GCGCAAGTGA CGATCCTGCT GCACAAGCCG GTGGGCTACG TGTCGGGCCA GGCGGAGGAC
GGCTACGCCC CCGCCGCGAC GCTCGTCACG CGCGAGAACC ACTGGAGCGG CGACCGCTCG
CCGCTGCGCT TCTCGCCGCA GCACCTGCGC GCGCTCGCGC CCGCGGGCCG GCTCGACATC
GATTCGACGG GCCTTCTCGT GCTGACGCAG AACGGGCGCG TCGCGAAACA GCTGATCGGC
GAACAATCGG ACATCGACAA GGAATACCTG GTGCGCGTGC GCTTCGGCGA GCGCACGGCC
GACATCGAAC GCCACTTCCC CGCCGAGTCG CTCGCGAAGC TGCGCCACGG CCTCGAACTC
GACGGCGTGC CGCTCAAGCC CGCGATGGTC AGTTGGCAGA ACGGCGAGCA ACTGCGCTTC
GTGCTGCGCG AAGGCAAGAA GCGCCAGATT CGCCGGATGT GCGAACTCGT CGGCCTCGAG
GTGATCGGCC TGAAGCGCGT GCGGATGGGC CGCGTGATGC TGGGCGCGCT GCCGCAAGGC
GAGTGGCGCT ATCTCGGGCC GGACGAATCG TTCTGA
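
A GC content figure like the one reported in the gene information can be recomputed from a sequence in this layout. This is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, and how the database itself rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of the first line gives the value for the whole CDS.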
 
Protein sequence
MRTKLTVKNP RPATPGRAPV RSGSLTARKV ARPDPKAAGA KPAAAKPAAK SASAAKPAAP 
RGAANAAPKR APGPSHPAAA SEGKRVAKPR AAHDAGRTGG ERAPAKRATA PGAPGAASAP
RTRRTDAKPA RRTDERPAGR AGNRPAGRDE RAPRDSDARA FDAGTRGKDR APREGARPGA
RGATGAKFGG AARRSDDADR RTPRATRADS RARDAAPSSF AGKTTTAGKR APQRADDRYG
AAGKRTSPRP ERTERTARFG ERPATRASAS GERRPTARAA TGSRLKLAQP IKRGSGELGE
SARGGEHGER GKRIERGDET GLVRLSKRMS ELGLCSRREA DEWIEKGWVL VDGERIDTLG
TKVRADQRIE IDSNARAAQA AQVTILLHKP VGYVSGQAED GYAPAATLVT RENHWSGDRS
PLRFSPQHLR ALAPAGRLDI DSTGLLVLTQ NGRVAKQLIG EQSDIDKEYL VRVRFGERTA
DIERHFPAES LAKLRHGLEL DGVPLKPAMV SWQNGEQLRF VLREGKKRQI RRMCELVGLE
VIGLKRVRMG RVMLGALPQG EWRYLGPDES F
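
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI amino acid string; translation table 11 uses the same codon-to-residue assignments as the standard table and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled here. `translate` is a hypothetical helper, not a database function.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCACAA AATTGACCGT CAAGAATCCG"))  # MRTKLTVKNP
```

The output matches the first ten residues of the protein sequence listed above.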