Gene BURPS668_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2801 
Symbol 
ID4881801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2764372 
End bp2765379 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content68% 
IMG OID640128729 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001059822 
Protein GI126439038 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000617612 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGT TAGGCAAAAA ATCCCATAAT TCGGTCGCAA GCGGTCAGGT TTCGCTCATC 
GAGATCGACG AAAGCGCGGC CGGGCAGCGC ATCGACAACT TCCTGCTGCG CGTCTGCAAG
GGCGTGCCGA AGAGTCATAT TTACCGGATC CTGCGCAGCG GCGAAGTCCG CGTGAACAAG
GGCAGGATCG ATGCGCAGTA CCGGCTCGCG TTCGGCGACG TCGTGCGCGT GCCGCCCGTG
CGCGTCGCGG CGGCCGATCT TGCGCGCGCG GCCGGCCCGG CGCCCGTGCC CGCCGCGGAA
TTCGAGATCC TGTTCGAGGA CGACGCGATC ATCGTGCTGA ACAAGCCGGC AGGCGTCGCC
GTGCACGGCG GCAGCGGCGT CGCGTTCGGC GTGATCGAGC AGATGCGCCA TGCGCGGCCG
CACGCGAAAT TCCTCGAACT CGCGCACCGG CTCGACCGCG AGACCTCGGG CATCCTGATG
CTCGCGAAGA AGCGCTCGGC GCTCGTCGGG CTGCACGAGC AGATTCGCGA GAACCGGATG
GACAAGCGCT ACTTCGCCTG CGTGCATGGC GACTGGGCGG CCGACTGGGG CCGCCGCCGC
GTGGTGAGGG CGCCCCTTTT CAAGTACGCG ACGCCCGACG GCGAGCGGCG CGTGCGGGTT
CAGGAGGACG GCCTGCCGTC GCACACGGTG TTCAATCTCG TCGACCGCTG GCCGGACTAC
GCGCTCGTCG AAGCGGAACT CAAGACGGGG CGGACCCATC AGATCCGCGT GCACCTCGCG
CATCTCGGCC TGCCGATCGT CGGCGACGCC AAGTACGGCG ATTTCGCGCT GAACAAGGCG
CTTGCGCGCG CGAACGCGGT GCCGTCGATC AAGCGGATGT TCCTGCACGC GCATCGGCTG
CGCCTCGCGC ATCCGCTGAC GGGCGAGCCG CTGCAGTTCG ACGCGCCGCT GCCCGCCGAG
TGCCGGCAAT TCATCGATCA ACTCTCCGAC TTGCGCGACA CCGCGTGA
 
Protein sequence
MNELGKKSHN SVASGQVSLI EIDESAAGQR IDNFLLRVCK GVPKSHIYRI LRSGEVRVNK 
GRIDAQYRLA FGDVVRVPPV RVAAADLARA AGPAPVPAAE FEILFEDDAI IVLNKPAGVA
VHGGSGVAFG VIEQMRHARP HAKFLELAHR LDRETSGILM LAKKRSALVG LHEQIRENRM
DKRYFACVHG DWAADWGRRR VVRAPLFKYA TPDGERRVRV QEDGLPSHTV FNLVDRWPDY
ALVEAELKTG RTHQIRVHLA HLGLPIVGDA KYGDFALNKA LARANAVPSI KRMFLHAHRL
RLAHPLTGEP LQFDAPLPAE CRQFIDQLSD LRDTA