Gene Dret_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0491 
Symbol 
ID8418297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp597468 
End bp598439 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID645037053 
ProducttRNA pseudouridine synthase B 
Protein accessionYP_003197366 
Protein GI258404624 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0130] Pseudouridine synthase 
TIGRFAM ID[TIGR00431] tRNA pseudouridine 55 synthase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.444271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA AGAAACCGGC CTTTCCCCAG CAACACGGCT TGCTCATTCT GAACAAACCC 
CAGGGACCGA CCTCCTACGG CTGTCTGGAC CAGATCAAAC GCCATCTCGG ACAGCGCAAG
ATCGGCCACG CTGGGACTCT CGACCCCATG GCCAGCGGGG TCCTGGTTGT ATTGCTCGGG
CAAGCCACCA AACTCGCCTC CTATATCAGC GGCGGGGACA AAATATACCG CGGTCACCTG
CGTATCGGCC TGGCAACCGA TACGTACGAC ATCGAAGGGG CCGTGGAACA CCAATTTCCC
TGGGAACACA TAACACCTGA GGATGTAGGC AAGACGGTCG CCGAATGGCG CGATCTCCTG
GAACAGGAGG TCCCAGCGTA TTCGGCGGCC AAACACAAGG GCAAACCACT GTACGAACTC
AAGCGTACCG GGCAGGAAGT GCCGGTGAAG ACCAAGCCGG TGGAGATCTT TGACGCCCAG
GTCCTTGACA TTGCCCTGCC GGATGTCCAA TTTCGGGTCT CGTGTTCCCA AGGAACGTAC
GTGCGCTCCC TGGCCCACAG CCTGGGGAAG CGACTTGGCT GCGGCGCGAC GCTTACCGCC
CTGGTCCGGG AATTCAGCCA TCCGTTCGCT CTCGAACAGG CCTATTCGCT AGAAGAGGTC
CTCGGCGAAC CGGACTCTCT GGCTGAGAAG ATCCTGCCTC TGGACAAATG CCTGCCGCAC
TGGCCCTCGT TGCCTCTGGA CGGAAGCCAA GCCCGGGAAG TCCAAAATGG GACTCGTTTG
CTCGTCCGGG ACGTGACAGC CGATTTTGCT GTCCACCCGG GGGACCGGGC CCAGTTTGTT
GACGCCACAG GGCGTTTATT GGCGCTTGTC GAGGCCAAAT GGGAAGGACC AGTACTGGTC
TGGACAATCC TGCGCGGGTT CCAGCCGCCC AAAAAACATG ACGTCGCGCC CGAACGAACA
AACCACGAGT AG
 
Protein sequence
MAKKKPAFPQ QHGLLILNKP QGPTSYGCLD QIKRHLGQRK IGHAGTLDPM ASGVLVVLLG 
QATKLASYIS GGDKIYRGHL RIGLATDTYD IEGAVEHQFP WEHITPEDVG KTVAEWRDLL
EQEVPAYSAA KHKGKPLYEL KRTGQEVPVK TKPVEIFDAQ VLDIALPDVQ FRVSCSQGTY
VRSLAHSLGK RLGCGATLTA LVREFSHPFA LEQAYSLEEV LGEPDSLAEK ILPLDKCLPH
WPSLPLDGSQ AREVQNGTRL LVRDVTADFA VHPGDRAQFV DATGRLLALV EAKWEGPVLV
WTILRGFQPP KKHDVAPERT NHE