Gene Hhal_2235 details


Gene Information

Locus tag: Hhal_2235
Symbol:
ID: 4709165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2450341
End bp: 2451282
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 73%
IMG OID: 639856711
Product: RluA family pseudouridine synthase
Protein accession: YP_001003801
Protein GI: 121999014
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
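
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a convention that is consistent with the values above but is not stated in the record) and that the unspliced CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers written for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(2450341, 2451282)
print(length, protein_length(length))  # 942 313
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields.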


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.894524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACGAC CCCGTCACCA AGCCGCCGTA CCCGAGGGGC TGGCCGGCCA GCGCCTGGAT 
CGGGTGCTGG CGGCGCTCTT CCCCGATTAT TCGCGCAGCC GTTTGCAGCA GTGGATCCGG
GCCGGCTGGA TCACCGTCGA CGGCGCGGTC CGCCGGCCCC GCGATCCGGT TCATGCCGGT
GAGCAGATCA GCGTGGACGC CGAGCCCGAA CCGGAGACCC CGCTGGCGCC GGAACCCATC
CCGTTGCGCC TGCTCTACGA GGACGACCAC CTGCTGGTGG TGGACAAGCC GGCGGGCCTG
GTGGTTCACC CGGGGGCGGG CAACCCCGGC GGCACCCTGG TCAACGCCCT GCTCCATTAC
GACCCCGGGC TGGAGGCGCT GCCCCGTGCG GGGATCGTCC ACCGGCTGGA CAAGGAGACC
TCGGGGGTGT TGGTGGTGGC CCGGACCTAT GCGGCACACC ACGCACTGGT GGCGCAGCTT
CAGGCGCGCA CGGTGGGGCG GGGTTACCAG GCGGTGGTCG TCGGGCGCCC CACCGCCGGG
GGGCGAGTGG ACGCCCCCAT CGCCCGTCAC CCGCGGGACC GCAAGCGCAT GGCCGTGGTG
GAGACCGGTC GTCCGGCGGT GACCCACTAC CGCGTTGCCG AGCGGTTCAC GGCCCACACC
CTGCTCGATG TGGAGCTGGA AACCGGGCGC ACCCACCAGA TCCGCGTGCA CATGGCGCAC
TGCCGGCTGC CGTTGGTGGG CGATCCCGTC TACGGGCGCC GGCCGGTCTA CCCCAAGGGG
GCGAGCGAGA CCCTGCGCGC CGTGCTCGAC GGCTTCCGCC GTCAGGCCCT GCACGCTCGG
CATCTGCGCC TTGAGCACCC GCGCACCGGA GAGGCGATGC ACTGGGAAGC GCCGCCGCCA
GACGACTGGG AGCGCCTCTT GGAGGTCCTG CGCCATGGCT GA
 
Protein sequence
MERPRHQAAV PEGLAGQRLD RVLAALFPDY SRSRLQQWIR AGWITVDGAV RRPRDPVHAG 
EQISVDAEPE PETPLAPEPI PLRLLYEDDH LLVVDKPAGL VVHPGAGNPG GTLVNALLHY
DPGLEALPRA GIVHRLDKET SGVLVVARTY AAHHALVAQL QARTVGRGYQ AVVVGRPTAG
GRVDAPIARH PRDRKRMAVV ETGRPAVTHY RVAERFTAHT LLDVELETGR THQIRVHMAH
CRLPLVGDPV YGRRPVYPKG ASETLRAVLD GFRRQALHAR HLRLEHPRTG EAMHWEAPPP
DDWERLLEVL RHG
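
Both sequences are displayed in 10-character blocks, six blocks per line. To work with them programmatically, the whitespace has to be removed first. The following is a minimal sketch in Python; `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation is simply the share of G and C bases, which may differ from how the database derived its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as displayed in the record.
first_line = "ATGGAACGAC CCCGTCACCA AGCCGCCGTA CCCGAGGGGC TGGCCGGCCA GCGCCTGGAT"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence text to `clean_sequence` should yield a string whose length matches the Gene Length field.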