Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2235 |
Symbol | |
ID | 4709165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2450341 |
End bp | 2451282 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639856711 |
Product | RluA family pseudouridine synthase |
Protein accession | YP_001003801 |
Protein GI | 121999014 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.894524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGAC CCCGTCACCA AGCCGCCGTA CCCGAGGGGC TGGCCGGCCA GCGCCTGGAT CGGGTGCTGG CGGCGCTCTT CCCCGATTAT TCGCGCAGCC GTTTGCAGCA GTGGATCCGG GCCGGCTGGA TCACCGTCGA CGGCGCGGTC CGCCGGCCCC GCGATCCGGT TCATGCCGGT GAGCAGATCA GCGTGGACGC CGAGCCCGAA CCGGAGACCC CGCTGGCGCC GGAACCCATC CCGTTGCGCC TGCTCTACGA GGACGACCAC CTGCTGGTGG TGGACAAGCC GGCGGGCCTG GTGGTTCACC CGGGGGCGGG CAACCCCGGC GGCACCCTGG TCAACGCCCT GCTCCATTAC GACCCCGGGC TGGAGGCGCT GCCCCGTGCG GGGATCGTCC ACCGGCTGGA CAAGGAGACC TCGGGGGTGT TGGTGGTGGC CCGGACCTAT GCGGCACACC ACGCACTGGT GGCGCAGCTT CAGGCGCGCA CGGTGGGGCG GGGTTACCAG GCGGTGGTCG TCGGGCGCCC CACCGCCGGG GGGCGAGTGG ACGCCCCCAT CGCCCGTCAC CCGCGGGACC GCAAGCGCAT GGCCGTGGTG GAGACCGGTC GTCCGGCGGT GACCCACTAC CGCGTTGCCG AGCGGTTCAC GGCCCACACC CTGCTCGATG TGGAGCTGGA AACCGGGCGC ACCCACCAGA TCCGCGTGCA CATGGCGCAC TGCCGGCTGC CGTTGGTGGG CGATCCCGTC TACGGGCGCC GGCCGGTCTA CCCCAAGGGG GCGAGCGAGA CCCTGCGCGC CGTGCTCGAC GGCTTCCGCC GTCAGGCCCT GCACGCTCGG CATCTGCGCC TTGAGCACCC GCGCACCGGA GAGGCGATGC ACTGGGAAGC GCCGCCGCCA GACGACTGGG AGCGCCTCTT GGAGGTCCTG CGCCATGGCT GA
|
Protein sequence | MERPRHQAAV PEGLAGQRLD RVLAALFPDY SRSRLQQWIR AGWITVDGAV RRPRDPVHAG EQISVDAEPE PETPLAPEPI PLRLLYEDDH LLVVDKPAGL VVHPGAGNPG GTLVNALLHY DPGLEALPRA GIVHRLDKET SGVLVVARTY AAHHALVAQL QARTVGRGYQ AVVVGRPTAG GRVDAPIARH PRDRKRMAVV ETGRPAVTHY RVAERFTAHT LLDVELETGR THQIRVHMAH CRLPLVGDPV YGRRPVYPKG ASETLRAVLD GFRRQALHAR HLRLEHPRTG EAMHWEAPPP DDWERLLEVL RHG
|
| |