Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3492 |
Symbol | |
ID | 5735353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4397666 |
End bp | 4399348 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280639 |
Product | pseudouridine synthase |
Protein accession | YP_001546256 |
Protein GI | 159900009 |
COG category | [J] Translation, ribosomal structure and biogenesis [R] General function prediction only |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCG AAATTCTCTA CCGCGACGAG CAAATCCTGG TGATCAATAA ACCAACCGGG GTGGCAACCC ACGCACCCCA AGGCGATTTG GCATTAACCG ATGTTGAACG CGCTTTGCGT GCCCAATTGC AACTTGAGTA TTTGGCGATT CATCAGCGGC TAGATCGCGA TACCTCGGGC GTGATGCTGT TTGCGCTTGA CCCCGCCGCC AATGCCAATT TGGCCACGGC CTTTGCTGAA CATACGATTG AGAAAACCTA CCAAGCCTTG GTGTATGGCG TGCCCATCCA AACTCAAGGG GTCATCGATG CTGCGCTTGC GCCTGCTGGC GATGGCATGA TCCAGGTTGC CTCAGCCCAT GATCGGCGTG CTCAATCAGC AATTACGCAT TATCGGGTCT TGGTCAGCAG TCCTGATCAG CGATTTAGCT TGTTGGAATT ACAGCCAAAA ACTGGCCGAA CTCACCAATT GCGGGTGCAC TGCGAATGTT TGGGCCATCC AATTGTTGGC GATCCGCTGT ATGATGTGGC GCGAGCTGCC CCCCGACTGA TGCTGCATGC CAGCGAATTA CGCTTCACCC ACCCGCTTAC TCAACAACCA TTGCATATTC AAGCGCCAAC TCCAGCCTTA TTCACGCGGG TGGCGCAAGG CTTGCCCGAA TTACAACAAA GCACCGAGCT AGCTGCGCTG AATGGCTTAA TCGAGTTAGC GGCTGAACGG CGGGCTGTCC TGGCAGCCGA TCCTGCTACG ACGATCTTTC GGGTGTTTCA TGGCCCAAGC GATGGTCTGA CCCATCCATG GTTGCAGCAT TGGACGGTCG ATAAACTTGA TCAGGTATTA ATTGCCTCGT GCTATGACGA ACATGTGCGC CAAGTGCCAG CCAGCTTAAT CAACGCCTTG GTTGCGCAAT GGCAGCCGCA GGCGATTTAT GCTAAATATC GCCCTCGGGC TGCTGCCAAA GTTGACGAGG CCGCGATGGC TGAGTTAGCT CCAACTAGGC CAGTGTGGGG TGAGCCAATC GAGCAAGTGG TGGTGCAAGA AGCTGGGTTA AGCTATGAAT TGCGGCCTAA CGACGGCTTG AGCATTGGTT TATATGCTGA TATGCGTGAA ACTCGCCAAC GGGTGCGCAA TTTGCTTGCC AAGCGTCAAT TGCGGGTGCT CAACACCTTT GCCTATACCT GTGGCTTTGG GGTGGCAGCC GTTGCCGATG CCCCTGAGGC AATCGTGACC AATCTTGATC TTTCGCGGCG CTCGTTGGAT TGGGGCAAAA TTAATTATGG CCTGAATCAG TTGGCCGTTG AAGATCGTCA GTTTGTATTT GGCGATGTCT TCGATTGGCT CAGTCGTTGG GTGCGTCAAG GCCGTCAATT CGATGTGGTG ATTCTTGATC CACCGTCGTT TGCCCGCAAT CGGGGTAAGC GTTGGCGAGC CGAAGAAGAT TACGCCGATT TGGTAGCCTT GGCGGTGCAG TTGTTGCCAG CCGATGGCCA TTTAATCGCT TGCTGTAACC ATGTTGGGCT TTCGCGGCGG CAATTTCGTG GTCAAGTCGA ACGCGGTATG CAGCAAGGGC GTTGGCATGG CACGATTGAA GCCAATTATC CGGCCTCGCC CTTAGATTAC CCCGCTGCCT ATGGCGAAAG CCACTTGAAA ATTATTTTAG CGACTGGTCA AACCAACGAT TAA
|
Protein sequence | MHIEILYRDE QILVINKPTG VATHAPQGDL ALTDVERALR AQLQLEYLAI HQRLDRDTSG VMLFALDPAA NANLATAFAE HTIEKTYQAL VYGVPIQTQG VIDAALAPAG DGMIQVASAH DRRAQSAITH YRVLVSSPDQ RFSLLELQPK TGRTHQLRVH CECLGHPIVG DPLYDVARAA PRLMLHASEL RFTHPLTQQP LHIQAPTPAL FTRVAQGLPE LQQSTELAAL NGLIELAAER RAVLAADPAT TIFRVFHGPS DGLTHPWLQH WTVDKLDQVL IASCYDEHVR QVPASLINAL VAQWQPQAIY AKYRPRAAAK VDEAAMAELA PTRPVWGEPI EQVVVQEAGL SYELRPNDGL SIGLYADMRE TRQRVRNLLA KRQLRVLNTF AYTCGFGVAA VADAPEAIVT NLDLSRRSLD WGKINYGLNQ LAVEDRQFVF GDVFDWLSRW VRQGRQFDVV ILDPPSFARN RGKRWRAEED YADLVALAVQ LLPADGHLIA CCNHVGLSRR QFRGQVERGM QQGRWHGTIE ANYPASPLDY PAAYGESHLK IILATGQTND
|
| |