Gene Information |
Locus tag | Haur_1169 |
Symbol | |
ID | 5733062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1341571 |
End bp | 1342509 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278309 |
Product | pseudouridine synthase |
Protein accession | YP_001543945 |
Protein GI | 159897698 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
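The coordinate and length fields above agree with each other, and a short check makes that explicit. This is a minimal sketch, assuming 1-based inclusive coordinates (the convention that reproduces the listed gene length) and a protein length that excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1341571, 1342509)
print(length, protein_length(length))  # 939 312
```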
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000063461 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTTT GGCAACTGCT GATGGCCCGC CTCGACCTCT CGCCAACCTT GGCCCAAACC CTGATGGTAC GCGGCGCGAT TTGGATCAAC AGCGTGCGGG TTACTGATCC GTTGGCTGAG GCTCCGCCAG AAGGCGAATT GGTGGTGCAT ACACCGCCCG CTGGTTTATA TGCCAACCCC GTGGTCACTC CTGCCGACAT CTTATTTGAA GATGAATGGA TTTTGGTGCT CAATAAGCCA GCCGCTTGGT ATAGCGTGGC CACGCCGTGG GATACCTTTG GGCATCTCGA AGGCGCGTTG CAACGCTTTT TTCTTGAGCG CGATGGCGAA GCTGTGCCGT TTCACCTGGT CCATCGACTG GATCATGGTA CCTCAGGCGC GTTGATCGTC TCCAAAAATC CTAGCCTCAA TAGTCGTTTT CAGCGCATGT TCAACGAGGG TCGGGTGCAG AAAACCTACT TGGCGCTATG CAGTGGCCTA CCCGATTGGA CGAAATTAGA AGTTGTGACA GGCCATTCAC GCGGCGAGTT TGGGCTGTGG TCGGTTTACC CCGCAGCAAT GATCGATCAA TTGTATGGGC CTAAGGATCG GCGGGTGCGG CGGGCGCATA CCAGTTTTAC AGTGCAGGCA GTTGGCAACG CAGCGGCTTT ACTGGCGGCT CGCTTGCACA CTGGCCGCAC CCATCAAATT CGTTTGCATG CGAAACATAT TGGACATCCC TTGGTGGGCG ATCAACGCTA TGGTGGCGTA ACTCAGTTTG GCAATTTGAG CATTCCTGAT CAATTATTAC ACGCTGCTTG GCTGCAATTT CCGCATCCGC GCTATGGCTC GATGCTTGAG TTAATCGCGC CAGTGCCAAG TCTATGGCAC GTGGTTGGAG CAGCAGTCGG AATTGATCTG CAAACGGGAT TGCTCGAAGA AGGACATTCA GCGCCATGA
Protein sequence | MPLWQLLMAR LDLSPTLAQT LMVRGAIWIN SVRVTDPLAE APPEGELVVH TPPAGLYANP VVTPADILFE DEWILVLNKP AAWYSVATPW DTFGHLEGAL QRFFLERDGE AVPFHLVHRL DHGTSGALIV SKNPSLNSRF QRMFNEGRVQ KTYLALCSGL PDWTKLEVVT GHSRGEFGLW SVYPAAMIDQ LYGPKDRRVR RAHTSFTVQA VGNAAALLAA RLHTGRTHQI RLHAKHIGHP LVGDQRYGGV TQFGNLSIPD QLLHAAWLQF PHPRYGSMLE LIAPVPSLWH VVGAAVGIDL QTGLLEEGHS AP
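The relationship between the two sequences can be checked by translating the gene sequence. The listed gene sequence starts with ATG and ends with TGA, so this sketch assumes it is already given in coding orientation despite the minus-strand annotation. It builds the codon table from the standard NCBI base order (TCAG). Translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. The `translate` helper is hypothetical, written for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGCCACTTT GGCAACTGCT"))  # MPLWQL
```

Passing the full gene sequence to `translate` in the same way lets the result be compared against the listed protein sequence once its block spacing is removed.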