Gene Information |
Locus tag | Haur_0787 |
Symbol | |
ID | 5732671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 886965 |
End bp | 889151 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641277917 |
Product | pseudouridine synthase |
Protein accession | YP_001543563 |
Protein GI | 159897316 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
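The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 886965
end_bp = 889151
gene_length_bp = 2187
protein_length_aa = 728

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 2187 0 728
```

Under these assumptions, the span matches the listed gene length of 2187 bp, the length is a whole number of codons, and 729 codons minus one stop codon gives the listed protein length of 728 aa.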
Plasmid Coverage Information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000926536 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGTT TACATAAGTT TTTGGCGCGG GCCGGCGTAG ATTCACGTCG GGCTTGCGAA GAGTTGATGT TGGCTGGGCG CGTATCGGTT AATGGACGAG TTCAACGCGA ATTAGGAACC CAGATCGACC CCGATAAAGA TGACATTCGA GTTGATGGCG AACAAATTAC ACCTCCAACC GAGCTTCATT ACGTCTTATT GCACAAACCA AGCAGCGTCA TGACCACGTT GCATGACCCT GAAGGCCGCC AAACCGTCGC CGATTTAGTC AAATCGCACA ATCGTTTATT TCCCGTAGGC CGCCTCGATT ACGATAGCGA AGGGCTATTG TTGCTGACCG ATGATGGTGA TCTCACGTTC CGTTTGACCC ACCCATCGTT CGAAGTCGAA AAAGAATATC AGGCCTTGTT AAACCAAACA CCATCAGTTG AGCAATTGCG CGAGTGGCGC AATGGGGTTG AGCTTGACGA TGGCAAAACA GCACCTGCAT GGATCGAATT GTTAAATGAA ACCGCCGATG GAACTTGGGT TCGGGTGGTT ATTCACGAAG GTCGCAACCG CCAAATTCGG CGGGTTGCCG AAGCATTAGG CTTAGAAGTC CGCCGCTTGA TCCGGTTGCG CGAAGGCCCA CTAACTTTAG GTGAATTACA AGCAGGCCAA TGGCGAGCAT TAACACCCAC CGAAATTGAT CGTTTGCGTA TGCACGGCAA GCAAAGCAAA GCATGGACAC CAGTACGCCC ACAAGAAAAC GCCGAGGGTG GCAAACCACG CCGTCAAGTT CGGCGGATTG ATCGGTCTGG ACAGTTGGTA GAACGTCGGG CTGATGTTGA ACCGCGCTTA CAACGAGCGC CTCACTTTAC TTCACAGGAA AATACAATGT CATTTGAAGA TTTCTCAATC GAAGAAGAAC GCTCCACCGA TCGCCCACGC CGTCGCCCAG AACCTCGCCG CGAAGGTGGA TTTAGCGACC GTGGCCCCCG CCGTGATGGC GGCAGCCCTA GCGACGACCG TGGCCCCCGC CGCGAAGGTG GATTTAGCGA ACGCGGTCCG CGCCGTGAAG CTGGCGATTT TGGCGACCGT GGTCCACGCC GCGAAGCTGG TTTCGGCGAA CGTGGTCCAC GCCGCGATGG CGGTGGCTTT GGTGAACGCG GCCCGCGCCG CGATGGTGAC GGCCCGCGCC GCGATTTTGG CGACCGTGGC CCACGTCGTG AAGGTGGCGA TTTTGGTGAC CGTGGCCCAC GTCGTGAAGG TGGTGGTTTC GGCGAACGCG GTCCGCGCCG CGATGGTGAC GGCCCGCGCC GCGATTTTGG TGACCGTAGT CCTCGTCGCG ATGGCGATTT TGGTGACCGT GGCTCACGTC GTGAAGGTGG CGATTTTGGT GACCGTGGTC CACGCCGTGA AGGTGGCTTT GGCGAACGCG GCCCGCGCCG TGAAGGCGAC GGCCCACGCC GCGATTTCGG TGACCGTGGC CCACGCCGTG AAGGTGGCGG TTTTGGCGAA CGTGGTCCAC GCCGTGAAGG TGGCGGTTTT GGCGAACGCG GCCCACGCCG TGAAGGTGGC GGTTTTGGCG AACGCAGTCC GCGCCGCGAT GGTGGCGGCT TTGGCGAACG CAGTCCGCGC CGTGAAGGTG GCGGCTTTGG CGAACGCGGC CCGCGCCGTG AAGGTGGCGG TTTTGGTGAC CGTGGTCCGC GCCGTGAAGG CGGCGGTTTC GGCGAACGTG GCCCACGCCG TGAAGGTGGC GGTTTCGGCG AACGTGGCCC ACGTCGCGAT GCAAGTGGCT TCGATCGTAC CAATCGTCGC 
GAAGAATTTT TTGATGATCG ACCACGTGGC GAACGTGGCT TGGCGCTTGA TCGCGCCCAT CGTGGCGATG ATCGCCCTAA AAACACCTTT GGCGGTCGCC GCCCAAGCAA CCAAGGCTCG TTTGATCGCC GTGGTTTTGA CAACCGTGGT CGCGACGACC GTCGTGGTTT CGATAACCGT GGCCGCGATG ACCGTCGTGG TTTCGATGAT CGCGCATCAG CTCCACGCGA CAAAATGGTT GAACGCGGGC CAATTTCAGC CCGCAATACG CCAGCGCCAA CGCCAGCACC TACGCCGGCC CCAGTTGCCG AGCAAGCAGC TCAGCCAGCC CGTCGGATGC GCGTGGTACG GCGTTTGAAA AAGGCAGGCG GTGATGAACA AGCTTAA
|
Protein sequence | MERLHKFLAR AGVDSRRACE ELMLAGRVSV NGRVQRELGT QIDPDKDDIR VDGEQITPPT ELHYVLLHKP SSVMTTLHDP EGRQTVADLV KSHNRLFPVG RLDYDSEGLL LLTDDGDLTF RLTHPSFEVE KEYQALLNQT PSVEQLREWR NGVELDDGKT APAWIELLNE TADGTWVRVV IHEGRNRQIR RVAEALGLEV RRLIRLREGP LTLGELQAGQ WRALTPTEID RLRMHGKQSK AWTPVRPQEN AEGGKPRRQV RRIDRSGQLV ERRADVEPRL QRAPHFTSQE NTMSFEDFSI EEERSTDRPR RRPEPRREGG FSDRGPRRDG GSPSDDRGPR REGGFSERGP RREAGDFGDR GPRREAGFGE RGPRRDGGGF GERGPRRDGD GPRRDFGDRG PRREGGDFGD RGPRREGGGF GERGPRRDGD GPRRDFGDRS PRRDGDFGDR GSRREGGDFG DRGPRREGGF GERGPRREGD GPRRDFGDRG PRREGGGFGE RGPRREGGGF GERGPRREGG GFGERSPRRD GGGFGERSPR REGGGFGERG PRREGGGFGD RGPRREGGGF GERGPRREGG GFGERGPRRD ASGFDRTNRR EEFFDDRPRG ERGLALDRAH RGDDRPKNTF GGRRPSNQGS FDRRGFDNRG RDDRRGFDNR GRDDRRGFDD RASAPRDKMV ERGPISARNT PAPTPAPTPA PVAEQAAQPA RRMRVVRRLK KAGGDEQA
|
| |