Gene Haur_0787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0787 
Symbol 
ID5732671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp886965 
End bp889151 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content59% 
IMG OID641277917 
Productpseudouridine synthase 
Protein accessionYP_001543563 
Protein GI159897316 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000926536 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGTT TACATAAGTT TTTGGCGCGG GCCGGCGTAG ATTCACGTCG GGCTTGCGAA 
GAGTTGATGT TGGCTGGGCG CGTATCGGTT AATGGACGAG TTCAACGCGA ATTAGGAACC
CAGATCGACC CCGATAAAGA TGACATTCGA GTTGATGGCG AACAAATTAC ACCTCCAACC
GAGCTTCATT ACGTCTTATT GCACAAACCA AGCAGCGTCA TGACCACGTT GCATGACCCT
GAAGGCCGCC AAACCGTCGC CGATTTAGTC AAATCGCACA ATCGTTTATT TCCCGTAGGC
CGCCTCGATT ACGATAGCGA AGGGCTATTG TTGCTGACCG ATGATGGTGA TCTCACGTTC
CGTTTGACCC ACCCATCGTT CGAAGTCGAA AAAGAATATC AGGCCTTGTT AAACCAAACA
CCATCAGTTG AGCAATTGCG CGAGTGGCGC AATGGGGTTG AGCTTGACGA TGGCAAAACA
GCACCTGCAT GGATCGAATT GTTAAATGAA ACCGCCGATG GAACTTGGGT TCGGGTGGTT
ATTCACGAAG GTCGCAACCG CCAAATTCGG CGGGTTGCCG AAGCATTAGG CTTAGAAGTC
CGCCGCTTGA TCCGGTTGCG CGAAGGCCCA CTAACTTTAG GTGAATTACA AGCAGGCCAA
TGGCGAGCAT TAACACCCAC CGAAATTGAT CGTTTGCGTA TGCACGGCAA GCAAAGCAAA
GCATGGACAC CAGTACGCCC ACAAGAAAAC GCCGAGGGTG GCAAACCACG CCGTCAAGTT
CGGCGGATTG ATCGGTCTGG ACAGTTGGTA GAACGTCGGG CTGATGTTGA ACCGCGCTTA
CAACGAGCGC CTCACTTTAC TTCACAGGAA AATACAATGT CATTTGAAGA TTTCTCAATC
GAAGAAGAAC GCTCCACCGA TCGCCCACGC CGTCGCCCAG AACCTCGCCG CGAAGGTGGA
TTTAGCGACC GTGGCCCCCG CCGTGATGGC GGCAGCCCTA GCGACGACCG TGGCCCCCGC
CGCGAAGGTG GATTTAGCGA ACGCGGTCCG CGCCGTGAAG CTGGCGATTT TGGCGACCGT
GGTCCACGCC GCGAAGCTGG TTTCGGCGAA CGTGGTCCAC GCCGCGATGG CGGTGGCTTT
GGTGAACGCG GCCCGCGCCG CGATGGTGAC GGCCCGCGCC GCGATTTTGG CGACCGTGGC
CCACGTCGTG AAGGTGGCGA TTTTGGTGAC CGTGGCCCAC GTCGTGAAGG TGGTGGTTTC
GGCGAACGCG GTCCGCGCCG CGATGGTGAC GGCCCGCGCC GCGATTTTGG TGACCGTAGT
CCTCGTCGCG ATGGCGATTT TGGTGACCGT GGCTCACGTC GTGAAGGTGG CGATTTTGGT
GACCGTGGTC CACGCCGTGA AGGTGGCTTT GGCGAACGCG GCCCGCGCCG TGAAGGCGAC
GGCCCACGCC GCGATTTCGG TGACCGTGGC CCACGCCGTG AAGGTGGCGG TTTTGGCGAA
CGTGGTCCAC GCCGTGAAGG TGGCGGTTTT GGCGAACGCG GCCCACGCCG TGAAGGTGGC
GGTTTTGGCG AACGCAGTCC GCGCCGCGAT GGTGGCGGCT TTGGCGAACG CAGTCCGCGC
CGTGAAGGTG GCGGCTTTGG CGAACGCGGC CCGCGCCGTG AAGGTGGCGG TTTTGGTGAC
CGTGGTCCGC GCCGTGAAGG CGGCGGTTTC GGCGAACGTG GCCCACGCCG TGAAGGTGGC
GGTTTCGGCG AACGTGGCCC ACGTCGCGAT GCAAGTGGCT TCGATCGTAC CAATCGTCGC
GAAGAATTTT TTGATGATCG ACCACGTGGC GAACGTGGCT TGGCGCTTGA TCGCGCCCAT
CGTGGCGATG ATCGCCCTAA AAACACCTTT GGCGGTCGCC GCCCAAGCAA CCAAGGCTCG
TTTGATCGCC GTGGTTTTGA CAACCGTGGT CGCGACGACC GTCGTGGTTT CGATAACCGT
GGCCGCGATG ACCGTCGTGG TTTCGATGAT CGCGCATCAG CTCCACGCGA CAAAATGGTT
GAACGCGGGC CAATTTCAGC CCGCAATACG CCAGCGCCAA CGCCAGCACC TACGCCGGCC
CCAGTTGCCG AGCAAGCAGC TCAGCCAGCC CGTCGGATGC GCGTGGTACG GCGTTTGAAA
AAGGCAGGCG GTGATGAACA AGCTTAA
 
Protein sequence
MERLHKFLAR AGVDSRRACE ELMLAGRVSV NGRVQRELGT QIDPDKDDIR VDGEQITPPT 
ELHYVLLHKP SSVMTTLHDP EGRQTVADLV KSHNRLFPVG RLDYDSEGLL LLTDDGDLTF
RLTHPSFEVE KEYQALLNQT PSVEQLREWR NGVELDDGKT APAWIELLNE TADGTWVRVV
IHEGRNRQIR RVAEALGLEV RRLIRLREGP LTLGELQAGQ WRALTPTEID RLRMHGKQSK
AWTPVRPQEN AEGGKPRRQV RRIDRSGQLV ERRADVEPRL QRAPHFTSQE NTMSFEDFSI
EEERSTDRPR RRPEPRREGG FSDRGPRRDG GSPSDDRGPR REGGFSERGP RREAGDFGDR
GPRREAGFGE RGPRRDGGGF GERGPRRDGD GPRRDFGDRG PRREGGDFGD RGPRREGGGF
GERGPRRDGD GPRRDFGDRS PRRDGDFGDR GSRREGGDFG DRGPRREGGF GERGPRREGD
GPRRDFGDRG PRREGGGFGE RGPRREGGGF GERGPRREGG GFGERSPRRD GGGFGERSPR
REGGGFGERG PRREGGGFGD RGPRREGGGF GERGPRREGG GFGERGPRRD ASGFDRTNRR
EEFFDDRPRG ERGLALDRAH RGDDRPKNTF GGRRPSNQGS FDRRGFDNRG RDDRRGFDNR
GRDDRRGFDD RASAPRDKMV ERGPISARNT PAPTPAPTPA PVAEQAAQPA RRMRVVRRLK
KAGGDEQA