Gene Hoch_5639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5639 
Symbol 
ID8548053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7742398 
End bp7744059 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content75% 
IMG OID646390307 
Productpseudouridine synthase 
Protein accessionYP_003270009 
Protein GI262198800 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAAG CGATACGTCT ACAGCGCTAT CTATCGCAGT GCGGGGTCGC GGCCCGGCGC 
AAGGCGGAGG AGCTCATCAC CAGCGGCGTG GTCAGCGTCA ACGGGCAGCG GGTCACCGAG
CTAGGGACCA AAGTCGTGCC CGGTCGCGAC CGCGTGAGCG TCCGCGGCGA AGCGGTCTAT
CCCGAGGAGC CCTTCTACGT GCTCCTCAAC AAGCCCAAGG GCTGCCTGAC CGCGGTCACC
GACCCCGAGA ACCGGCCCAC GGTCATGGAG TATCTGCACG GTCTGCCTGC GCGGGTCGTG
CCGGTGGGCC GCCTCGACTT CTACAGCGAG GGCGTGCTGC TGCTGACCAA CGACGGCGAC
CTGTCGGCGC GGCTGCAATC GCCCCGCCAC CACGTCGAGA AGACCTACCA CGTCAAAGTG
CGCGACCAGG TCACCGAGCG CCACCTCGAG GTGTTGCGCC AGGGCGTGCG CCTCGAGGAC
GGCACGGTGA CGCGGCCGGC GCAGATCGAC ATGCTCGGGG GCACCAAGAG CCGCCACGAT
TGGCTGGTCA TCACCCTGAC CGAGGGCAAA TCGCGGCAGA TTCACCGCAT GCTCGGCGTG
CTCGGCTACA CGGTCATGAA GCTGCAGCGC GTGGCCTTTG GCGGGCTGAC CTTCCACGGT
CTGCGCGTGG GCGACGCGCG CGAGTTGACG CAAGCCGAGG TCAACGATCT GCGCGAGCTG
GTCGGACTTC CCAAGGACAC GGTGGCGCGC GGCAAGTGGA CTTCGCGTCG TGAAGAGACC
GAACGCGCCC GGCGCAGCCG CGCCCGCGCG CGCGTTGGCA TGAGCGGCGG CCGGGACGGG
GGCCGTGATT TTCGTACGGG TGGACGAGGT CGCGACGGCG GCTACGGTGG GCGCGACGGC
GGTCGCGATC GCGGCTACGG CGGACGCGAC GGAGGCCGTG ATCGCGGCTT CGGCGGCCGC
GACGGCGACT ATGGCGGACG TGATGGCGGA CGCGACGGCG GCTACGGCGG ACGCGACGGA
GGTCGTGAGC GCGGCTTCGG TGGTCGCGAC GGCGGCCATG GCGGTCGCGA TGGCGGCCGT
GACGGCGGCT ATGGCGGACG CGACGGAGGT CGTGAGCGCG GCTTCGGTGG TCGCGACGGC
GGTCGCGATC GTGGCCACGG CGGACGCGAT GGTGGCCGTG ACGGCGGCTA TGGCGGTCGG
GGTCGCGACG GCGCCTCGGG CGGCCCGCGC GGTCCCGGAG GTGGACGCGG GGACGGCCCG
CGCGGTCGCA GCGGCGGCCC CGGAGGCGGA CGCGGCGACG GCCCGCGCGG TCGCAGCGGC
GGCCCCGGAG GCGGACGCGG CGACGGCCCG CGCGGTCGCA GCGGCGGCCC CGGAGGTGGA
CGCGGCGACG GCCCGCGCGG TCGCAGCGGC GGCCCCGGAG GCGGACGCGG CGACGGCCCG
CGCGGTCGCA GCGGCGGCCC CGGAGGTGGA CGCGGCGACG GCCCGCGCGG TCGCAGCGGC
GGCCCCGGAG GCGGACGCGG CGACGGCCCG CGCGGTCGCA GCGGCGGCCC CGGAGGCGGT
GGTCCCGGCG GTAAGCGCGG CGGACCGTCT CGCGGTGGTC CGCGTCCGTC CGCTCGTCCG
AAAAAAGGCG GCGGTCCCTC TCGCGGCGGC TCGTCGCGCT GA
 
Protein sequence
MDEAIRLQRY LSQCGVAARR KAEELITSGV VSVNGQRVTE LGTKVVPGRD RVSVRGEAVY 
PEEPFYVLLN KPKGCLTAVT DPENRPTVME YLHGLPARVV PVGRLDFYSE GVLLLTNDGD
LSARLQSPRH HVEKTYHVKV RDQVTERHLE VLRQGVRLED GTVTRPAQID MLGGTKSRHD
WLVITLTEGK SRQIHRMLGV LGYTVMKLQR VAFGGLTFHG LRVGDARELT QAEVNDLREL
VGLPKDTVAR GKWTSRREET ERARRSRARA RVGMSGGRDG GRDFRTGGRG RDGGYGGRDG
GRDRGYGGRD GGRDRGFGGR DGDYGGRDGG RDGGYGGRDG GRERGFGGRD GGHGGRDGGR
DGGYGGRDGG RERGFGGRDG GRDRGHGGRD GGRDGGYGGR GRDGASGGPR GPGGGRGDGP
RGRSGGPGGG RGDGPRGRSG GPGGGRGDGP RGRSGGPGGG RGDGPRGRSG GPGGGRGDGP
RGRSGGPGGG RGDGPRGRSG GPGGGRGDGP RGRSGGPGGG GPGGKRGGPS RGGPRPSARP
KKGGGPSRGG SSR