Gene Haur_3492 details

Gene Information

Locus tag: Haur_3492
Symbol:
ID: 5735353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4397666
End bp: 4399348
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 54%
IMG OID: 641280639
Product: pseudouridine synthase
Protein accession: YP_001546256
Protein GI: 159900009
COG category: [J] Translation, ribosomal structure and biogenesis
              [R] General function prediction only
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
        [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
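
The coordinate and length fields in this record can be cross-checked against each other with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The helper names span_length and expected_protein_length are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4397666
end_bp = 4399348
gene_length_bp = 1683
protein_length_aa = 560


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus one terminal stop codon.

    ASSUMPTION: the CDS includes exactly one stop codon at its 3' end.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Report whether the computed values agree with the listed fields.
print("gene length:", computed_length, computed_length == gene_length_bp)
print("protein length:", computed_protein, computed_protein == protein_length_aa)
```

Under these assumptions the computed gene length and protein length both agree with the listed values of 1683 bp and 560 aa.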


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACATCG AAATTCTCTA CCGCGACGAG CAAATCCTGG TGATCAATAA ACCAACCGGG 
GTGGCAACCC ACGCACCCCA AGGCGATTTG GCATTAACCG ATGTTGAACG CGCTTTGCGT
GCCCAATTGC AACTTGAGTA TTTGGCGATT CATCAGCGGC TAGATCGCGA TACCTCGGGC
GTGATGCTGT TTGCGCTTGA CCCCGCCGCC AATGCCAATT TGGCCACGGC CTTTGCTGAA
CATACGATTG AGAAAACCTA CCAAGCCTTG GTGTATGGCG TGCCCATCCA AACTCAAGGG
GTCATCGATG CTGCGCTTGC GCCTGCTGGC GATGGCATGA TCCAGGTTGC CTCAGCCCAT
GATCGGCGTG CTCAATCAGC AATTACGCAT TATCGGGTCT TGGTCAGCAG TCCTGATCAG
CGATTTAGCT TGTTGGAATT ACAGCCAAAA ACTGGCCGAA CTCACCAATT GCGGGTGCAC
TGCGAATGTT TGGGCCATCC AATTGTTGGC GATCCGCTGT ATGATGTGGC GCGAGCTGCC
CCCCGACTGA TGCTGCATGC CAGCGAATTA CGCTTCACCC ACCCGCTTAC TCAACAACCA
TTGCATATTC AAGCGCCAAC TCCAGCCTTA TTCACGCGGG TGGCGCAAGG CTTGCCCGAA
TTACAACAAA GCACCGAGCT AGCTGCGCTG AATGGCTTAA TCGAGTTAGC GGCTGAACGG
CGGGCTGTCC TGGCAGCCGA TCCTGCTACG ACGATCTTTC GGGTGTTTCA TGGCCCAAGC
GATGGTCTGA CCCATCCATG GTTGCAGCAT TGGACGGTCG ATAAACTTGA TCAGGTATTA
ATTGCCTCGT GCTATGACGA ACATGTGCGC CAAGTGCCAG CCAGCTTAAT CAACGCCTTG
GTTGCGCAAT GGCAGCCGCA GGCGATTTAT GCTAAATATC GCCCTCGGGC TGCTGCCAAA
GTTGACGAGG CCGCGATGGC TGAGTTAGCT CCAACTAGGC CAGTGTGGGG TGAGCCAATC
GAGCAAGTGG TGGTGCAAGA AGCTGGGTTA AGCTATGAAT TGCGGCCTAA CGACGGCTTG
AGCATTGGTT TATATGCTGA TATGCGTGAA ACTCGCCAAC GGGTGCGCAA TTTGCTTGCC
AAGCGTCAAT TGCGGGTGCT CAACACCTTT GCCTATACCT GTGGCTTTGG GGTGGCAGCC
GTTGCCGATG CCCCTGAGGC AATCGTGACC AATCTTGATC TTTCGCGGCG CTCGTTGGAT
TGGGGCAAAA TTAATTATGG CCTGAATCAG TTGGCCGTTG AAGATCGTCA GTTTGTATTT
GGCGATGTCT TCGATTGGCT CAGTCGTTGG GTGCGTCAAG GCCGTCAATT CGATGTGGTG
ATTCTTGATC CACCGTCGTT TGCCCGCAAT CGGGGTAAGC GTTGGCGAGC CGAAGAAGAT
TACGCCGATT TGGTAGCCTT GGCGGTGCAG TTGTTGCCAG CCGATGGCCA TTTAATCGCT
TGCTGTAACC ATGTTGGGCT TTCGCGGCGG CAATTTCGTG GTCAAGTCGA ACGCGGTATG
CAGCAAGGGC GTTGGCATGG CACGATTGAA GCCAATTATC CGGCCTCGCC CTTAGATTAC
CCCGCTGCCT ATGGCGAAAG CCACTTGAAA ATTATTTTAG CGACTGGTCA AACCAACGAT
TAA
 
Protein sequence
MHIEILYRDE QILVINKPTG VATHAPQGDL ALTDVERALR AQLQLEYLAI HQRLDRDTSG 
VMLFALDPAA NANLATAFAE HTIEKTYQAL VYGVPIQTQG VIDAALAPAG DGMIQVASAH
DRRAQSAITH YRVLVSSPDQ RFSLLELQPK TGRTHQLRVH CECLGHPIVG DPLYDVARAA
PRLMLHASEL RFTHPLTQQP LHIQAPTPAL FTRVAQGLPE LQQSTELAAL NGLIELAAER
RAVLAADPAT TIFRVFHGPS DGLTHPWLQH WTVDKLDQVL IASCYDEHVR QVPASLINAL
VAQWQPQAIY AKYRPRAAAK VDEAAMAELA PTRPVWGEPI EQVVVQEAGL SYELRPNDGL
SIGLYADMRE TRQRVRNLLA KRQLRVLNTF AYTCGFGVAA VADAPEAIVT NLDLSRRSLD
WGKINYGLNQ LAVEDRQFVF GDVFDWLSRW VRQGRQFDVV ILDPPSFARN RGKRWRAEED
YADLVALAVQ LLPADGHLIA CCNHVGLSRR QFRGQVERGM QQGRWHGTIE ANYPASPLDY
PAAYGESHLK IILATGQTND