Gene Hmuk_1733 details


Gene Information

Locus tag: Hmuk_1733
Symbol:
ID: 8411257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1647734
End bp: 1648633
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 70%
IMG OID: 645020061
Product: H/ACA RNA-protein complex component Cbf5p
Protein accession: YP_003177554
Protein GI: 257387781
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0130] Pseudouridine synthase
TIGRFAM ID: [TIGR00425] rRNA pseudouridine synthase, putative
            [TIGR00431] tRNA pseudouridine 55 synthase
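
The start, end, gene length and protein length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal sketch that assumes 1-based inclusive coordinates and a CDS whose length includes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(1647734, 1648633)
print(gene_len)                  # 900
print(protein_length(gene_len))  # 299
```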


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.17014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCGCG ACTCGCCCGA ACACCGGTCG CCCGCCGATC TGCTCACCTT CGGCGTCGTC 
AACCTCGACA AGCCACCCGG CCCCTCCGCC CACCAGGTGG CCGCCTGGGT CCGGGACGCC
GTCGACGACG CGCTCGACGC GGCCGGCGAC GACCGGACGC TCGGGCGCGT CGCACACGGC
GGAACGCTCG ACCCGAAGGT CACGGGCTGT CTCCCGATCT TGCTCGGCGA CGCCGCCCGC
GCGGCACGCG TGTTCGACGA CGCAGTCAAA GAGTACGTCA CGATTCTCGA ACTCCACGAC
CAGGCCCCGC CGGACTTCGA CTCGATCGTC GCCGAGTTCG AGGGCGAAAT CTACCAGAAG
CCCCCGCGCA AGAGCGCCGT GAAACGACGG CTCCGCAGCC GGACGATCCA CGCGCTAGAC
GTCCTCGAAC GCCACGATCG CCGGGCGCTC CTGCGCGTCC GCTGTGGCTC TGGAACGTAC
GTCCGAAAGC TCTGTCACGA CATCGGGCTC GCGCTCGGGA CCGGCGCACA CATGGGCGAA
CTCCGACGCA CCGCGACCGG CGACTTCGAC GATCGCGACC TCGTCACCAT GGAGGACCTC
GTCGACGCAC TGGCCGAGTG GACCGACAAC GACGACGAGA CGTGGCTCCA CGAGACCGTC
GCACCCGCGG AACGCGCCCT GGCCGGCTAT CCCACGGTGA CCATCGCACC CAGCGCCGCC
CGCGAAGTGG CACACGGCGC ACCGGTCTAC GCACCCGGCG TGATCGAGAC CGACGCCGAG
CAGGGTGCCG ACGTGTGCTG CGTGACACCC GACGGCGCGG CGGTGTGTCT GGGCAGACTG
GTCGGCGACC CCGACGCCGA CAGCGGCGTC GCCGTCGAAC TCGAACGCGT CCTCGTCTAA
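
A GC content figure can be recomputed from a sequence printed in blocks of ten bases, as above. The sketch below is one plausible way to do it, assuming the percentage is simply the share of G and C among all bases; how the database rounds the reported value is not stated in this record. The function name `gc_content` is hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied as printed (blocks of ten bases).
first_row = "ATGGTCCGCG ACTCGCCCGA ACACCGGTCG CCCGCCGATC TGCTCACCTT CGGCGTCGTC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bp")
```

Passing the full 900 bp sequence instead of `first_row` gives the value to compare against the GC content field.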
 
Protein sequence
MVRDSPEHRS PADLLTFGVV NLDKPPGPSA HQVAAWVRDA VDDALDAAGD DRTLGRVAHG 
GTLDPKVTGC LPILLGDAAR AARVFDDAVK EYVTILELHD QAPPDFDSIV AEFEGEIYQK
PPRKSAVKRR LRSRTIHALD VLERHDRRAL LRVRCGSGTY VRKLCHDIGL ALGTGAHMGE
LRRTATGDFD DRDLVTMEDL VDALAEWTDN DDETWLHETV APAERALAGY PTVTIAPSAA
REVAHGAPVY APGVIETDAE QGADVCCVTP DGAAVCLGRL VGDPDADSGV AVELERVLV
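
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and this is an illustration rather than the database's own procedure.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGGTCCGCG ACTCGCCCGA ACACCGGTCG CCCGCCGATC TGCTCACCTT CGGCGTCGTC"
print(translate(first_row))  # MVRDSPEHRSPADLLTFGVV
```

The output matches the first twenty residues of the protein sequence above.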