Gene Hlac_1937 details


Gene Information

Locus tag: Hlac_1937
Symbol:
ID: 7399889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1937122
End bp: 1938405
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 70%
IMG OID: 643709008
Product: putative pseudouridylate synthase
Protein accession: YP_002566585
Protein GI: 222480348
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1258] Predicted pseudouridylate synthase
TIGRFAM ID: [TIGR01213] conserved hypothetical protein TIGR01213
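
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1937122
end_bp = 1938405
reported_gene_length = 1284     # bp
reported_protein_length = 427   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

If either comparison prints False for another record, the usual suspects are a different coordinate convention or a partial CDS rather than an error in the sequence itself.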


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTAC TCGAGGTCGC GGCGCGGGCG ACCGGGACGG GGCCGGTGTG CGACGCGTGT 
CTCGGCCGGC TCGTCGCCGA CCGGAGCTTC GGGCTGTCGA ACGCCGAGCG CGGGTCGGCG
CTGCGGACCA GTCTTGCGCT CCGCGACGAC GAGGACTACG AGCCGGTCGA GACGGCAGAC
TGCTGGGTGT GTGAGGGGCG CTGCACCGAG TTCGACGAAT GGGCCGAGCG GGCCGCCGAG
GCGGTTGAGG ACGTGGAGTT CGCCACCTAC AACGTCGGCA CCCGTCCCCC GCCGCTGATC
GAGGAGAACG AGGCGCTGCT CCGCGAGGAA GCCGGGCTCG ACGACGACGC GGGCGAGCCG
TTCAAGTCGG AGTTCAACCG CGAAGTCGGG AAGCGGTTCG GCCGGCTCAC GGAGACGGAG
GTGTCGTTCG ACCGCCCGGA CGTGCAGTTC ACGATCGACC TCGCCGAAGA CGAGATCGAC
GCGAAGGTGA ACTCCACGTT TGTGTACGGC CGGTATCGAA AACTGGAACG GGACATCCCG
CAGACCGAGT GGCCCTGCCG CGAGTGCAAG GGCTCGGGGC GACAGGGCGC GGACCCCTGT
GATCACTGTG GCGGCTCCGG CTACCTCTAC GACGACAGCG TCGAGGAGTA CACCGCGCCC
GTCGTCGAGG ACGTGATGGA CGGCACCGAG GCGACGTTCC ACGGCGCGGG CCGGGAGGAC
GTGGACGCCT TGATGCTCGG AACCGGGCGC CCGTTCGTGA TCGAAGTCGA GGAGCCGCGC
CGCCGCCGGG TCGACACCGA TCGCCTGCAG GCCGACATCA ACGCCTTCGC CGACGGCGCC
GTGGAGGTCG AGGGGCTCCG GCTCGCGACC TACGACATGG TCGAACGCGT GAAGGAACAC
GACGCTGCGA AGCGCTACCG CGCCGAGGTA GCCTTCGACG CCGACGTGGA CGCCGACGCC
CTCGCGGCCG CGGTCGAAGA GCTTGAGGGG ACGACTGTCG AGCAGTACAC CCCGAACCGG
GTCGACCACC GCCGGGCGAG CATCACCCGC GAGCGCGACG TGTACGAGGC GACCGCCGAA
CTCGACGACG CCCGCCACGC GATCGTGGAG ATTCACGGCG AAGGTGGGCT CTACATCAAA
GAGCTGATCT CCGGCGACGA GGGCCGGACG GAGCCGAGCC TCGCAGGCCT GCTCGGCGTC
GGCGCCGAGG TCACCGCGCT CGACGTGGTC GCCGTCGAGG GCGAAGACGA GCCGTTCGAG
CGCGAGGAGT TCTTCCGGGA GTGA
 
Protein sequence
MDVLEVAARA TGTGPVCDAC LGRLVADRSF GLSNAERGSA LRTSLALRDD EDYEPVETAD 
CWVCEGRCTE FDEWAERAAE AVEDVEFATY NVGTRPPPLI EENEALLREE AGLDDDAGEP
FKSEFNREVG KRFGRLTETE VSFDRPDVQF TIDLAEDEID AKVNSTFVYG RYRKLERDIP
QTEWPCRECK GSGRQGADPC DHCGGSGYLY DDSVEEYTAP VVEDVMDGTE ATFHGAGRED
VDALMLGTGR PFVIEVEEPR RRRVDTDRLQ ADINAFADGA VEVEGLRLAT YDMVERVKEH
DAAKRYRAEV AFDADVDADA LAAAVEELEG TTVEQYTPNR VDHRRASITR ERDVYEATAE
LDDARHAIVE IHGEGGLYIK ELISGDEGRT EPSLAGLLGV GAEVTALDVV AVEGEDEPFE
REEFFRE
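
The sequences above are printed in blocks of ten characters, so they need light cleanup before use. The sketch below, under stated assumptions, strips the block formatting, computes GC fraction, and translates DNA with the amino acid assignments of translation table 11 (which are the same as those of the standard code). Only the first and last lines of the gene sequence are included for brevity; the helper names are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI codon order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = clean_sequence(
    "ATGGACGTAC TCGAGGTCGC GGCGCGGGCG ACCGGGACGG GGCCGGTGTG CGACGCGTGT"
)
last_line = clean_sequence("CGCGAGGAGT TCTTCCGGGA GTGA")

print(translate(first_line))   # compare with the start of the protein sequence
print(translate(last_line))    # compare with the end of the protein sequence
print(gc_fraction(first_line))
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` gives a way to compare against the Protein sequence and GC content fields of the record.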