Gene Hlac_1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1065 
Symbol 
ID7400137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1063753 
End bp1064862 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content49% 
IMG OID643708132 
Productglycosyl transferase group 1 
Protein accessionYP_002565731 
Protein GI222479494 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.912454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTATC TCAGACAGAG CGTAAGGAAG GTTCATATTC GTATATCCGC CGGAGACCAT 
TTAATATGTT ATCAACGGAG CGTTCTTATT GGTGGAAGAA GTCATTCAGA TGTGACTCAG
GCGAACCCCA AAGTTCGGTG GTTTACTCCA GATAAACCGG ACAACATCAG TGTTGGGAGA
GAACGAATTG CCTCCCATCT CCGACAAAAC GAGGGGTTCC ACGTTGATGT TGTGGGAACT
ACACTCCCAA CTGTCCGAAC AGCGATCAGA GAACGTGACC GATACGACGT GATCCTTGGA
ACCACTCGTG CAGGGGCGAT TGCCGGGACA CTGATCGGAC GCGTAACCGG AAAGCCCGTG
ATTGTTGACC ACGTAGATCC CATTCGACAG TTCCGTGAAA ACAACTCTCC GTTCTTTTCG
ATCCCAGTTC GAATAGCCGA AAATATCTCA TTCGCGCTAG CCGAGTTGGT ACTGTACGTG
TACGAGGAAG AGTACGATCG GGTTTCTCGC TACGCTAGCC AGCATATGAA AACCGAACTC
GGTGTTGATT ATCGTCGGTT TGCTAGTCCC AATTCAGAGA TCATTGATTC TGTTCAGGAT
CAGTTAGCTG AATACGAGCT TCGTGAATAT GTAGCAATCT ACGTCGGCGG GCTTGAACCC
ATATATCACA TCAGAGAGTT ACTGATGGCG ATGTCGTATC TTCCTGACTG GTCGTTGATT
GTTCTCGGAG AGGGCAGTCT CAGAGGAATG CTTGAAGAGG TGGATGCCGA CCAAGAGAAC
ATTCACTATT TAGGACTCGT TCCACACGAG ACCATCCCTG GGTATCTCAA TGTGGCTGAT
GTCGGCGTTT CATTGGTTGA TGACCCTCAT ACACTCAAGA TATTAGAGTA CGGTGCTGCG
GGACTATCGG TCGTTCAAGC TAGTGGACTA GCGGAAGAGA GATTCCGGGA ACGGGTGGAA
TATGCCGATT CCGATCCAAG ATCTATAGCG GATGCTATCA GGCGTGCCGG AGAGCGTGAA
AACGTTGAAC AACTCCAGTC GTTCATATCT GAATTTGATT GGAAGCAGAT CGCTGGAGAT
TATGTGGATG CGCTCAAAAG CATAAAATAG
 
Protein sequence
MSYLRQSVRK VHIRISAGDH LICYQRSVLI GGRSHSDVTQ ANPKVRWFTP DKPDNISVGR 
ERIASHLRQN EGFHVDVVGT TLPTVRTAIR ERDRYDVILG TTRAGAIAGT LIGRVTGKPV
IVDHVDPIRQ FRENNSPFFS IPVRIAENIS FALAELVLYV YEEEYDRVSR YASQHMKTEL
GVDYRRFASP NSEIIDSVQD QLAEYELREY VAIYVGGLEP IYHIRELLMA MSYLPDWSLI
VLGEGSLRGM LEEVDADQEN IHYLGLVPHE TIPGYLNVAD VGVSLVDDPH TLKILEYGAA
GLSVVQASGL AEERFRERVE YADSDPRSIA DAIRRAGERE NVEQLQSFIS EFDWKQIAGD
YVDALKSIK