Gene Hlac_0548 details

Gene Information

Locus tag: Hlac_0548
Symbol:
ID: 7401683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 571878
End bp: 572951
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 72%
IMG OID: 643707613
Product: glycosyl transferase family 2
Protein accession: YP_002565220
Protein GI: 222478983
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
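
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(571878, 572951)
print(length)                     # 1074
print(protein_length_aa(length))  # 357
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.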


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0176894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTCT CGGTCGTTGT TCCCACCCTC AATGGGCGGG ATCGGTTAGC CGCCTGTCTC 
GACGCGCTGG CGGCCCACGC CCCCGAGGCG GAGGTGATCG TCGCTAACGG CCCCTCCGCC
GACGGCACCA CCGGGATGGT GCGGGACCGC GACGACGTGG ACGTGCTGGT GGAGATCTCC
GACCGCACGG TCAACGTCGC CCGCAACGCT GGTATCGAGG TCGCCACCGG CGACATAGTC
GCTCTCGTTG ACTACGACAA CCGGATCGGA GAGGGGTGGC TCGACGCGGT TCGCGCCGGG
CTCGACGACG CGGACGTTGT GACAGGACCA GTGACCCCGA TCGAACCGGA GCAGGGAGGA
CGCGACGGTG AAGCGTCCGC CAACGGGGAA CGCATTGTGC GCGACGACGA GCACGACGAC
GAACGCGACG ACGAGCACGA CGACGAACGC GACGACGAGC ACGACGACGA ACGCGACGAC
GAGCACGACG ACGAACGCGA CGGCCCGGAG CGCCGGACGA TCGCGGGCAC CGAGGTGACC
TACTTCGAGG GGGGCAACGT CGCCTTCCGG CGCGAGGCGC TCCGGGACCT GGACGGCTTC
GACGAGTATC TCCGCACGGG CGGCGCGCGC GACGCGGCGC ACCGGCTGGC GCAGATGGGA
CGAACCGTGG CGTGGCGCGA AGACCTGGCG GTCACCAAGG CGCTCCCGAG CCCGACGGCG
GCCGACTGCG GGCGCACCGC CCGCGAGTGG GGGTGGAAGT ACCGAGCGCT CGCGTACCGC
CTCTTGAAGA ACTACGGGGT CCGCCCGACC GTCGTCGCGC GGGCCGGGAC GCACGCGGCG
ACGGACGCGT TCGGAGCCGC CGGCGACGTG ATCCGCGGCG AGTCGACCCC GTCGCGGTGG
GTCGCCACCG GCCGCGACGT ACTCGTCGGC CTCGCCGGCG GGAGCTCCGA CGGCCTCGTC
GCGCGGAGCC GCGACCGGAG CCCGGCGCGG AACCCGAACG GTATCTCGAA GCGCGCCGAC
CGCGCCGTCG CGAAGTACGA CCGACGGGAG CCGAAGAGGG GGACGGAGGA GTGA
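
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any calculation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed from such text; the helper names are hypothetical, and the rounding used by the database is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short example input.
first_line = "ATGGACCTCT CGGTCGTTGT TCCCACCCTC AATGGGCGGG ATCGGTTAGC CGCCTGTCTC"
print(len(clean_sequence(first_line)))       # 60
print(f"{gc_fraction(first_line):.0%}")      # GC share of this line only
```

Passing the full gene sequence instead of `first_line` gives the value to compare against the GC content field.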
 
Protein sequence
MDLSVVVPTL NGRDRLAACL DALAAHAPEA EVIVANGPSA DGTTGMVRDR DDVDVLVEIS 
DRTVNVARNA GIEVATGDIV ALVDYDNRIG EGWLDAVRAG LDDADVVTGP VTPIEPEQGG
RDGEASANGE RIVRDDEHDD ERDDEHDDER DDEHDDERDD EHDDERDGPE RRTIAGTEVT
YFEGGNVAFR REALRDLDGF DEYLRTGGAR DAAHRLAQMG RTVAWREDLA VTKALPSPTA
ADCGRTAREW GWKYRALAYR LLKNYGVRPT VVARAGTHAA TDAFGAAGDV IRGESTPSRW
VATGRDVLVG LAGGSSDGLV ARSRDRSPAR NPNGISKRAD RAVAKYDRRE PKRGTEE