Gene Hlac_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1069 
Symbol 
ID7400141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1068368 
End bp1069906 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content59% 
IMG OID643708136 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_002565735 
Protein GI222479498 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.544527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGA AACAGAATTC AATCATCCGT CGACTGTTCA AGAGCGGGCT CCTGCTGTTC 
GCCGGTCTCG TGCTTGAACT CGGGATCTCC TTTGCAGCCA AAATACTCAT CGCCAGACTG
CTTGGTCGTC CGGCATACGG CGTCGCGACC ATCGGCATTA CGACGCTGTC TTTCGCTTCC
ACGATTCTGC TGTTCGGGAT GAATACCGGC GTCGGCCGCT ACCTGCCGCG CTTCGACGAC
GAGGCCGACC GCAAGGGCAT CATGGCGTCC GGTCTGCAGG TCGTGATAGG TTTCTCCGTC
AGTTGCGCCG TTCTTCTGTT CATTTTTGCC GAACCGTTCG CGACGCGGGT TTTAGGCGCG
CCAGAGGCCG TCGTCGTGGT GCGCATCGCG TCGCTCGGCA TCCCCTTCGC GGTGGTGATG
AAATTCACTA TCGGCGTCGT GCAAGGACTC CAGCGGTCGC TCCCGAAGGT GCTCATCAGG
AACATCGGCC AGCCAATCGT CAGGTTCTCG CTGGTCGTCG TCTCGCTCTA CTTCGGCCTC
GGTGCCGCGG GCATCGTTGG GGCCTATTCG GCGACGTTCG CCGCCGCCGG TCTGGCGGGG
CTTTACTACG TTCTCACGCG GACGAACCTT CGGTCCTCCG TCACCGCGAA CATGCGACAG
CGAGAACTCG TCAGGTTCTC TGCACCGCTG ATGCTCACCG CTGCGATGCT GATGGTCCTC
TCGTATTTCG ACATCTTCAT GCTGAGTTAC TTCCGGACAT CCGGGGAAGT CGGGAGCTAC
AACGTGGTGT ATCCGCTTGC GGAACTGCTG ACAGCGACGC TCTCGGCATT CAGCTTTATC
GCGATGCCCA TCCTCTCGCA GTTGCATTCC GACGAGCGGA TCACTGAAAT GGACCGCACG
TACAAGGTCG TCACCAAGTG GATATTCATG GCTACGCTGC CGCCTATGCT CATCCTCATT
TTCTTCCCGA CGGCGTCGAT TCGGATGACG TTTGGACCGG AGTACACGGA TGGGTCGCTG
GCACTCGTGA CGCTCGCGCT CGGGTTCTTC ACGCACTCCG TGGCCGGCCC AAACGTGAAC
ACGCTCACGG CAATCGGGCG GACGCGAATC ATCATGTGGG ATAATCTCCT CGCGGGCGTG
ACCAACATCG CGCTCAACTT TGCGCTCATC CCGGAGTACG GTATCCTCGG CGCGGCAGTG
GCGACTGCCG TCTCCTACGC AGGGCTGAAC GTCCTCTACT CGGCTCAGCT CTACCGGCAA
ACAGGCATCC ATCCGATGAC AGCAGCGTTG TTCAAACCCG CAATCGCCGG CACCTTGTCA
ATGGTGGGTA TCTACTACGT CGTCACACGA TTTCTCGATA CGACGGCGCC GGTGTTGGTC
GGCATGGGTA TCGTGTTCGT GAGTCTCTAC AGCATCGCGA TTCTGGCACT CGGCGGCATT
GAGGAGGAAG AAATTATGCT CGTGCTGAGC TTTGAGGAGC GGTTCGGCGT GGACCTCGGA
CCGTTCAAGC GCGTCGCCCG GTTCTTCGTG GACGAGTAA
 
Protein sequence
MFEKQNSIIR RLFKSGLLLF AGLVLELGIS FAAKILIARL LGRPAYGVAT IGITTLSFAS 
TILLFGMNTG VGRYLPRFDD EADRKGIMAS GLQVVIGFSV SCAVLLFIFA EPFATRVLGA
PEAVVVVRIA SLGIPFAVVM KFTIGVVQGL QRSLPKVLIR NIGQPIVRFS LVVVSLYFGL
GAAGIVGAYS ATFAAAGLAG LYYVLTRTNL RSSVTANMRQ RELVRFSAPL MLTAAMLMVL
SYFDIFMLSY FRTSGEVGSY NVVYPLAELL TATLSAFSFI AMPILSQLHS DERITEMDRT
YKVVTKWIFM ATLPPMLILI FFPTASIRMT FGPEYTDGSL ALVTLALGFF THSVAGPNVN
TLTAIGRTRI IMWDNLLAGV TNIALNFALI PEYGILGAAV ATAVSYAGLN VLYSAQLYRQ
TGIHPMTAAL FKPAIAGTLS MVGIYYVVTR FLDTTAPVLV GMGIVFVSLY SIAILALGGI
EEEEIMLVLS FEERFGVDLG PFKRVARFFV DE