Gene Hlac_2339 details

Gene Information

Locus tag: Hlac_2339
Symbol:
ID: 7401956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2336542
End bp: 2337777
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 73%
IMG OID: 643709412
Product: protein of unknown function DUF112 transmembrane
Protein accession: YP_002566985
Protein GI: 222480748
COG category: [S] Function unknown
COG ID: [COG1784] Predicted membrane protein
TIGRFAM ID:
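
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2336542
end_bp = 2337777

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1236 411
```

Both results agree with the Gene Length and Protein Length fields listed above.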


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.145216
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGC TGGTCAGACC CGTCGTCGAT CCGGGGTTCG CGGCGGCGGC GCTGGCGTTC 
CTGTGCGGGG GCGTCGCGCT CGGGACCGCG AGCGGGCTCG TGCCGGGGCT CCACGCCAAC
AACTTCGCGC TGCTTCTGGC CGGACTCGCT CCTTCGGTGT CGGCGGACCC ACTTCTCGTC
GGCGTCGCGA TGCTCGGCGC CGGCGTGGTC CACTCCTTCC TCGACATCGT GCCCGCGCTC
GCGCTCGGCG TCCCGGACGC GGCCACGGCC GTCGCCGCGC TACCGGGCCA CCGGCTCGTC
GTGGCGGGGC GCGGCCGCGA GGCGGTCCGG CTGTCCGCGG TCGGATCGGG GCTAGCGGTC
GCGCTCGCGG TCCCGCTTGC GATTCCGATC ACGTGGGTCA TGATCCGCGG CTACCCCGTC
GTGAGGAACC ACCTCCCGCT GCTTTTGGGC GGTGTGGTCG CTTTTCTTGT GCTTACCGAG
TCGTCGAAGC GGGCCGCGAT CGGCGGGCTC GTCGCCTTCC TCGCGAGCGC CGCGCTCGGC
TTTGCAACCC TCGATGTCGA CCCCGCGGCG CCGCTTGACG CCGGCGGGGT CCTCGCGCCC
CTTTTCGCCG GGCTGTTCGG CGCTCCGGTG CTTATCGACG CGATGGGAGG CGGGGGCGTG
CCGCCACAGG CCGACGCCCG GATCGCGATG AGCCGTCGCG GGCTCGGAAT CAGCGCGGGC
GGCGGGTCGC TCGCGGGCGC GGTCGTCGGG TACGTGCCGG GCATCTCCGC GGCTATCGCA
GCGGTCGCGG CGCTGCCGGC GGTCCCGCGC GAAGAGTCGG ACCGCGGCTT CGTCGTCGCC
ACCAGCGGCG CGAACACGGC GAACACGATC TTCGCGCTGT TCGCGCTCGT CGCGCTCGGG
ACGCCCCGAA CGGGGGTGAC CGTCGCGATC GACCGCGCCG AGGTCCCCTT CGCGCTCCCG
ATCTTGCTCG TCGCGGCCGC GACCGCCGCG GCGTGCGGGT TCGCGCTCGT GTTGCTCGTC
GGCGACGCGT ACCTCCGAGT CGTCGGGAAC GCGGACTACA CGCGGCTGTC GATCGGCGTG
CTGGCGCTGC TCGTGGGGAT ATCCTACGCG TTCGCTGGCA CAGTCGGGAT CGGCGTGTTC
GTCGTCGCCG GGGCGTTGGG GCTCGTCCCG CCGCGGGTCG GCGCGCGACG GGTGCACCTG
ATGGGAGTGT TGATCGGGCC GCTTATCGCC GGGTGA
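
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch; `gc_fraction` is a hypothetical helper name, and the example input is only the first ten-base block of the gene sequence, not the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First ten-base block of the gene sequence above.
first_block = "ATGGACGCGC"
print(gc_fraction(first_block))  # 0.7
```

Passing the complete gene sequence, with its spaces and line breaks left in, would give the fraction for the whole gene.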
 
Protein sequence
MDALVRPVVD PGFAAAALAF LCGGVALGTA SGLVPGLHAN NFALLLAGLA PSVSADPLLV 
GVAMLGAGVV HSFLDIVPAL ALGVPDAATA VAALPGHRLV VAGRGREAVR LSAVGSGLAV
ALAVPLAIPI TWVMIRGYPV VRNHLPLLLG GVVAFLVLTE SSKRAAIGGL VAFLASAALG
FATLDVDPAA PLDAGGVLAP LFAGLFGAPV LIDAMGGGGV PPQADARIAM SRRGLGISAG
GGSLAGAVVG YVPGISAAIA AVAALPAVPR EESDRGFVVA TSGANTANTI FALFALVALG
TPRTGVTVAI DRAEVPFALP ILLVAAATAA ACGFALVLLV GDAYLRVVGN ADYTRLSIGV
LALLVGISYA FAGTVGIGVF VVAGALGLVP PRVGARRVHL MGVLIGPLIA G
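
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code. It does not model alternative start codons, and `translate` and `CODON_TABLE` are illustrative names rather than part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    bases = "".join(seq.split()).upper()
    return "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases) - 2, 3)
    )


# First 30 bases of the gene sequence above.
print(translate("ATGGACGCGC TGGTCAGACC CGTCGTCGAT"))  # MDALVRPVVD
# Last 6 bases of the gene sequence above.
print(translate("GGGTGA"))  # G*
```

The first ten residues match the start of the protein sequence, and the final codon TGA is a stop, which is consistent with the protein ending in G.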