Gene Hlac_3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3334 
Symbol 
ID7402190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp86164 
End bp87501 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content60% 
IMG OID643709886 
Producttransposase (ISH6) 
Protein accessionYP_002567452 
Protein GI222481216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGCCA CAATCGACGT GCGGTTCGAA CTGAGTATCG ACGACGACAA AACGCTACCG 
CTCGCCACGC TTGCCGAGGC CGTCACTGAC CAGAACCTCG AAGCAGTCCT TCTCGAATCG
CTGGTCGAGA GCCTCGACGC CGCCAGCGTC GAGGCGCTCT GTGGTGAGAA ACACGCACAT
GGCAACGGTG ACCAGCGCTT CCAACGCGCC GGCACCGACA CCCGCACAGC TGTCACAACT
GCCGGAGAAC ACGAGTTCTC TCTCCACTAC GTCGAAGATA CAGCCGCTTC CCCAGACGAA
TCCAGCTACT TCCGGCCCGT CGAAGACGTT CTCGACTTCG ACGGGCAGAA CCGCTATCAG
CAGGACATCG CCGCCAAAAG CGTCGATCTC GCTACCTCGC TCAGCTATCG AGACGCTGCC
AATCACGGCG ACAGCTTCGT CTCGATGCCG TCGCCGACCA CCATCAACCG CCGTGCCAAG
AAATACGGCC ACAAGCTCAA ACAGTTCCTT CCAGACTGTG TCGCTGGCAC AGACGCTGAC
GCCGTCATTC CTGACGGGAC AAAGTGCCAC AGCCAAGACG ACGACCGCTC GTCCCACTCC
GTCCAAGCAA CGCTCGGCGA AGACACCGCC GAAGAGTCAC GCTCCCTGCT GGATCTGTCG
GTCAACGCTG ACTGGGACGA AACTGCCGCC GAACTCGATG ATATCGGCGC AGTCACTGAC
GACGCGACGG TCGTCAGTGA CGCTGATAGC GGCATCGTCA CAGCCTTTAC CGACGAAAAC
CGTGACCACC AGCTCGATCT CGTCCACGTC GGCCGAACGC TGGGTTACAC CCTCTGGGAC
GATGGCGTCT TCTCCTTGGA CCGTCGGAAG GAGATCGTTT CGGAGGTGAT CGACGAGGTG
TTCCATCTGA AGAACTCTGT GGCGAAGCAT CGTCCAGCGG AGGAGTTCGC GGCGATCCGC
TCGCGGATCG CGCGAACGAG AGAGCGATTA GAGAAGACAG CGTGGCAACT GGAGCAGTTC
GGGTCAGCAA AGGCTGCAGG GTATCTTCGG CGGTGGCTGC CGTCGATTGT GACGTTCGCC
GAGCACGCTG TCGAGGGGTT CGAGGTTCCG TGGACCTCGA ACCCCGTCGA ACGACTGATG
GGCGAGGTCA GCAAGCGGTG CAAGAACCAG TGGATGCGCT GGACAGCAGA GGGATTGGAA
GCGATACTCC AACTTCGGTT GGTGAAGTAC GCCGACCCCG AGTACTACCA AGCGTTCCTC
GACGAACTGC TCCAACGTTC GACCAAAACA GCAATCAACT GTGACCTCTC AATTGAGAGT
ACCAGCGGCA AAGTCTAG
 
Protein sequence
MHATIDVRFE LSIDDDKTLP LATLAEAVTD QNLEAVLLES LVESLDAASV EALCGEKHAH 
GNGDQRFQRA GTDTRTAVTT AGEHEFSLHY VEDTAASPDE SSYFRPVEDV LDFDGQNRYQ
QDIAAKSVDL ATSLSYRDAA NHGDSFVSMP SPTTINRRAK KYGHKLKQFL PDCVAGTDAD
AVIPDGTKCH SQDDDRSSHS VQATLGEDTA EESRSLLDLS VNADWDETAA ELDDIGAVTD
DATVVSDADS GIVTAFTDEN RDHQLDLVHV GRTLGYTLWD DGVFSLDRRK EIVSEVIDEV
FHLKNSVAKH RPAEEFAAIR SRIARTRERL EKTAWQLEQF GSAKAAGYLR RWLPSIVTFA
EHAVEGFEVP WTSNPVERLM GEVSKRCKNQ WMRWTAEGLE AILQLRLVKY ADPEYYQAFL
DELLQRSTKT AINCDLSIES TSGKV