Gene Hlac_3062 details


Gene Information

Locus tag: Hlac_3062
Symbol:
ID: 7399035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 321046
End bp: 322800
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 53%
IMG OID: 643706868
Product: transposase IS4 family protein
Protein accession: YP_002564490
Protein GI: 222475969
COG category:
COG ID:
TIGRFAM ID:
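
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(321046, 322800)
print(length)                     # 1755
print(protein_length_aa(length))  # 584
```

Under these assumptions the reported gene length (1755 bp) and protein length (584 aa) agree with the start and end coordinates.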


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATG AGAGCAGTCG GGTTCAAAGT GGGCTGACAA AGCAAGTTGA TGATGTCCTT 
ACTGCAGATA CTGACTGGAT CACACTTGCG AACGAACTGG ACGTGAGCCG CTATACGCTA
CGGGACGCAC ACCCAGAGTG GAGTTCGTCG CTTCCGTTTC GGCCAATGTT TCTGGCGTAT
CTGTGGGCAA CTGTCGAGCG TGAATCTCTG TCAGGAATCC CAGAACGCCT CTCTGACCGA
CCGGAACTCG CCCGTGCATT TGGGTTTGAG ATGGATGATC TCCCCTCAGA AAGTAGCTGT
AAACCAGTCC GGCTTGAAAG CCGATTCAGA AAGTTACAGA CGGTCGTCGA ATCAGGTGCT
GAAGAGATCC GCCTGCTCGC GGCTGAACGA GGCGCACCAA TCGGGAATGA TCTTCTCAAA
ACAGCGGACG ACGAAGACAA ACAGTCGCTG TCAAATCGAA CCGTCCAACG CTTGCTACGG
AAGAAGGGGC ATCAGGTGCT TGATGAGTTG AAGTCGGTAG CCATCCCTTC AATCTCACTC
TCTCGCCCGG ATGACGCGAT CTACGACGAC GATGAGTTAC TCGTCTTAGA AGCAATCGCG
TCGATCAAAC AGAAGGCAGC ACACGATTCG GGCCAGAAGC TGGGTGACAT GAAAAATCCA
GACCCAGATA TTGATGACCC GTTCTACGAG GACGGCCCAT CTGGTGAGAC GCTGTTGGAA
GCCCTCAAGC AGATGTCTAT CGAGGAGATT GCGACTGTAC TGAATTTCGC TCTCCGGAAA
ACCTACACAC GCGCGAAACC CCGAATCAGG GAGCTCGAAC ACGGGAACGG CTCACGGTTT
GGGACTCGTG CGAAAGTCGC TCTGGATATG ACGTACGTTG CCTACTATGG CGATCGCGAC
GAGATGGAAT GGGTACAGGG CGCACCTGAA GGAAAAGAGT ACAGTTGGTG TCACAAGTTT
GCGACGGTCG TGATCGTCGG CGAGAACACC CACTACGTCG TTGGGGTGTG TCCGCTCGGG
AGTACGGATT ACGCTGCGAC GGACGCCTAT CCCGGCAAGG ATAGTTCCTA CTACGTTGGG
GATGTTGCAC GACAACTTCT CTCGATCGCC GAAGACTATG TCGACATCAG GATGGTGTAT
GCCGATCGTG AATTTCACGC TGTAGATGTC CTTCAGACGC TTATTAACAA GCGGTTGGAT
TACGTAATCC CTGCCAAGAA AGATCAACAT CGGATTGGAC CGATGTGTGA CCGGTTTGAC
CAAGTGAAGC AGGGGTATCA CGAACCGAAT GACACCCCGC TGTATGTCGA GGAGGATTTC
GTCATGCACG GTGTAGTGAA GGATGGCGTC TCAAACCACA CGGTACATAC GACCGTTGCC
GTGTTACCCC CAGCGGAAGA TGATGATGTC CATGAAGAGG GATCGCCACA GCCGTTTATC
ACCAGTCTCG ATGTGAGTGA TGAGGTCGCA CTCGATCGGC GCTGGGCGAA ACAGCAGATC
GAACAGTACA GTGACCGCGG AGCGATCGAG AACTCGTACT CGTCGATCAA GAACGCAGCA
GCGTGGACTA CCTCGAAGGA GTTTGGAGTA CGGTGGTTTC ATTTCGCCTT CGGGTGTGTG
GTCTACAATA TGTGGCTGTT AGTCGATTTC CTCACACAAG AGCGCATTGG GGTCATTGAA
ACCCGGAAGA AGCCCAGAAT CACACTCAGT CGGTTCCTTG ATTGGCTGGA CAAAGAGCTG
ATCACACTCA TTTAG
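
The GC content reported for this gene is the fraction of bases that are G or C. As a minimal sketch, a sequence laid out in space-separated blocks of ten, as above, could be processed as follows. The helper name `gc_content` is hypothetical, and the record does not say how the database rounds its percentage.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input, not the gene above: 4 of 8 bases are G or C.
print(gc_content("GGCC AATT"))  # 0.5
```

Pasting the full gene sequence into `gc_content` and multiplying by 100 gives a percentage to compare with the value in the gene information.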
 
Protein sequence
MSNESSRVQS GLTKQVDDVL TADTDWITLA NELDVSRYTL RDAHPEWSSS LPFRPMFLAY 
LWATVERESL SGIPERLSDR PELARAFGFE MDDLPSESSC KPVRLESRFR KLQTVVESGA
EEIRLLAAER GAPIGNDLLK TADDEDKQSL SNRTVQRLLR KKGHQVLDEL KSVAIPSISL
SRPDDAIYDD DELLVLEAIA SIKQKAAHDS GQKLGDMKNP DPDIDDPFYE DGPSGETLLE
ALKQMSIEEI ATVLNFALRK TYTRAKPRIR ELEHGNGSRF GTRAKVALDM TYVAYYGDRD
EMEWVQGAPE GKEYSWCHKF ATVVIVGENT HYVVGVCPLG STDYAATDAY PGKDSSYYVG
DVARQLLSIA EDYVDIRMVY ADREFHAVDV LQTLINKRLD YVIPAKKDQH RIGPMCDRFD
QVKQGYHEPN DTPLYVEEDF VMHGVVKDGV SNHTVHTTVA VLPPAEDDDV HEEGSPQPFI
TSLDVSDEVA LDRRWAKQQI EQYSDRGAIE NSYSSIKNAA AWTTSKEFGV RWFHFAFGCV
VYNMWLLVDF LTQERIGVIE TRKKPRITLS RFLDWLDKEL ITLI
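
The protein sequence follows from the gene sequence by codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it only in the set of permitted start codons, so a standard codon table is enough for a sketch. The function name `translate` is hypothetical, and the code assumes the input starts in frame at the first codon.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAGCAATG AGAGCAGTCG GGTTCAAAGT"))  # MSNESSRVQS
print(translate("ATCACACTCA TTTAG"))                  # ITLI
```

Both fragments match the corresponding ends of the listed protein sequence.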