Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3062 |
Symbol | |
ID | 7399035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | - |
Start bp | 321046 |
End bp | 322800 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643706868 |
Product | transposase IS4 family protein |
Protein accession | YP_002564490 |
Protein GI | 222475969 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATG AGAGCAGTCG GGTTCAAAGT GGGCTGACAA AGCAAGTTGA TGATGTCCTT ACTGCAGATA CTGACTGGAT CACACTTGCG AACGAACTGG ACGTGAGCCG CTATACGCTA CGGGACGCAC ACCCAGAGTG GAGTTCGTCG CTTCCGTTTC GGCCAATGTT TCTGGCGTAT CTGTGGGCAA CTGTCGAGCG TGAATCTCTG TCAGGAATCC CAGAACGCCT CTCTGACCGA CCGGAACTCG CCCGTGCATT TGGGTTTGAG ATGGATGATC TCCCCTCAGA AAGTAGCTGT AAACCAGTCC GGCTTGAAAG CCGATTCAGA AAGTTACAGA CGGTCGTCGA ATCAGGTGCT GAAGAGATCC GCCTGCTCGC GGCTGAACGA GGCGCACCAA TCGGGAATGA TCTTCTCAAA ACAGCGGACG ACGAAGACAA ACAGTCGCTG TCAAATCGAA CCGTCCAACG CTTGCTACGG AAGAAGGGGC ATCAGGTGCT TGATGAGTTG AAGTCGGTAG CCATCCCTTC AATCTCACTC TCTCGCCCGG ATGACGCGAT CTACGACGAC GATGAGTTAC TCGTCTTAGA AGCAATCGCG TCGATCAAAC AGAAGGCAGC ACACGATTCG GGCCAGAAGC TGGGTGACAT GAAAAATCCA GACCCAGATA TTGATGACCC GTTCTACGAG GACGGCCCAT CTGGTGAGAC GCTGTTGGAA GCCCTCAAGC AGATGTCTAT CGAGGAGATT GCGACTGTAC TGAATTTCGC TCTCCGGAAA ACCTACACAC GCGCGAAACC CCGAATCAGG GAGCTCGAAC ACGGGAACGG CTCACGGTTT GGGACTCGTG CGAAAGTCGC TCTGGATATG ACGTACGTTG CCTACTATGG CGATCGCGAC GAGATGGAAT GGGTACAGGG CGCACCTGAA GGAAAAGAGT ACAGTTGGTG TCACAAGTTT GCGACGGTCG TGATCGTCGG CGAGAACACC CACTACGTCG TTGGGGTGTG TCCGCTCGGG AGTACGGATT ACGCTGCGAC GGACGCCTAT CCCGGCAAGG ATAGTTCCTA CTACGTTGGG GATGTTGCAC GACAACTTCT CTCGATCGCC GAAGACTATG TCGACATCAG GATGGTGTAT GCCGATCGTG AATTTCACGC TGTAGATGTC CTTCAGACGC TTATTAACAA GCGGTTGGAT TACGTAATCC CTGCCAAGAA AGATCAACAT CGGATTGGAC CGATGTGTGA CCGGTTTGAC CAAGTGAAGC AGGGGTATCA CGAACCGAAT GACACCCCGC TGTATGTCGA GGAGGATTTC GTCATGCACG GTGTAGTGAA GGATGGCGTC TCAAACCACA CGGTACATAC GACCGTTGCC GTGTTACCCC CAGCGGAAGA TGATGATGTC CATGAAGAGG GATCGCCACA GCCGTTTATC ACCAGTCTCG ATGTGAGTGA TGAGGTCGCA CTCGATCGGC GCTGGGCGAA ACAGCAGATC GAACAGTACA GTGACCGCGG AGCGATCGAG AACTCGTACT CGTCGATCAA GAACGCAGCA GCGTGGACTA CCTCGAAGGA GTTTGGAGTA CGGTGGTTTC ATTTCGCCTT CGGGTGTGTG GTCTACAATA TGTGGCTGTT AGTCGATTTC CTCACACAAG AGCGCATTGG GGTCATTGAA ACCCGGAAGA AGCCCAGAAT CACACTCAGT CGGTTCCTTG ATTGGCTGGA CAAAGAGCTG ATCACACTCA TTTAG
|
Protein sequence | MSNESSRVQS GLTKQVDDVL TADTDWITLA NELDVSRYTL RDAHPEWSSS LPFRPMFLAY LWATVERESL SGIPERLSDR PELARAFGFE MDDLPSESSC KPVRLESRFR KLQTVVESGA EEIRLLAAER GAPIGNDLLK TADDEDKQSL SNRTVQRLLR KKGHQVLDEL KSVAIPSISL SRPDDAIYDD DELLVLEAIA SIKQKAAHDS GQKLGDMKNP DPDIDDPFYE DGPSGETLLE ALKQMSIEEI ATVLNFALRK TYTRAKPRIR ELEHGNGSRF GTRAKVALDM TYVAYYGDRD EMEWVQGAPE GKEYSWCHKF ATVVIVGENT HYVVGVCPLG STDYAATDAY PGKDSSYYVG DVARQLLSIA EDYVDIRMVY ADREFHAVDV LQTLINKRLD YVIPAKKDQH RIGPMCDRFD QVKQGYHEPN DTPLYVEEDF VMHGVVKDGV SNHTVHTTVA VLPPAEDDDV HEEGSPQPFI TSLDVSDEVA LDRRWAKQQI EQYSDRGAIE NSYSSIKNAA AWTTSKEFGV RWFHFAFGCV VYNMWLLVDF LTQERIGVIE TRKKPRITLS RFLDWLDKEL ITLI
|
| |