Gene Information |
Locus tag | Hlac_3073 |
Symbol | |
ID | 7399046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | + |
Start bp | 331485 |
End bp | 332759 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643706879 |
Product | transposase IS4 family protein |
Protein accession | YP_002564501 |
Protein GI | 222475980 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3385] FOG: Transposase and inactivated derivatives |
TIGRFAM ID | |
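The coordinate and length fields above agree with one another if Start bp and End bp are read as 1-based inclusive positions, and if the stop codon counts toward the gene length but not the protein length. Both readings are assumptions about this record's conventions, not statements from the source. A minimal Python sketch of that consistency check, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 331485
end_bp = 332759
reported_gene_length = 1275     # bp
reported_protein_length = 424   # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1275 424
print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```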
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGGC TCACTACACT GTTTCCCTCC GAGTTCCTCG AAGAGCACGC CGAGGAACTC GGCGTGGTCG AACGTGACCG CAAGCTCCAG ATCCCTGCCT TCGTTTGGGC GTTCGTGTTC GGCTTCGCCG CAGGTGAAAG CCGAACACTC GCCGGGTTCA GGCGATCTTA CAACTCAACT GCCGATGAGA CAATCTCGCC CAGTGGGTTC TATCAGTGGT TGACGCCGAC GCTTGCGGAG TACTTCCGCG ACCTCGTCGA GCGCGGTCTC GACGAGGTCG CTGTCTCTGA TGCTGTTGAC GCTGATACCG ATCGATTTAG AGACGTGATG GTCGCCGATG GAACGGTGTT GCGGTTACAT GAGTTTCTTT CAGATCAGTT CGAAGCCCGC CATGAGGAGC AGGCTGGAGC GAAGCTCCAC CTGCTCCACA ATGCCACAGA GCAGACGATC GAACGAATCG ATACTGCTGA CGAGAAAACA CACGACAGCA CCCTGTTCAA AACAGGGCCA TGGCTTGAGA ACCGCCTCAT GCTGTTCGAT CTCGCCTACT TCAAGTACCG CCGGTTTGCG CTGATCGACG AGAACGGCGG CTACTTCGTG AGCCGGCTGA AACAGAACGC GAACCCGGTG ATTACGGCAG AATTACGGGA ATGGCGCGGC CGCGCCATTC CCTTAGAAGG CAAGCAGCTC CGAACTGTTC TCGACGATCT CGATCGGAAG TACATCGATG TGGAGGTCGA AGTCGAGTTC AAGCGGGGGC CGTACAATGG GACACAGTCG CTGGATACGA AGCGATTTCG CGTCGTCGGC GTCCGCGACG AGGACGCCGA CGACTACCAC CTGTACATGA CGAATTTAGC GAGGAAGGAG TTCTTTCCGG CGGATTTAGC GGAGATCTAC CGCTGTCGGT GGGAAGTTGA GTTGCTGTTC CGGGAGCTGA AGACACAGTA CGAATTGGAC GAGTTCGACA CGAGTGACGA ACACGTGGTG AGGATCTTAT TGTACGCAGC GCTGCTGTCG CTGCTTGTAA GCCGCGATCT GTTAGATCTA GTCACTGAGC AGGCGGATGA TGAGCTTGTG TTTCCGACAG AGCGCTGGGC GGCGACCTTT CGGTCGCACG CCCAGCTTAT TCTCCACGAA CTCGGTGAGT TCCTCGGCTA CTCACCACCG CCGCTGCTCG ACCGGCTGAT CGAAGACGCT CAAAAGATCC ACAAGCGACG ACCAATCTTA CAAGAGACGC TCGCTACCGC TACACAACCG AGATGTGAGG CTTAA
Protein sequence | MRRLTTLFPS EFLEEHAEEL GVVERDRKLQ IPAFVWAFVF GFAAGESRTL AGFRRSYNST ADETISPSGF YQWLTPTLAE YFRDLVERGL DEVAVSDAVD ADTDRFRDVM VADGTVLRLH EFLSDQFEAR HEEQAGAKLH LLHNATEQTI ERIDTADEKT HDSTLFKTGP WLENRLMLFD LAYFKYRRFA LIDENGGYFV SRLKQNANPV ITAELREWRG RAIPLEGKQL RTVLDDLDRK YIDVEVEVEF KRGPYNGTQS LDTKRFRVVG VRDEDADDYH LYMTNLARKE FFPADLAEIY RCRWEVELLF RELKTQYELD EFDTSDEHVV RILLYAALLS LLVSRDLLDL VTEQADDELV FPTERWAATF RSHAQLILHE LGEFLGYSPP PLLDRLIEDA QKIHKRRPIL QETLATATQP RCEA
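As a rough cross-check of the two sequences, the sketch below translates the first 30 and last 15 nucleotides of the gene sequence and compares the result with the start and end of the protein sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in its amino acid assignments). The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for illustration. Only short fragments are used, so the sketch does not confirm the reported whole-gene GC content.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# Fragments copied from the gene sequence above (first three and last two blocks).
gene_start = "ATGCGTCGGC TCACTACACT GTTTCCCTCC"
gene_end = "AGATGTGAGG CTTAA"

print(translate(gene_start))  # MRRLTTLFPS
print(translate(gene_end))    # RCEA
```

Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.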