Gene Hlac_3073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3073 
Symbol 
ID7399046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp331485 
End bp332759 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content57% 
IMG OID643706879 
Producttransposase IS4 family protein 
Protein accessionYP_002564501 
Protein GI222475980 
COG category[L] Replication, recombination and repair 
COG ID[COG3385] FOG: Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGGC TCACTACACT GTTTCCCTCC GAGTTCCTCG AAGAGCACGC CGAGGAACTC 
GGCGTGGTCG AACGTGACCG CAAGCTCCAG ATCCCTGCCT TCGTTTGGGC GTTCGTGTTC
GGCTTCGCCG CAGGTGAAAG CCGAACACTC GCCGGGTTCA GGCGATCTTA CAACTCAACT
GCCGATGAGA CAATCTCGCC CAGTGGGTTC TATCAGTGGT TGACGCCGAC GCTTGCGGAG
TACTTCCGCG ACCTCGTCGA GCGCGGTCTC GACGAGGTCG CTGTCTCTGA TGCTGTTGAC
GCTGATACCG ATCGATTTAG AGACGTGATG GTCGCCGATG GAACGGTGTT GCGGTTACAT
GAGTTTCTTT CAGATCAGTT CGAAGCCCGC CATGAGGAGC AGGCTGGAGC GAAGCTCCAC
CTGCTCCACA ATGCCACAGA GCAGACGATC GAACGAATCG ATACTGCTGA CGAGAAAACA
CACGACAGCA CCCTGTTCAA AACAGGGCCA TGGCTTGAGA ACCGCCTCAT GCTGTTCGAT
CTCGCCTACT TCAAGTACCG CCGGTTTGCG CTGATCGACG AGAACGGCGG CTACTTCGTG
AGCCGGCTGA AACAGAACGC GAACCCGGTG ATTACGGCAG AATTACGGGA ATGGCGCGGC
CGCGCCATTC CCTTAGAAGG CAAGCAGCTC CGAACTGTTC TCGACGATCT CGATCGGAAG
TACATCGATG TGGAGGTCGA AGTCGAGTTC AAGCGGGGGC CGTACAATGG GACACAGTCG
CTGGATACGA AGCGATTTCG CGTCGTCGGC GTCCGCGACG AGGACGCCGA CGACTACCAC
CTGTACATGA CGAATTTAGC GAGGAAGGAG TTCTTTCCGG CGGATTTAGC GGAGATCTAC
CGCTGTCGGT GGGAAGTTGA GTTGCTGTTC CGGGAGCTGA AGACACAGTA CGAATTGGAC
GAGTTCGACA CGAGTGACGA ACACGTGGTG AGGATCTTAT TGTACGCAGC GCTGCTGTCG
CTGCTTGTAA GCCGCGATCT GTTAGATCTA GTCACTGAGC AGGCGGATGA TGAGCTTGTG
TTTCCGACAG AGCGCTGGGC GGCGACCTTT CGGTCGCACG CCCAGCTTAT TCTCCACGAA
CTCGGTGAGT TCCTCGGCTA CTCACCACCG CCGCTGCTCG ACCGGCTGAT CGAAGACGCT
CAAAAGATCC ACAAGCGACG ACCAATCTTA CAAGAGACGC TCGCTACCGC TACACAACCG
AGATGTGAGG CTTAA
 
Protein sequence
MRRLTTLFPS EFLEEHAEEL GVVERDRKLQ IPAFVWAFVF GFAAGESRTL AGFRRSYNST 
ADETISPSGF YQWLTPTLAE YFRDLVERGL DEVAVSDAVD ADTDRFRDVM VADGTVLRLH
EFLSDQFEAR HEEQAGAKLH LLHNATEQTI ERIDTADEKT HDSTLFKTGP WLENRLMLFD
LAYFKYRRFA LIDENGGYFV SRLKQNANPV ITAELREWRG RAIPLEGKQL RTVLDDLDRK
YIDVEVEVEF KRGPYNGTQS LDTKRFRVVG VRDEDADDYH LYMTNLARKE FFPADLAEIY
RCRWEVELLF RELKTQYELD EFDTSDEHVV RILLYAALLS LLVSRDLLDL VTEQADDELV
FPTERWAATF RSHAQLILHE LGEFLGYSPP PLLDRLIEDA QKIHKRRPIL QETLATATQP
RCEA