Gene Hlac_0580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0580 
Symbol 
ID7401715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp597223 
End bp598557 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content57% 
IMG OID643707645 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002565252 
Protein GI222479015 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.835378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.194869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTATAC AGGCGTTCAG AATGCTGGAA GTCCACCGCA CCCATCAGGC AAAAATCCTC 
AACCACTCAC AGGGAGAGGA ATCGCTTGAC CGGCACGGGT GGAGCGCCAG CAAACTCTGG
AACGTCGCGA ACCACCACTC CCGAAAAGTC TGGAAGGAGA CGGGCGAGAT TCCCGGTCAC
GGCGACCTCA AAGACGAGTT GAAGACGCAT CCAAAATACA ACGGACTCCA TTCTCAGTCC
AGTCAGCGCG TTCTGGAGGA ACTCGCTGAA GCCTTCAACT CGTGGTACGG CTCCGACGAC
GATCGGGACA ATCCACCCGG CTATCGGAAA GAAAACTACT ACGACGACCA AGGCCGTCGC
GTCCACGAAG AACACCCGCG TTCCACTGTG ACGTGGAAGC AGAACGGCAT CAAACACGAC
ACCACAAACA ACCGTGTTCG CCTCTCAAAA GGTGCGAACC ACAAGGAACA CCCGAGAGCG
TGGGAATACA TCCTTGTCGA ATACGAGACA CGCCCCGGCG TCACGGTCGA GAACCTACAG
CAGGTTCGTG CTGTCTACGA CAGCACAAAG GAACGCTGGG AACTCCACCT CGTCTGCAAA
GACGAGATCG AGACACCCAA CGCCCCCGGC AACGAGACGG CAGGTATCGA CCTCGGGATC
AGCAACTTCG CCGCCGTCGC CTACAGCACC GAAGACGCCG ACCTATACCC CGGCAACCGT
CTGAAGCAAG ACGGCTACTA CTTCCCCAAG GAGATCGCCA AGTGCGACGA CAGCGGTGGT
GACAGGGCTA CTCGGCTCCA TCACAAGTGG GCGGAACGCC GCACTCACTT CTTCCATTCC
TTGGCGAAAC ACATCGTCGA ACGGTGTGTC GAGAAGGAAG TAGGACGCAT CAACGTCGGA
GACTTGGAAG GGGTCCGTGA AGATGAGAAC GGCAGCTCGA AGAACTGGGG CAAGCACGGG
AATCTCGACT TACACGGGTG GGCGTTCGAC CGCTTCAGCT CGATTCTCGA ATACAAGGCG
AGAGTCGAGG GAATCGAGGT CTTGGAAGTG TCTGAGCGGG ATACGAGCAA GACATGTTGC
ACATGCGGGA AAACAGACGA CTCACAGCGC GTCCACCGCG GTTTGTACGT CTGTGATGAG
TGCGACGCAG CGTTCAACGC CGACGTGAAC GGGGCGGAGA ACATCCGTCT CGACAGCAAC
GAAAGTAACT CCAAGTCTGC ACCCGATTTG GGTGGAGATA GGAGTACCGG CTGGTTGGCA
CAGCCCGGAG TCTATCTTCA CGACCTCTCT CGAGGATTCC AACCTCGGAC AGAGGTGGTA
GACTGCAAAC CCTAA
 
Protein sequence
MLIQAFRMLE VHRTHQAKIL NHSQGEESLD RHGWSASKLW NVANHHSRKV WKETGEIPGH 
GDLKDELKTH PKYNGLHSQS SQRVLEELAE AFNSWYGSDD DRDNPPGYRK ENYYDDQGRR
VHEEHPRSTV TWKQNGIKHD TTNNRVRLSK GANHKEHPRA WEYILVEYET RPGVTVENLQ
QVRAVYDSTK ERWELHLVCK DEIETPNAPG NETAGIDLGI SNFAAVAYST EDADLYPGNR
LKQDGYYFPK EIAKCDDSGG DRATRLHHKW AERRTHFFHS LAKHIVERCV EKEVGRINVG
DLEGVREDEN GSSKNWGKHG NLDLHGWAFD RFSSILEYKA RVEGIEVLEV SERDTSKTCC
TCGKTDDSQR VHRGLYVCDE CDAAFNADVN GAENIRLDSN ESNSKSAPDL GGDRSTGWLA
QPGVYLHDLS RGFQPRTEVV DCKP