Gene Hlac_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2571 
Symbol 
ID7399796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2547423 
End bp2548664 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content60% 
IMG OID643709643 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002567213 
Protein GI222480976 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACG CCTACAAGTA CCGTCTCAAA CCGTCCGACG CCCACCGCGA GGCGTTGGAC 
CGCCACCGAG ACATTTGTAG GCAACTGTAC AACCACACAC TCAACCGCCT CAACGAGTAC
CAAGACGAGC ACGGTGAACT GCCATCCATG ACCACGCTTC GGTCGGAGCT ACCCGACCTC
AAGAAATGGT GGGACGGCCT CTCGGACGTG TACTCGAAGG TTCTCCAAAC CGTCGTGGAA
CGGCTGTTTG ACAACCTCAA AGGCCTCTCT GCGCTCAAGA AGAACGGCCA CGGCGTCGGC
CAACTCAAGT GGAAGCCGCC ACGGGAGTTC CGCAGTTTCA CGTACAGTCA GTCTGGCTTC
AAGCTCGACA AGAAGGGCGG TCAGACTGTG CTGTCACTCT CGAAACTCGC GGACATACCG
ATTCGGCTTC ACCGCGCCAT CCCCGACGAC GCCACGCTCA AGCAGGTCAC GGTCAAGAAG
GAACCGACGG GCGAGTGGTT CGCCACGTTC GGCGTCCAAA TGGACCGTGA ACCTCCTGAG
CCACCTGAGA ATCCCGAGAA GTGCGTCGGT ATCGACGTGG GGATTCTCAA GTACGCTCAC
GACACCGACG GCACAGCAGT CGGGTCGCTC GATCTCACCG ACAAACGTGA ACGCTTGGAG
CGCGAGCAAC GGAAACTCTC GCGGAAGCAA CACGGGTCGA ACAACTACGA GAAGCAACGG
CGACGAGTCG CGGAGTGTCA CGCTAATCTC CGGCAGAAGC GCCGTGACTT CTTGCACAAA
CTCTCGGCGT ATTACGCTCG GGAGTACGAT CTCGTGGCGG TCGAAGACCT GAACGTGAAG
GGGATGATGG AGTCGCCGGC GAACAGCCGC AACACCGCCT CCGCCGCGTG GCGGACGTTC
CTCTCGTTGC TCGAATACAA GTGCGAACGG GAGGGGGCAC ACTTCGTGGC GGTTGATCCG
AGAGGGACGA CCAAGGAGTG TGCGTCATGT GGCGTCTCGA CGGAGAAGCC GTTGTGGGTC
CGTGAACACT CCTGTCCCGC CTGCGGGTTT GAGGCGGACA GGGACGCGAA CGCGGCGTGG
AACATTCTTT CTCGTGGCCT CGGAGATGTA GGAGTGGGAC ACTCCGAATC AACGCCTGTG
GAGACTGCGC TCCCTGTGGA TACAGCAGTA TCTGCAAAGC GCGTCCTGGA AGCAGGAAGC
CCTACTCTCA AGGAGCGAGC GGCGTCAGCC GTGAGCGAGT AG
 
Protein sequence
MYYAYKYRLK PSDAHREALD RHRDICRQLY NHTLNRLNEY QDEHGELPSM TTLRSELPDL 
KKWWDGLSDV YSKVLQTVVE RLFDNLKGLS ALKKNGHGVG QLKWKPPREF RSFTYSQSGF
KLDKKGGQTV LSLSKLADIP IRLHRAIPDD ATLKQVTVKK EPTGEWFATF GVQMDREPPE
PPENPEKCVG IDVGILKYAH DTDGTAVGSL DLTDKRERLE REQRKLSRKQ HGSNNYEKQR
RRVAECHANL RQKRRDFLHK LSAYYAREYD LVAVEDLNVK GMMESPANSR NTASAAWRTF
LSLLEYKCER EGAHFVAVDP RGTTKECASC GVSTEKPLWV REHSCPACGF EADRDANAAW
NILSRGLGDV GVGHSESTPV ETALPVDTAV SAKRVLEAGS PTLKERAASA VSE