Gene Hlac_0436 details

Gene Information

Locus tag: Hlac_0436
Symbol:
ID: 7401054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 452921
End bp: 454195
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 59%
IMG OID: 643707500
Product: transposase, IS605 OrfB family
Protein accession: YP_002565108
Protein GI: 222478871
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
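
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable and function names are illustrative only.

```python
# Values copied from the record above; the names are illustrative.
start_bp = 452921
end_bp = 454195
gene_length_bp = 1275
protein_length_aa = 424


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Number of codons in the CDS minus one assumed stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both flags are True when the record is internally consistent
# under the assumptions stated above.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the reported values agree: 454195 - 452921 + 1 = 1275 bp, and 1275 / 3 - 1 = 424 aa.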


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.178599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.292397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGTG CCAACACTTT TGAGGTCGTG CCACAGACCG AGAACGACAA AGAGTGCCTC 
CTACGGCTAC TCGATGCATC CGCTTCTCTG TGGAACGAAC TGACCTACGA ACGTCGTCAG
AACTACTTCG GTGACGGCGA CGTGTGGGAC ACTTCCGAGT ACCGAGGACG CTACAACGGC
GTCGTCGGAA GCGCGACTGT TCAACAGGTC ACGCGCAAGA ACAGCGAAGC GTGGCGGTCG
TTCTTCGCCC TCAAGGAGAA AGGCGAGTAC GCCAACCCAC CGTCGTACTG GGGCAACGAG
GAGGACGGAC GCGAACTCCG TACCTACATC CGATGCAACC AGTACACGAT TGAGTGGGGG
AAACGTAGCC GTCTCGAAAT CCCTGTCGGG CAAGAACTGA AAGACGAATA CGGACTCGGC
TACCACGAAC GACTCCGCCT CGAAGTCCGA GGCAACCCGA AGTGGGACGG CAAACAGGGT
CGTCTGGAAC TTGAGTACGA CGAGGTTAGC GACACGTTCA GGGCTTTTCA ACCAGTCACC
GTACCTGATT CTCGACTGGA TTCACCACTG GCTTCGGAAG AAGCCGCCCT CGACGTTGGA
GCGAACAATC TCGTCGCGTG TTCCACGACT ACTGGGAACC AGTACCTCTA CGACGGTCGT
GAGTTGTTCG GACGGTTCCG CGAGACGACA GACGAAATCG CCCGCCTACA GTCGAAACTC
CGAGAGGGTC GCTACTCCTC GAATCGGATT CGACGGCTGT ACCGACAGCG GACGAAGCGT
CGTGACCATG CACAGAACGC GCTGGTGCGC GACCTCGTTG AACGGCTGTA CGATGAGGGC
GTGGCGACGG TGTACGTGGG CGACCTGACA GACGTGCTGG AAGCGCATTG GTCGGTCAGG
GTGAACGAGA AGACGCACAA CTTCTGGGCG TTCAAGAAGT TCATCCACCG TCTCGCGTGC
GTCTGTGAGG AGTACGGCAT CAGCCTCGAA ACCGAGTCGG AAGCGTGGAC GAGTCAGACG
TGTCCCGAGT GTGGCGACCA CGAGAAGACG GTTCGCCACG AGGATACGCT GACGTGTCCA
TGTGGCTTCG AGGGGCACGC CGACCTCACG GCGTCAGAGA CGTTCCTTCG GGAAAACAGC
AATTGCGAAA TCAGGCCGAT GGCACGGCCC GTGCGATTCG AGTGGGACGA CCACGACTGG
TCGGGGAAAC TATACCCTCA CGAAAGTCCC AAAGAAGTGC GCACGAACCC GCAAGTTGCC
TCCGTGGGTC GGTAG
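
The GC content field reported for this gene can be recomputed from the sequence as printed. The following is a minimal sketch, assuming the sequence is pasted as a string with the same space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, not part of any database tooling.

```python
def gc_content(seq):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the display above)
    is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Example on the first block of ten bases from the gene sequence.
first_block_gc = gc_content("ATGAAGCGTG")
```

Passing the full gene sequence and multiplying by 100 gives a percentage that can be compared with the value in the Gene Information section.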
 
Protein sequence
MKRANTFEVV PQTENDKECL LRLLDASASL WNELTYERRQ NYFGDGDVWD TSEYRGRYNG 
VVGSATVQQV TRKNSEAWRS FFALKEKGEY ANPPSYWGNE EDGRELRTYI RCNQYTIEWG
KRSRLEIPVG QELKDEYGLG YHERLRLEVR GNPKWDGKQG RLELEYDEVS DTFRAFQPVT
VPDSRLDSPL ASEEAALDVG ANNLVACSTT TGNQYLYDGR ELFGRFRETT DEIARLQSKL
REGRYSSNRI RRLYRQRTKR RDHAQNALVR DLVERLYDEG VATVYVGDLT DVLEAHWSVR
VNEKTHNFWA FKKFIHRLAC VCEEYGISLE TESEAWTSQT CPECGDHEKT VRHEDTLTCP
CGFEGHADLT ASETFLRENS NCEIRPMARP VRFEWDDHDW SGKLYPHESP KEVRTNPQVA
SVGR
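
The protein sequence can be derived from the gene sequence by codon translation. The sketch below builds the codon table from the usual compact amino acid string in T, C, A, G order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons. Since this gene begins with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons not in the table (for example, ones
    containing N) are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
n_terminus = translate("ATGAAGCGTG CCAACACTTT TGAGGTCGTG")
```

For the three blocks shown, `n_terminus` is `MKRANTFEVV`, which matches the first ten residues of the protein sequence.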