Gene Hlac_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1603 
Symbol 
ID7399552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1622112 
End bp1623392 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID643708669 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002566258 
Protein GI222480021 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0811903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAC AGGTCGTTAC CCGCACCTAC ACTGCTTCCA TACGGAACCA GTCTCGGGTG 
CAAGACGACC TTGATTCGCT CGGGTTCGCC GCCTCAAAAC TCTGGAACGT CGGACGGTGG
ACGTGCAGTC GGATCTGGGA TAAAATCGAT CACATTCCCA CCCACAACGA ACTCACCACG
TACCTCAAAA ACCACGAACG CTATGATGAC CTGCATTCTC AGTCAAGTCA GCGAGTCCTT
CAAGAACTCG CTGAAGCGTT CAACGGCTGG TACGGCAACC GACAAAACGG AGATACGAAA
GCGAACCCGC CCGGCTACCG CAAACACGGC GACGAGCACC CGCGCTCAAC AGTCACCTTC
AAGCAGAAAG GCTTCAAACT CGACACTCAG TACGACCGAG TTCGACTCTC AAAAGGATCG
AACCTGAAAG AGTATTGGTC GGACTTCGTA CTGTGCAAGT ACCAAACTCG CCCCGATGTT
GACCTCTCCA CCGTGGAGAA CGTCCAACAA GTCAAGATTG TATGGACGGG TGACGAGTGG
GAACTACACT TCGTCTGTAA GGTCGAAATA GACGTGGATG AAGCCCCCGG TGAGAAGACG
GTGGGTGTTG ATCTCGGTAT CAACAACTTC GCCGCACTCG CCTACGAAGA CGGTCACAGC
GAGCTGTACC CGCTTAACTG CTTGAAACAG GACGACTACT ACTTCAGCAA GCTGATTGCT
CGGTGTGACA ACTCGGACTC CGAGCAGGCC ACCCGGCTGA ACCAGAAGAA GTCGGCCCGC
CGAACCCACT ACTTCCACAC CCTCTCCAAG CATATCGTCC AGCGGTGTGT TGACGAGGAA
ATTGGAACTA TCGTGGTGGG CGATCTCTCC GGCATCCGTG AGGATGAGGA GAACGGCGAG
TCAAAGAACT GGGGCAAGCA CGGTAATCTT GATTTGCACT CGTGGGCGTT CGACCGGTTC
ACCGGCCTCC TCGAATACAA AGCCGAGATG GAAGGCATCA CGTTCGAGCA AGTGTCTGAG
CGGGATACCT CGAAGTCGTG TTCGTGCTGT GGCCGGAAGC ATGAAGCCAA CCGTGTTGAA
CGCGGGCTGT ATGTCTGCGA TGAGTGCGGC ACAGTGGCGA ACGCAGATGT GAACGGCGCT
GAGAACATTC GGCAGAAAGT ATCTCCGAGT TCACCGAATC TCTCGGTGAA TAGGAGTAAC
GGCTGGTTGG CACAGCCATC GACGTTGTTG TTTGACAAGG AAACTGGTGC GTTCGCACCG
CAAGAACAGG TAACGTCGTA A
 
Protein sequence
MAKQVVTRTY TASIRNQSRV QDDLDSLGFA ASKLWNVGRW TCSRIWDKID HIPTHNELTT 
YLKNHERYDD LHSQSSQRVL QELAEAFNGW YGNRQNGDTK ANPPGYRKHG DEHPRSTVTF
KQKGFKLDTQ YDRVRLSKGS NLKEYWSDFV LCKYQTRPDV DLSTVENVQQ VKIVWTGDEW
ELHFVCKVEI DVDEAPGEKT VGVDLGINNF AALAYEDGHS ELYPLNCLKQ DDYYFSKLIA
RCDNSDSEQA TRLNQKKSAR RTHYFHTLSK HIVQRCVDEE IGTIVVGDLS GIREDEENGE
SKNWGKHGNL DLHSWAFDRF TGLLEYKAEM EGITFEQVSE RDTSKSCSCC GRKHEANRVE
RGLYVCDECG TVANADVNGA ENIRQKVSPS SPNLSVNRSN GWLAQPSTLL FDKETGAFAP
QEQVTS