Gene Hlac_2092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2092 
Symbol 
ID7400612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2082197 
End bp2083363 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content58% 
IMG OID643709163 
Producttransposase (ISH3) 
Protein accessionYP_002566740 
Protein GI222480503 
COG category[L] Replication, recombination and repair 
COG ID[COG3385] FOG: Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0981749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0159333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTAAAA CCAAACAAGC AGACGGTGAG ATCCACGAGG ACCAGCTTCT TAACTTTCTC 
GTCAACCGCC TTGACGAGGA AGTTTCGCTC TCGTTAGCCA ATAACGCTGA AATCACTGCT
GAAGACATCT ATGAGGTCCT CGTCGGCGCT TGCGCCGACG GGACCTCTGT CTCTACGCTC
TGTGCGTCGA GCCAGAACTC ACCCGCTGGG AACACGGTCC TCTACCATCT TCGGACGAAG
TTCGAGCCGG AACGGCTCGA ACGAGTCGCT AACACGCTCC TGCGAAAGGA TCTCGATGAA
TTGCTCCCCG AACAGGTGGA GGTCTGCGCA GACCTCCACC TGCGGCCCTA CTACGGTGAC
GAAGACGACA CAGACGGCCT CTATCACTCG GTAGCGAAGC GTGGAACCAC TGCGTTCCAC
GCCTATGCCA CACTCTACGC GCGTGTGAAG AACAAACGCT ACACGCTGGC GGTACGCCGT
CTCAAAGACG GCGATACCGC AAGTAGTGTC CTCGCTGAGT TCTTCGGTGT CCTCGACGGC
CTTGACGCCG GGGTCAAGGC CGTCTACCTT GATCGCGGAT TCTACGACAG TAAGTGTCTC
ACGCTGCTTC AGGCGCACAA TTACGCGTAC GTGATCCCGA TCATCCGGTG GGGTGAGGCG
ATTCAGCAAG AGCTCTCGGA AGGATGGAGT CGCGTCATTC AGCATGATCT GACGGGGAAA
CTCGACGGTC ACAGCTGGAC CGTCGATTTT CCCGTCTACA TCGACTGTAC GTACCTAAAT
GGGAAGTATG ACGAGAACGG TGTGGCGCGT CACGGCTACG CCGCTGACGC GCCGTTCATC
GACTCACCAC GGGACGCTCG ATACCACTAC TCGAAACGCT TCGGTATCGA GTCAAGCTAT
CGCTTGTTTG AGCAAGCGAT AGCGACAACG ACAACACGAG ATCCAACGGT ACGGCTGCTG
TACGTGGTGG TGAGTCTCCT CTTACAGAAC GTCTGGCGGT ACCTTCACTA CGAGTATGTG
GCGACGCCCC GCCGAGGCGG GCGTCGCCTC TGGTGGTGGC CGTACAAGGA GTTCGTCAAT
ATGATTCGAC GAGCTGCGTG GACGGCCCTC GCGGTGCGTC GGGCCGTCCC CGCGAATCGG
CCACCTGACG ACCGATTCCA CCGCTAA
 
Protein sequence
MSKTKQADGE IHEDQLLNFL VNRLDEEVSL SLANNAEITA EDIYEVLVGA CADGTSVSTL 
CASSQNSPAG NTVLYHLRTK FEPERLERVA NTLLRKDLDE LLPEQVEVCA DLHLRPYYGD
EDDTDGLYHS VAKRGTTAFH AYATLYARVK NKRYTLAVRR LKDGDTASSV LAEFFGVLDG
LDAGVKAVYL DRGFYDSKCL TLLQAHNYAY VIPIIRWGEA IQQELSEGWS RVIQHDLTGK
LDGHSWTVDF PVYIDCTYLN GKYDENGVAR HGYAADAPFI DSPRDARYHY SKRFGIESSY
RLFEQAIATT TTRDPTVRLL YVVVSLLLQN VWRYLHYEYV ATPRRGGRRL WWWPYKEFVN
MIRRAAWTAL AVRRAVPANR PPDDRFHR