Gene Hlac_1024 details

Gene Information

Locus tag: Hlac_1024
Symbol:
ID: 7400094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1015853
End bp: 1017025
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 61%
IMG OID: 643708090
Product: transposase IS4 family protein
Protein accession: YP_002565691
Protein GI: 222479454
COG category:
COG ID:
TIGRFAM ID:
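
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1015853
end_bp = 1017025

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1173
print(protein_length)  # 390
```

Under these assumptions the computed values agree with the listed gene length (1173 bp) and protein length (390 aa).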


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.765298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.715821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCACCA TACCTGATCC AGACGAGTAC CTTTCGGCGT CGGATGTCAA AGACGTAGCG 
GAAGAGGTCA TTACGCCACT CCCGTTGCCG GGTGTCGAGG GGAGCCCCCT CGACCCCGGC
GACATCTGGC TCGTCGTCAT CCTAGCCTGC ACTAACCAGA ACTCGATTTG GGACACCTGC
AACGATACCG AGGGAACGCC GTGTGACGAC ACTGTCTTGA GGTGGCTCCA CACACTCAAC
CGTCAGTGGC TTGAGGTCGT TGCCAACCTT CTGCTCGCAC GGCTCGCCAT GACGATTTTC
GACCCTGACC GGTCGAGAAC CGTCTCCATC GACTTCATCG ACAATCCCTA CCACGGCGAG
CACCATGCTG AGAAAGGCGA ACTCTGTTCG ATGGCTCCTA AGGACGGGAC TACGACCTGC
CACCGCTACT GCACGGCGTA CGTCGTCTCG AACGGGAAGC CGGTGACGCT GGCGATGACT
TACGTCCGCA GTGACGAAGA TGAGGCTGAC GCGGTCGAGC GCGTGCTCGC CCGCGTCGAA
AACTATCCCT TCGAGATCGA TCTCTTGCTT GCCGACAGCG GATTCTACAA CGAGCGCGTC
ATCCGCCGCG CTCGTGATAT CGCCCCAACG GTCGTTCACG TGCCCAAGAA GGGCGAGCGC
ATGAAGGACA AACTCGAAAC TCACAAGTCG TACATGACGA CCTATCGCAT GTACAAGGAC
AGCGAGCGGG AACTGCGCTT CCCGCTCGCG GTCGCTGTCT CCTACCAGAA CGGAGATCGA
GGCAAGCACG GCGAGGTCGT TCGTGGCTAC GTGGCGTGTG GCGTTACTGA TCGCTCAGCG
AAGCAGGTCG AACACCGCTA CAGGAAGCGT TCAGGCATCG AAACGACCTA TCGCTTACTT
CGGCAAGCAC GCGGGATCAC GACGACGCGT GATCCCGTCG TGCGGTTTGC CATCATGTTG
GTCGCGGCAT TGCTGGAGAA CCTGTGGCTG GTGCTACGGT GGGCGGTCGT CGCCCGCCCA
CGGCGGGGCG GGCGCGACCT GCCCGAGGAG TTCACGTTCA AGACGTTCTG TGACTGGATT
CGTCATGAGC TGGAAGAGGA GTTACGCCGC CGGTGGAAGA TCAAAGCGAA CGGGGTTGGA
GTGCCAGCAT CACAGGCAAC GGCCGCGGGC TGA
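
A GC content figure such as the one listed in the gene information can be computed directly from a sequence in this space-grouped layout. The sketch below is illustrative only: `gc_content` is a hypothetical helper, not a function of the source database, and it is shown on a short fragment, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
fragment = "GTGTTCACCA TACCTGATCC"
print(f"{gc_content(fragment):.0%}")  # 50%
```

Passing the full gene sequence, with or without the spaces and line breaks, to the same function gives the whole-gene value.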
 
Protein sequence
MFTIPDPDEY LSASDVKDVA EEVITPLPLP GVEGSPLDPG DIWLVVILAC TNQNSIWDTC 
NDTEGTPCDD TVLRWLHTLN RQWLEVVANL LLARLAMTIF DPDRSRTVSI DFIDNPYHGE
HHAEKGELCS MAPKDGTTTC HRYCTAYVVS NGKPVTLAMT YVRSDEDEAD AVERVLARVE
NYPFEIDLLL ADSGFYNERV IRRARDIAPT VVHVPKKGER MKDKLETHKS YMTTYRMYKD
SERELRFPLA VAVSYQNGDR GKHGEVVRGY VACGVTDRSA KQVEHRYRKR SGIETTYRLL
RQARGITTTR DPVVRFAIML VAALLENLWL VLRWAVVARP RRGGRDLPEE FTFKTFCDWI
RHELEEELRR RWKIKANGVG VPASQATAAG
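
The relationship between the gene and protein sequences can be illustrated with a small translation routine. This is a sketch under stated assumptions: the codon table is built from the standard amino acid assignments, which translation table 11 shares, and an initial GTG is read as methionine because table 11 permits alternative start codons. The names `translate` and `START_CODONS` are hypothetical, and `START_CODONS` lists only a subset of the permitted initiators.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position encodes methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_codons = "GTGTTCACCA TACCTGATCC AGACGAGTAC"
print(translate(first_codons))  # MFTIPDPDEY
```

The result matches the first ten residues of the protein sequence, which begins with M even though the gene starts with GTG and not ATG.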