Gene Hlac_2828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2828 
Symbol 
ID7398867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp88989 
End bp91067 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content51% 
IMG OID643706653 
Producttransposase IS4 family protein 
Protein accessionYP_002564279 
Protein GI222475758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCCA CTCCCGACTC AAAACAAGCG GTTCTCGACT CTCTGTCTGA GTCCCATCGG 
GAGTGGCCAT ACTCGACTAA CCACGATACA GTGAATAACG ACCCGTGGCC GCTCGCTTGG
GATGCTGCTG CATTCATTGA GGAATGGTTC TCTCATCCTG AGCATACCGA CGTGGAAGAA
GCGATCTGCC ACATAGAACT CGACCCGTCA GGGTTCGGGT ACGATGTCAC TGACTGGCAT
TTCAAGCAAC CTCCAGAACC GTTGTTGAAG GCGCACCTTC TTCGAATCGT CAAAGGTTGG
GGCGGCGAAA CTGCACTCCA CGACTACCTT GACGATAACA CCGAGTTAGT AGCTGCACTA
GGGTTCGGAA ACGGGCTTGC ATCGAAAACT ACGCTCTGGC GAGTCTGGAA CGAGAATCGA
CTTTCAGACG ACCACAAGCA AGTCGTCCGA ACCATCGGCC AAGTGCTTGT CAACGTAGCA
CGAGAACACG GTGTCCCTGC TCCCGATGAG GTGTTCTACC CCGACCCGAG TGTTGACGCT
CCAAATGCAG TCGCACAGGA TGATTTGACA GTCCGAGACC GGACGATTGC AAAAACACGG
GATGTCTGGA AGCAAGCGAA ACCAATGGTG ACTGAGAATT ATCAGTTGCC GCGTGGAAAG
AATACAGAAA TCCATCAGAA CGCGTTCTTT GAAGCACACG CGTTCATGGG GTCGCGTGAA
GAGATGTACG CGGAGGATGG GACGGTGAAC TTTGCTGCAG ATACCACACG CGAACGTGTA
CAGACTGGCA GTACTCATCG CCACCACTTG CACAAGATTG GCCCGACAGA TGCTCGTCAG
ATGCATCGTG ATGCGACACG AGAGTTAATT GAGCGTGCTC GCCGTGATTC TGAACTCGTT
GGAGGGGTCC TGGTCAGTAT TGATATCACG AAATCGAACC CGTACAGTAC AAAAAAGAAA
ATTGAGTCTG ACGAGGCTGG GAATGTGACA AACAAGTGGC TACTCGGCTA CAAAAACGAC
GATAAAAAGT CCACAGAGTT CTATTTCCAG TGGGCAAGCG TCCAAATCGT CGGATTAGAT
ATCCCGCTTG TACTTGATGC GATTCCGGTT CACCGTGGCC TGAAGCGTGC TACCATCGTT
GATCGGTTAC TGGAGAGTGC AACGGATCTC GTTGATGTCG AAATGGCAAT GATGGACCGG
GAATTCGCTC ACGATGCAGT CAAGGACGTA TGTGATGATC ACGAGGTCTA CTATTTGAAT
CCGGGTAAAA TGAGGACCAG TGAACGTGCA ACGTGTACCC GTCTTCGCCG TCAGGGTAAA
CTCATTCATA TTGAGAGTGA CGAAGACACC GACAGCAGCA CGGACGAAGG ACGAACGACG
CTTACGGACT TCACGTCTGG TGAGGAAGAT TCCGATGAGG AAGCGGGTGT TGTTCGGAAG
CGTGTGTATG TCCCTGCAAT AAACGCTGAG CGAACCGGTG ACGATGGCGA TGATGATTCA
GATGAGACCG ATAGTGAGGA TGGAGACAAC GAGTCAAACG ATAAGGATGA GCTGCGCCAA
GAGTTGCTAC ATGAGTTCTC AGAAGTAAAG GATTCAGATG CGGAAGAGAT CGAGCAGCTG
TTCGGTGACG TGATTGATGA GGTCCGAGAC GAGGAAGACA AGCGAAAGCT CCCGGGGAAT
AAGGAGGACA AGGACCGGTT TATGTTGTTC GAGTCGAATC ACCCGGCTCT TGAAATCCCA
GAAAACAGTG ACAATGGTGA GGAGCCTATG TCAGAGACGG AGAAGGCACA CATGGTGAGT
CGTGTTCTGC GTAAGTACAA GCACCGGTGG GGGATTGAGA ACGGATTTAA GCAAATCAAG
AGTTTCCGTG TTCGGACTAC GTCGATGAAT CCTGAGTATC GGTTTTTCAA CTTCCTATAC
GCGTGCACGC TGTACAACGT GTGGAGATTG ACTGATTTGC TAGTCAAGTT GGAGCTATTA
GCGGAGTCTG AGTTTGAGTA CAAACCCCTT GTGACAGCAG ACCTCTTCCT GACGATTGCG
AAGGAGTACA ATATCGTTGG GTTAGACCCT CCCGACTAG
 
Protein sequence
MTATPDSKQA VLDSLSESHR EWPYSTNHDT VNNDPWPLAW DAAAFIEEWF SHPEHTDVEE 
AICHIELDPS GFGYDVTDWH FKQPPEPLLK AHLLRIVKGW GGETALHDYL DDNTELVAAL
GFGNGLASKT TLWRVWNENR LSDDHKQVVR TIGQVLVNVA REHGVPAPDE VFYPDPSVDA
PNAVAQDDLT VRDRTIAKTR DVWKQAKPMV TENYQLPRGK NTEIHQNAFF EAHAFMGSRE
EMYAEDGTVN FAADTTRERV QTGSTHRHHL HKIGPTDARQ MHRDATRELI ERARRDSELV
GGVLVSIDIT KSNPYSTKKK IESDEAGNVT NKWLLGYKND DKKSTEFYFQ WASVQIVGLD
IPLVLDAIPV HRGLKRATIV DRLLESATDL VDVEMAMMDR EFAHDAVKDV CDDHEVYYLN
PGKMRTSERA TCTRLRRQGK LIHIESDEDT DSSTDEGRTT LTDFTSGEED SDEEAGVVRK
RVYVPAINAE RTGDDGDDDS DETDSEDGDN ESNDKDELRQ ELLHEFSEVK DSDAEEIEQL
FGDVIDEVRD EEDKRKLPGN KEDKDRFMLF ESNHPALEIP ENSDNGEEPM SETEKAHMVS
RVLRKYKHRW GIENGFKQIK SFRVRTTSMN PEYRFFNFLY ACTLYNVWRL TDLLVKLELL
AESEFEYKPL VTADLFLTIA KEYNIVGLDP PD