Gene Htur_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1686 
Symbol 
ID8742280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1751261 
End bp1754056 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content69% 
IMG OID646512264 
ProductProtein of unknown function DUF1998 
Protein accessionYP_003403244 
Protein GI284164965 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGCG ACCACGAATC CGAACGGTAC GACACCGACG GGGCGGACGT CCCGGTCACG 
GGCGACGAAC TGGTCGACAC CTTCCCCGGC TACCGCGACG AGGGCGATAT CACGGTCCTC
GAGCGTCCCG GCCGCGAGGC GTCAACGGTG CCCAACGAGC GCGCGCTCCG CCCCGAACTG
GCCGAACCGC TCGAGCACGA CCTCTACGCC CACCAGGCCG AGGCCCTCGA GGCGCTGGCC
CGTGAAGAGA ACGTCTGCGT CGCGACGAGC ACGTCCTCGG GGAAGACCCG AATCTACGCG
CTCCAGATCG CCAGGAACTA CCTCGAGGCC CGCGCTCGGG GCGAGGACGC CACGGCGTAC
GTTCTCTACC CGACGAAGGC GCTCTCGCGC GATCAGGAGC GCGAGTTGAA CGACCTCTTC
GACCAGTTGG GCCTCGAGAT CACGGTCCGC GTCTACGACG GCGACACCGA GCGCGGGAGC
AACCGCAAGC GGATCCGCGA GGAGGCCGAC GTCATCATCT CGAACTTCGC GGGCGTGAAC
ACGTACCTCC ACGACCACGA CCGCTGGGCG CGGTTCCTCT CGGCTTGTGA CCTCGTCGTG
ATCGACGAGT CCCACACCTA CACGGGCGTT CACGGGATGC ACGTCGCCTG GATCGTCCGC
CGGCTGAAGC GGGTCCTCGA ATACTATGAC GCAGACCCGC AGTTCGTCCT GACGAGCGCG
ACGATCGGCA ACCCGGGCGA GCACTCCGCG GCACTGATCG ACGAGTCCGT GACCGTCGTC
GACGAGGACG GCTCGCCGAC GGGGCCGCGG GATCTGGTGC TCTGGAATCC GCCGCCGCGG
GCCCGCGAGG ACGAGCGAAG CGAGTCCTCG GAGACGCGAA CGGGAGCGAC GCGGGCCGAG
CGCAGCGAGG GCCGCGAGGA TTCAGACGAC GAGCGAGACG AATGGGGCGA GAACGATGCC
GACACCGAAT CGGACGACCC CGCGGACGCC GTCGTCGAGC GCGTCCCCGC CACCGTCGAG
GCCCCGCGAA TGCTCTCGCA TCTGACCTAC CACGACGCCC AGACGCTGCT CTTTGCCCCC
TCGAGGAAGC TCGCCGAACT CTCGGTCAAG CGGGCGTCGA AACACCGCCA CGACAACCGG
CGCTACTACG CGAATCCCGA CCGCGGCAGC GCGATCGAGC CCTACCACGC GGGTCACTCG
CGGAAGAAGC GCCACGGGAC CGAACACCAG CTCAAGACCG GCGTGCTCGA CGGCGTCGCC
TCGACCAACG CCCTCGAGCT GGGGATCAAC ATCGGCGAGA TGGACGCGAC GGTCCAGCTC
GGCTACCCGG GACAGCGCCA GTCGTTCTGG CAGCAGATCG GCCGCGCGGG TCGCGGGACC
AAGCGCGCCC TGTCGGTGCT CGTGGCCGAA CACCGCACCC TCGACCAGTA CGTCGTGAAC
AATCCCGACT ACCTCCTCGA GTCCGACGTC GAGGACGCGG TCGTCGACGT GGACAACGAC
GCGGTGTTCG CCCAGCACCT GCGCTGTGCG GCCGACGAAC TCGCCGTCGA CGACTCAGAT
ATCGGCGGGC TCGCCGACCG CGAGCGCCTC GAGCGGGCGA TCGAGATGTG GCGACGCGCG
GGCCAGTTGC GAGGGAGTCT CGAGACGGGC GTCTCCTACG TCGGCCCGCC GCGACCACAG
CAGACGATTT CGCTGTACGC GACGACGGGC GAGGAGTACG AGGTCGACCT CGCGGACGGC
GTCGACGAAC GCCACGATCC GGGAATGGAG CCGCTGGCGA GGGAGCGCGT GCTGCGGGAC
TTCCACGAGG GCGCGGTTCG GCTACACCAG GGCCAGCAGT ACGAGGTCGT CGACGTCGAC
CACGACGCGC CTCGGCCCTC GGTGACGGTC CGCCCGACGG ACGTCGACTA CTACACGCGG
ACCCGGACCG ACGTCACGGT CCTCGACGCG GTCTCGGAGG AGTCGCGGGA CATCGGCAAC
TTCACGCTGC ACTTCGGCCG CGGGCGGGTA CTCGTCTACC ATGGCACCTA CGACAAGGTC
GCGGTCCACG GCGGCAAGCG CAAAGAGCAG GGGATTCCCA CGGAGAACCC GCCGCTCGAG
ATGGAAACTC AACTGTGCTG GCTCGAGGTA CCACAACGAA TCGAGCGGGC GCTGATCGAG
AAGTACCGAG AGTTCGAGGT GCCCGAACTC GAGGACGGCC TCGCCGGAAC AGCCCACCTC
GGCTACGCGG GCGGGCTCCA CGCCGCTGAG CACGCGACCA TCGGCGTCGC CCCGCTCGAG
TTGATGGTCG ACAAGCGCGA CCTCGGTGGA CTGGCGACGC TGACGATCGA CTCGCATCTC
GATCAGGACG CGGGCGCAGA TAGCGGGGGC GGTATGGGTC CGGGTACTGG GCCCGCGGGC
GCGAGCGGCG ACGGCGCACC GCGAAACATC GCCGCGGCCG AGGCCACGGT CCGGGAGATC
GCACAGGGCC TCGAGCGCAC CCCCGCCAGC GGCTGGTTCA TCTACGACGG CATCGAGGGC
GGGCTGGGCT TCGCGCGGGC GATCTACGAG AACTACGAGG CCGTCGCCGA GCGCGCTCGA
GACCTCATCG CGGACTGCGA CTGCGGGAAC GTCGACGGCT GTCCGGCCTG CGTGATGGAC
GATCAGTGCG GCAACGACAA CCAGCCGCTG CACCGCGACG CGGCCGTCGA TGTACTGGAT
CAGTTGCTGG GTGAGGCGGA CGAGACCGCG CTCGAGGCAC ACCTCCCAGA CGAGGAGTAC
GGCGGGGACC GACGGCCGCC GCTGTTCTAC GCCTGA
 
Protein sequence
MSSDHESERY DTDGADVPVT GDELVDTFPG YRDEGDITVL ERPGREASTV PNERALRPEL 
AEPLEHDLYA HQAEALEALA REENVCVATS TSSGKTRIYA LQIARNYLEA RARGEDATAY
VLYPTKALSR DQERELNDLF DQLGLEITVR VYDGDTERGS NRKRIREEAD VIISNFAGVN
TYLHDHDRWA RFLSACDLVV IDESHTYTGV HGMHVAWIVR RLKRVLEYYD ADPQFVLTSA
TIGNPGEHSA ALIDESVTVV DEDGSPTGPR DLVLWNPPPR AREDERSESS ETRTGATRAE
RSEGREDSDD ERDEWGENDA DTESDDPADA VVERVPATVE APRMLSHLTY HDAQTLLFAP
SRKLAELSVK RASKHRHDNR RYYANPDRGS AIEPYHAGHS RKKRHGTEHQ LKTGVLDGVA
STNALELGIN IGEMDATVQL GYPGQRQSFW QQIGRAGRGT KRALSVLVAE HRTLDQYVVN
NPDYLLESDV EDAVVDVDND AVFAQHLRCA ADELAVDDSD IGGLADRERL ERAIEMWRRA
GQLRGSLETG VSYVGPPRPQ QTISLYATTG EEYEVDLADG VDERHDPGME PLARERVLRD
FHEGAVRLHQ GQQYEVVDVD HDAPRPSVTV RPTDVDYYTR TRTDVTVLDA VSEESRDIGN
FTLHFGRGRV LVYHGTYDKV AVHGGKRKEQ GIPTENPPLE METQLCWLEV PQRIERALIE
KYREFEVPEL EDGLAGTAHL GYAGGLHAAE HATIGVAPLE LMVDKRDLGG LATLTIDSHL
DQDAGADSGG GMGPGTGPAG ASGDGAPRNI AAAEATVREI AQGLERTPAS GWFIYDGIEG
GLGFARAIYE NYEAVAERAR DLIADCDCGN VDGCPACVMD DQCGNDNQPL HRDAAVDVLD
QLLGEADETA LEAHLPDEEY GGDRRPPLFY A