Gene Information |
Locus tag | Htur_1686 |
Symbol | |
ID | 8742280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1751261 |
End bp | 1754056 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646512264 |
Product | Protein of unknown function DUF1998 |
Protein accession | YP_003403244 |
Protein GI | 284164965 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
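The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record for Htur_1686.
length_bp = gene_length(1751261, 1754056)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2796 931
```

Both results agree with the Gene Length (2796 bp) and Protein Length (931 aa) fields in the record.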
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGCG ACCACGAATC CGAACGGTAC GACACCGACG GGGCGGACGT CCCGGTCACG GGCGACGAAC TGGTCGACAC CTTCCCCGGC TACCGCGACG AGGGCGATAT CACGGTCCTC GAGCGTCCCG GCCGCGAGGC GTCAACGGTG CCCAACGAGC GCGCGCTCCG CCCCGAACTG GCCGAACCGC TCGAGCACGA CCTCTACGCC CACCAGGCCG AGGCCCTCGA GGCGCTGGCC CGTGAAGAGA ACGTCTGCGT CGCGACGAGC ACGTCCTCGG GGAAGACCCG AATCTACGCG CTCCAGATCG CCAGGAACTA CCTCGAGGCC CGCGCTCGGG GCGAGGACGC CACGGCGTAC GTTCTCTACC CGACGAAGGC GCTCTCGCGC GATCAGGAGC GCGAGTTGAA CGACCTCTTC GACCAGTTGG GCCTCGAGAT CACGGTCCGC GTCTACGACG GCGACACCGA GCGCGGGAGC AACCGCAAGC GGATCCGCGA GGAGGCCGAC GTCATCATCT CGAACTTCGC GGGCGTGAAC ACGTACCTCC ACGACCACGA CCGCTGGGCG CGGTTCCTCT CGGCTTGTGA CCTCGTCGTG ATCGACGAGT CCCACACCTA CACGGGCGTT CACGGGATGC ACGTCGCCTG GATCGTCCGC CGGCTGAAGC GGGTCCTCGA ATACTATGAC GCAGACCCGC AGTTCGTCCT GACGAGCGCG ACGATCGGCA ACCCGGGCGA GCACTCCGCG GCACTGATCG ACGAGTCCGT GACCGTCGTC GACGAGGACG GCTCGCCGAC GGGGCCGCGG GATCTGGTGC TCTGGAATCC GCCGCCGCGG GCCCGCGAGG ACGAGCGAAG CGAGTCCTCG GAGACGCGAA CGGGAGCGAC GCGGGCCGAG CGCAGCGAGG GCCGCGAGGA TTCAGACGAC GAGCGAGACG AATGGGGCGA GAACGATGCC GACACCGAAT CGGACGACCC CGCGGACGCC GTCGTCGAGC GCGTCCCCGC CACCGTCGAG GCCCCGCGAA TGCTCTCGCA TCTGACCTAC CACGACGCCC AGACGCTGCT CTTTGCCCCC TCGAGGAAGC TCGCCGAACT CTCGGTCAAG CGGGCGTCGA AACACCGCCA CGACAACCGG CGCTACTACG CGAATCCCGA CCGCGGCAGC GCGATCGAGC CCTACCACGC GGGTCACTCG CGGAAGAAGC GCCACGGGAC CGAACACCAG CTCAAGACCG GCGTGCTCGA CGGCGTCGCC TCGACCAACG CCCTCGAGCT GGGGATCAAC ATCGGCGAGA TGGACGCGAC GGTCCAGCTC GGCTACCCGG GACAGCGCCA GTCGTTCTGG CAGCAGATCG GCCGCGCGGG TCGCGGGACC AAGCGCGCCC TGTCGGTGCT CGTGGCCGAA CACCGCACCC TCGACCAGTA CGTCGTGAAC AATCCCGACT ACCTCCTCGA GTCCGACGTC GAGGACGCGG TCGTCGACGT GGACAACGAC GCGGTGTTCG CCCAGCACCT GCGCTGTGCG GCCGACGAAC TCGCCGTCGA CGACTCAGAT ATCGGCGGGC TCGCCGACCG CGAGCGCCTC GAGCGGGCGA TCGAGATGTG GCGACGCGCG GGCCAGTTGC GAGGGAGTCT CGAGACGGGC GTCTCCTACG TCGGCCCGCC GCGACCACAG CAGACGATTT CGCTGTACGC GACGACGGGC GAGGAGTACG AGGTCGACCT CGCGGACGGC GTCGACGAAC GCCACGATCC GGGAATGGAG CCGCTGGCGA GGGAGCGCGT GCTGCGGGAC 
TTCCACGAGG GCGCGGTTCG GCTACACCAG GGCCAGCAGT ACGAGGTCGT CGACGTCGAC CACGACGCGC CTCGGCCCTC GGTGACGGTC CGCCCGACGG ACGTCGACTA CTACACGCGG ACCCGGACCG ACGTCACGGT CCTCGACGCG GTCTCGGAGG AGTCGCGGGA CATCGGCAAC TTCACGCTGC ACTTCGGCCG CGGGCGGGTA CTCGTCTACC ATGGCACCTA CGACAAGGTC GCGGTCCACG GCGGCAAGCG CAAAGAGCAG GGGATTCCCA CGGAGAACCC GCCGCTCGAG ATGGAAACTC AACTGTGCTG GCTCGAGGTA CCACAACGAA TCGAGCGGGC GCTGATCGAG AAGTACCGAG AGTTCGAGGT GCCCGAACTC GAGGACGGCC TCGCCGGAAC AGCCCACCTC GGCTACGCGG GCGGGCTCCA CGCCGCTGAG CACGCGACCA TCGGCGTCGC CCCGCTCGAG TTGATGGTCG ACAAGCGCGA CCTCGGTGGA CTGGCGACGC TGACGATCGA CTCGCATCTC GATCAGGACG CGGGCGCAGA TAGCGGGGGC GGTATGGGTC CGGGTACTGG GCCCGCGGGC GCGAGCGGCG ACGGCGCACC GCGAAACATC GCCGCGGCCG AGGCCACGGT CCGGGAGATC GCACAGGGCC TCGAGCGCAC CCCCGCCAGC GGCTGGTTCA TCTACGACGG CATCGAGGGC GGGCTGGGCT TCGCGCGGGC GATCTACGAG AACTACGAGG CCGTCGCCGA GCGCGCTCGA GACCTCATCG CGGACTGCGA CTGCGGGAAC GTCGACGGCT GTCCGGCCTG CGTGATGGAC GATCAGTGCG GCAACGACAA CCAGCCGCTG CACCGCGACG CGGCCGTCGA TGTACTGGAT CAGTTGCTGG GTGAGGCGGA CGAGACCGCG CTCGAGGCAC ACCTCCCAGA CGAGGAGTAC GGCGGGGACC GACGGCCGCC GCTGTTCTAC GCCTGA
|
Protein sequence | MSSDHESERY DTDGADVPVT GDELVDTFPG YRDEGDITVL ERPGREASTV PNERALRPEL AEPLEHDLYA HQAEALEALA REENVCVATS TSSGKTRIYA LQIARNYLEA RARGEDATAY VLYPTKALSR DQERELNDLF DQLGLEITVR VYDGDTERGS NRKRIREEAD VIISNFAGVN TYLHDHDRWA RFLSACDLVV IDESHTYTGV HGMHVAWIVR RLKRVLEYYD ADPQFVLTSA TIGNPGEHSA ALIDESVTVV DEDGSPTGPR DLVLWNPPPR AREDERSESS ETRTGATRAE RSEGREDSDD ERDEWGENDA DTESDDPADA VVERVPATVE APRMLSHLTY HDAQTLLFAP SRKLAELSVK RASKHRHDNR RYYANPDRGS AIEPYHAGHS RKKRHGTEHQ LKTGVLDGVA STNALELGIN IGEMDATVQL GYPGQRQSFW QQIGRAGRGT KRALSVLVAE HRTLDQYVVN NPDYLLESDV EDAVVDVDND AVFAQHLRCA ADELAVDDSD IGGLADRERL ERAIEMWRRA GQLRGSLETG VSYVGPPRPQ QTISLYATTG EEYEVDLADG VDERHDPGME PLARERVLRD FHEGAVRLHQ GQQYEVVDVD HDAPRPSVTV RPTDVDYYTR TRTDVTVLDA VSEESRDIGN FTLHFGRGRV LVYHGTYDKV AVHGGKRKEQ GIPTENPPLE METQLCWLEV PQRIERALIE KYREFEVPEL EDGLAGTAHL GYAGGLHAAE HATIGVAPLE LMVDKRDLGG LATLTIDSHL DQDAGADSGG GMGPGTGPAG ASGDGAPRNI AAAEATVREI AQGLERTPAS GWFIYDGIEG GLGFARAIYE NYEAVAERAR DLIADCDCGN VDGCPACVMD DQCGNDNQPL HRDAAVDVLD QLLGEADETA LEAHLPDEEY GGDRRPPLFY A
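The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence from the record (illustration only).
first_blocks = "ATGAGTAGCG ACCACGAATC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.5
```

Passing the full gene sequence to the same two functions would give the whole-gene value to compare against the GC content field.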