Gene Information |
Locus tag | Htur_5017 |
Symbol | |
ID | 8745823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 5547 |
End bp | 8963 |
Gene Length | 3417 bp |
Protein Length | 1138 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646515631 |
Product | hypothetical protein |
Protein accession | YP_003406578 |
Protein GI | 284176302 |
COG category | |
COG ID | |
TIGRFAM ID | |
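The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions match the values in this record, but neither is stated by the source. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(5547, 8963)
print(length)                     # 3417
print(protein_length_aa(length))  # 1138
```

Both results agree with the Gene Length (3417 bp) and Protein Length (1138 aa) fields.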
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.483629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCACGACA AAACCGCCGA CACCGAGAGC CGCTGGACTC GCCTCACCGC GGCGATCCCG ACCGGAGCAC AGGCGCTGAC GATCGTCTTC GTCGTCCTCC TGGTCGCGTC GATGCCCGCG CCGTTCGCGT TCGCGGCCGG CATGAACAGC GGGACGGCCG CTGCCGCGAC CTCAACCACG ACGACGACGT CGCTGAGTTC GGCGTCACTG TCGACCACGA CCACGGCAAC GCTTGATGAC AATACGTCCT ACTATGACGA TTTCGAAGAC GGAACTGCGG ACGGCTGGAA CGATAATGAG TCGACTGCAA TAATTTCCAG CGAGTCGTTC TACGGGAACA ATTCACATCT GGTCTATGAC GGCAACGGCT CAGCAGCTAA CGTCACATGG GCGGGTGGCC CCACGTTCTC GAGCCAAGAG AACTTCGAAC TCACTGGCAC ATACCGCGCA GATCACGGCG ATGGCAGCGG CGGTGCGATT CGATTTGGCC TTGGAGAGAA AGACACAGTC GAATCGGGTG GCCCCATCTG GGCTGTCGTT TACATCCAAC CCAACGAGAA CGAGATTTGG CTCGAGTCGA GCGGACCGGA CACGACGACG AGCGAAAAGA GCTCCGAAAG AATCAATGCA ACGTTCGATG ACATTTGGGC CGACTTCCGG TTGCAGTTCG ACAGCGGCCA GATGAAATAC AAAGTCTGGG AGGCTGGCAC AGGAGAGCCA ACTGACTGGA TGATCACACA CGACGCACCG GAGGATGTCA AATCACAGTT CTTCGCCCAT GCCGGCCAGT CCGAACACGG GCAAGAGATC CAGTTCGACC AAGTCGACGC TGGCGGCCAC GCAATCACTG GACAGTTCGT TGACTCAGAC GGAAATCCAG TCCCGAACGC GACCGTTGAG GGCTATGGCG TCGACTACGC GAACGTCGAA CAGCGGCTCA AGGAACAGGC CGATGATAAC AACGTCACCG CCGAGGAACT CGAAGAAGAG GCTCAGAAAC TCTTAGATGA AGCGGAGGGC GTTGAACCGC CTGACGGATG GACAAATTTC TACGAGACGT ATAAAGAGAG TCCTGAAGCG ATTAGTTCTG AGGAGTTTTC CGAGGGACTT GACGGAACGT ATCCGTTGGT CCACGAGTAC GACGACTGGG GGCAGGGAAG TACAGACATC CTCAGCGAGG AGGTGGACGC CCCACACCAC ACGGTTGACT CTGGCGAAAT GGTTGTAATC TCGCTGTGGG ATCTTGAGGA AGAAAACCCG ATCCTCCCAG AAGGTCCGGT TAGCAACTCC CACCCCGGCG AGATTACGGA CGGGCCAGTC GTCATTGAGG AATACTCGGC TGGCGATCAA GTCGACTCGA AAATAGTCGG GATGCAGGAT GACGCGTTCG TTATCGAACG TCCGTCGGTC ATGCCGAACA CCGACGTTCC GGCGTACAAA ACCTACCTCT CGACAGGAAT CTACCGCGTC TACCCTGAGG GCAGCCCTGA GAAATCCTAC TGGGTTCAGG TCGGCGACGC CGAAGAACAG TGGAACGCGC TCGAGACCGA ACTCGAGAAC CGAGCCGACG ATCTCGAAGA TGAATCCGAC AAACGCACCC AACGCGCGCA GGATCTCCTC GATAACATGG AGGTCGGCGA AATGGAACGC CGGACGACGA CGACCGACGA AAACGGGACG TTCTCGATCC GATTCCAGAC GGGCGTTCAG AGGGCCGCCG TGCAAGGCTA CCGTGCCGAC GGGACCGTAT TGACCGACAT CACGGGACCG TCGTTCGACG ACCTTCGTGA TGAATCCGAG 
TCTGGATACA ACGGCACCTA CGTTCTCAGC GCGCCCAAGC GCTTCGACGT CCCGGCCGAA GGCGTCACGA TCGAAGGCTA CCGCGCTGAC GAACTGCCAC AACACCCAAT CGAGGACTTC AACGACTTCT TAGAGTGGAA ACAAAACCAA GTCCTCAACG AGACGGTCAA CGACCTCCAA AGCGAGTACG ACCAGCGACT TGAAGAACTG AACCGGACCC AACTCGAGGC TCGCTACACG ACTCACCGAC CGCTCATTGA GACGGTGCCG GGAGCGGAAG AGCGCTACCT CAGCCGCTCG CAGTTCGATT CCATCCAATC CGCTGAGGAC CTCTCGGACG ACGACCTCGA GACCGAGGTC GGTCACATGG AGACGGTCCT GCGGAGCACC GACCAGATCG CCCCGCCGGA GACCGATGAC TCGGACTCGC CGATCTCAAT CGAGGACGGC GAGCTGAACG CCGAGTACGG GCCAATACCG AGCGGCATTG ACACGGACAC ACTCCAGCCG GAACTCCACT GGTCGAACGG CGAGAGCGAG GAAATCCCCG AAGAGTACTG GTCGATCGAA TCGACGGGTG CACTCGGACG GAGTTCCCAA CTCGTCATCG AGGGCTACCC GATCGACGAC ACTGACCCGG CGGCGTTCGA CCTGCGTGTT CTCGGCGGCG GCGCTGGCGG CGTGCTCGAT GACCGCCTCT CGGCGCTGAA CCCGTCGTTC AGCGGTACAA TCCCCGACGT GCAGGCGTTC GATCTGAACA CGATGGCGCC CGGAGACAGC GAGACAGTCT CCATGACCGT GCGACTGGGT GATGACGACA ACTACGGCGG CCTCGAGTCG GTCGAAGTCT TCGGCCCCGA GGGCCAGCAA CTCACGACCG ACGTGACTGG CGACAAAGCG TCGTTCGAAA CCAACGGCGC CGGCAAACAC TTCGTTCGCG CGACCGTCAC GGACTCGACC GGCGGCCAAT TCGTCCACAC GTTCCAGCTC CGCGCACTCG AGCAAGGCCG GGACGATCCG GCGACGGTTC GGGCTGAGAC AGCGATCGGC GACCGCGTGT TCGCGCTGGT CGGCGAGAAA CTCGAAGACG CAGAGATTCG CGCCAACGGC GGCTCGCTCG AGGTCGACGC GATCGCACCC GGCGGCGAGA TTCCCTCGTC GATCCACATC AAGCCACGCG CCGCGATGGA GCAGTCGGCG ACGGATATCA ACATCCGCGT TCTCGAGGGC CACGACGAGG CGACCGTCGA CACGACTGTC GAGACGGTCA TCCATCTGGA TTCACTCGCC GACGGCGCCG TTGTCTGGCG TGGCGATCCC GGCCTCCTCG GCCAGCCACT CGCCGACGGT GGCACGCGCT ACGGCGAGGT GATGGAGCGC GACCTCGGCG ACGGCGAGAC GAAGTACGTC ATCCGAACGT ACTCGGACTC CGACGGCTCG GTCTCGTTGA CCATCAACGA GGACCCGGGG CTGTGGGCCG GAACCGAGCA CACGATTGCC AAGAGCCTCC CGCGCCCGTC ACTCCCGGTC CTCTCGATCG TCTCGAGCCT ATTCGGCTCG CTATCGGTCG TCGGCGTCGG ACTCATTGCG CGCCGCCGGC AGCCAAAACT CCGGTGA
|
Protein sequence | MHDKTADTES RWTRLTAAIP TGAQALTIVF VVLLVASMPA PFAFAAGMNS GTAAAATSTT TTTSLSSASL STTTTATLDD NTSYYDDFED GTADGWNDNE STAIISSESF YGNNSHLVYD GNGSAANVTW AGGPTFSSQE NFELTGTYRA DHGDGSGGAI RFGLGEKDTV ESGGPIWAVV YIQPNENEIW LESSGPDTTT SEKSSERINA TFDDIWADFR LQFDSGQMKY KVWEAGTGEP TDWMITHDAP EDVKSQFFAH AGQSEHGQEI QFDQVDAGGH AITGQFVDSD GNPVPNATVE GYGVDYANVE QRLKEQADDN NVTAEELEEE AQKLLDEAEG VEPPDGWTNF YETYKESPEA ISSEEFSEGL DGTYPLVHEY DDWGQGSTDI LSEEVDAPHH TVDSGEMVVI SLWDLEEENP ILPEGPVSNS HPGEITDGPV VIEEYSAGDQ VDSKIVGMQD DAFVIERPSV MPNTDVPAYK TYLSTGIYRV YPEGSPEKSY WVQVGDAEEQ WNALETELEN RADDLEDESD KRTQRAQDLL DNMEVGEMER RTTTTDENGT FSIRFQTGVQ RAAVQGYRAD GTVLTDITGP SFDDLRDESE SGYNGTYVLS APKRFDVPAE GVTIEGYRAD ELPQHPIEDF NDFLEWKQNQ VLNETVNDLQ SEYDQRLEEL NRTQLEARYT THRPLIETVP GAEERYLSRS QFDSIQSAED LSDDDLETEV GHMETVLRST DQIAPPETDD SDSPISIEDG ELNAEYGPIP SGIDTDTLQP ELHWSNGESE EIPEEYWSIE STGALGRSSQ LVIEGYPIDD TDPAAFDLRV LGGGAGGVLD DRLSALNPSF SGTIPDVQAF DLNTMAPGDS ETVSMTVRLG DDDNYGGLES VEVFGPEGQQ LTTDVTGDKA SFETNGAGKH FVRATVTDST GGQFVHTFQL RALEQGRDDP ATVRAETAIG DRVFALVGEK LEDAEIRANG GSLEVDAIAP GGEIPSSIHI KPRAAMEQSA TDINIRVLEG HDEATVDTTV ETVIHLDSLA DGAVVWRGDP GLLGQPLADG GTRYGEVMER DLGDGETKYV IRTYSDSDGS VSLTINEDPG LWAGTEHTIA KSLPRPSLPV LSIVSSLFGS LSVVGVGLIA RRRQPKLR
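The sequences in this record are printed in space-separated groups of 10 characters, so they need to be cleaned before any computation such as GC content. The following is a minimal sketch of that cleanup, assuming plain A/C/G/T input with no ambiguity codes; the helper names are hypothetical and the sample is only the first three groups of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First three 10-base groups of the gene sequence in this record.
sample = "ATGCACGACA AAACCGCCGA CACCGAGAGC"
print(len(clean_sequence(sample)))   # 30
print(round(gc_percent(sample), 1))  # 60.0
```

Running `gc_percent` on the complete gene sequence should give a value comparable to the GC content field, though the database may round or compute it differently.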