Gene Htur_5017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5017 
Symbol 
ID8745823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013748 
Strand
Start bp5547 
End bp8963 
Gene Length3417 bp 
Protein Length1138 aa 
Translation table11 
GC content61% 
IMG OID646515631 
Producthypothetical protein 
Protein accessionYP_003406578 
Protein GI284176302 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.483629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGACA AAACCGCCGA CACCGAGAGC CGCTGGACTC GCCTCACCGC GGCGATCCCG 
ACCGGAGCAC AGGCGCTGAC GATCGTCTTC GTCGTCCTCC TGGTCGCGTC GATGCCCGCG
CCGTTCGCGT TCGCGGCCGG CATGAACAGC GGGACGGCCG CTGCCGCGAC CTCAACCACG
ACGACGACGT CGCTGAGTTC GGCGTCACTG TCGACCACGA CCACGGCAAC GCTTGATGAC
AATACGTCCT ACTATGACGA TTTCGAAGAC GGAACTGCGG ACGGCTGGAA CGATAATGAG
TCGACTGCAA TAATTTCCAG CGAGTCGTTC TACGGGAACA ATTCACATCT GGTCTATGAC
GGCAACGGCT CAGCAGCTAA CGTCACATGG GCGGGTGGCC CCACGTTCTC GAGCCAAGAG
AACTTCGAAC TCACTGGCAC ATACCGCGCA GATCACGGCG ATGGCAGCGG CGGTGCGATT
CGATTTGGCC TTGGAGAGAA AGACACAGTC GAATCGGGTG GCCCCATCTG GGCTGTCGTT
TACATCCAAC CCAACGAGAA CGAGATTTGG CTCGAGTCGA GCGGACCGGA CACGACGACG
AGCGAAAAGA GCTCCGAAAG AATCAATGCA ACGTTCGATG ACATTTGGGC CGACTTCCGG
TTGCAGTTCG ACAGCGGCCA GATGAAATAC AAAGTCTGGG AGGCTGGCAC AGGAGAGCCA
ACTGACTGGA TGATCACACA CGACGCACCG GAGGATGTCA AATCACAGTT CTTCGCCCAT
GCCGGCCAGT CCGAACACGG GCAAGAGATC CAGTTCGACC AAGTCGACGC TGGCGGCCAC
GCAATCACTG GACAGTTCGT TGACTCAGAC GGAAATCCAG TCCCGAACGC GACCGTTGAG
GGCTATGGCG TCGACTACGC GAACGTCGAA CAGCGGCTCA AGGAACAGGC CGATGATAAC
AACGTCACCG CCGAGGAACT CGAAGAAGAG GCTCAGAAAC TCTTAGATGA AGCGGAGGGC
GTTGAACCGC CTGACGGATG GACAAATTTC TACGAGACGT ATAAAGAGAG TCCTGAAGCG
ATTAGTTCTG AGGAGTTTTC CGAGGGACTT GACGGAACGT ATCCGTTGGT CCACGAGTAC
GACGACTGGG GGCAGGGAAG TACAGACATC CTCAGCGAGG AGGTGGACGC CCCACACCAC
ACGGTTGACT CTGGCGAAAT GGTTGTAATC TCGCTGTGGG ATCTTGAGGA AGAAAACCCG
ATCCTCCCAG AAGGTCCGGT TAGCAACTCC CACCCCGGCG AGATTACGGA CGGGCCAGTC
GTCATTGAGG AATACTCGGC TGGCGATCAA GTCGACTCGA AAATAGTCGG GATGCAGGAT
GACGCGTTCG TTATCGAACG TCCGTCGGTC ATGCCGAACA CCGACGTTCC GGCGTACAAA
ACCTACCTCT CGACAGGAAT CTACCGCGTC TACCCTGAGG GCAGCCCTGA GAAATCCTAC
TGGGTTCAGG TCGGCGACGC CGAAGAACAG TGGAACGCGC TCGAGACCGA ACTCGAGAAC
CGAGCCGACG ATCTCGAAGA TGAATCCGAC AAACGCACCC AACGCGCGCA GGATCTCCTC
GATAACATGG AGGTCGGCGA AATGGAACGC CGGACGACGA CGACCGACGA AAACGGGACG
TTCTCGATCC GATTCCAGAC GGGCGTTCAG AGGGCCGCCG TGCAAGGCTA CCGTGCCGAC
GGGACCGTAT TGACCGACAT CACGGGACCG TCGTTCGACG ACCTTCGTGA TGAATCCGAG
TCTGGATACA ACGGCACCTA CGTTCTCAGC GCGCCCAAGC GCTTCGACGT CCCGGCCGAA
GGCGTCACGA TCGAAGGCTA CCGCGCTGAC GAACTGCCAC AACACCCAAT CGAGGACTTC
AACGACTTCT TAGAGTGGAA ACAAAACCAA GTCCTCAACG AGACGGTCAA CGACCTCCAA
AGCGAGTACG ACCAGCGACT TGAAGAACTG AACCGGACCC AACTCGAGGC TCGCTACACG
ACTCACCGAC CGCTCATTGA GACGGTGCCG GGAGCGGAAG AGCGCTACCT CAGCCGCTCG
CAGTTCGATT CCATCCAATC CGCTGAGGAC CTCTCGGACG ACGACCTCGA GACCGAGGTC
GGTCACATGG AGACGGTCCT GCGGAGCACC GACCAGATCG CCCCGCCGGA GACCGATGAC
TCGGACTCGC CGATCTCAAT CGAGGACGGC GAGCTGAACG CCGAGTACGG GCCAATACCG
AGCGGCATTG ACACGGACAC ACTCCAGCCG GAACTCCACT GGTCGAACGG CGAGAGCGAG
GAAATCCCCG AAGAGTACTG GTCGATCGAA TCGACGGGTG CACTCGGACG GAGTTCCCAA
CTCGTCATCG AGGGCTACCC GATCGACGAC ACTGACCCGG CGGCGTTCGA CCTGCGTGTT
CTCGGCGGCG GCGCTGGCGG CGTGCTCGAT GACCGCCTCT CGGCGCTGAA CCCGTCGTTC
AGCGGTACAA TCCCCGACGT GCAGGCGTTC GATCTGAACA CGATGGCGCC CGGAGACAGC
GAGACAGTCT CCATGACCGT GCGACTGGGT GATGACGACA ACTACGGCGG CCTCGAGTCG
GTCGAAGTCT TCGGCCCCGA GGGCCAGCAA CTCACGACCG ACGTGACTGG CGACAAAGCG
TCGTTCGAAA CCAACGGCGC CGGCAAACAC TTCGTTCGCG CGACCGTCAC GGACTCGACC
GGCGGCCAAT TCGTCCACAC GTTCCAGCTC CGCGCACTCG AGCAAGGCCG GGACGATCCG
GCGACGGTTC GGGCTGAGAC AGCGATCGGC GACCGCGTGT TCGCGCTGGT CGGCGAGAAA
CTCGAAGACG CAGAGATTCG CGCCAACGGC GGCTCGCTCG AGGTCGACGC GATCGCACCC
GGCGGCGAGA TTCCCTCGTC GATCCACATC AAGCCACGCG CCGCGATGGA GCAGTCGGCG
ACGGATATCA ACATCCGCGT TCTCGAGGGC CACGACGAGG CGACCGTCGA CACGACTGTC
GAGACGGTCA TCCATCTGGA TTCACTCGCC GACGGCGCCG TTGTCTGGCG TGGCGATCCC
GGCCTCCTCG GCCAGCCACT CGCCGACGGT GGCACGCGCT ACGGCGAGGT GATGGAGCGC
GACCTCGGCG ACGGCGAGAC GAAGTACGTC ATCCGAACGT ACTCGGACTC CGACGGCTCG
GTCTCGTTGA CCATCAACGA GGACCCGGGG CTGTGGGCCG GAACCGAGCA CACGATTGCC
AAGAGCCTCC CGCGCCCGTC ACTCCCGGTC CTCTCGATCG TCTCGAGCCT ATTCGGCTCG
CTATCGGTCG TCGGCGTCGG ACTCATTGCG CGCCGCCGGC AGCCAAAACT CCGGTGA
 
Protein sequence
MHDKTADTES RWTRLTAAIP TGAQALTIVF VVLLVASMPA PFAFAAGMNS GTAAAATSTT 
TTTSLSSASL STTTTATLDD NTSYYDDFED GTADGWNDNE STAIISSESF YGNNSHLVYD
GNGSAANVTW AGGPTFSSQE NFELTGTYRA DHGDGSGGAI RFGLGEKDTV ESGGPIWAVV
YIQPNENEIW LESSGPDTTT SEKSSERINA TFDDIWADFR LQFDSGQMKY KVWEAGTGEP
TDWMITHDAP EDVKSQFFAH AGQSEHGQEI QFDQVDAGGH AITGQFVDSD GNPVPNATVE
GYGVDYANVE QRLKEQADDN NVTAEELEEE AQKLLDEAEG VEPPDGWTNF YETYKESPEA
ISSEEFSEGL DGTYPLVHEY DDWGQGSTDI LSEEVDAPHH TVDSGEMVVI SLWDLEEENP
ILPEGPVSNS HPGEITDGPV VIEEYSAGDQ VDSKIVGMQD DAFVIERPSV MPNTDVPAYK
TYLSTGIYRV YPEGSPEKSY WVQVGDAEEQ WNALETELEN RADDLEDESD KRTQRAQDLL
DNMEVGEMER RTTTTDENGT FSIRFQTGVQ RAAVQGYRAD GTVLTDITGP SFDDLRDESE
SGYNGTYVLS APKRFDVPAE GVTIEGYRAD ELPQHPIEDF NDFLEWKQNQ VLNETVNDLQ
SEYDQRLEEL NRTQLEARYT THRPLIETVP GAEERYLSRS QFDSIQSAED LSDDDLETEV
GHMETVLRST DQIAPPETDD SDSPISIEDG ELNAEYGPIP SGIDTDTLQP ELHWSNGESE
EIPEEYWSIE STGALGRSSQ LVIEGYPIDD TDPAAFDLRV LGGGAGGVLD DRLSALNPSF
SGTIPDVQAF DLNTMAPGDS ETVSMTVRLG DDDNYGGLES VEVFGPEGQQ LTTDVTGDKA
SFETNGAGKH FVRATVTDST GGQFVHTFQL RALEQGRDDP ATVRAETAIG DRVFALVGEK
LEDAEIRANG GSLEVDAIAP GGEIPSSIHI KPRAAMEQSA TDINIRVLEG HDEATVDTTV
ETVIHLDSLA DGAVVWRGDP GLLGQPLADG GTRYGEVMER DLGDGETKYV IRTYSDSDGS
VSLTINEDPG LWAGTEHTIA KSLPRPSLPV LSIVSSLFGS LSVVGVGLIA RRRQPKLR