Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2866 |
Symbol | |
ID | 8385175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2941023 |
End bp | 2947826 |
Gene Length | 6804 bp |
Protein Length | 2267 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644973944 |
Product | GLUG domain protein |
Protein accession | YP_003131760 |
Protein GI | 257053927 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAATC GAGCAGCCGG GGGACTTTAT CGTAAGATCC GTGCAGACGA GCGTAGTGTT TCGGAGATAC TCGGGACGGT CCTGATATTT TCGTTCGTAA TCTTCCTGTC GGTCGGTCTG ATCGTCATCG GGCTCAGCGC CTTCCAGGGC GCGACCGCAC AGACCGAGGA CCGACTCGCA CAGGACTCCA TGCAGGAGAT GGGCGACCGG TTGCATTCGC TGACTGGCAG TCAGATCGAC ACCGCAACCG AGTTCGAATT CCCGACCGGA ACCGGGGACG ATATCAACGC ACTTGACGAA GGGGTCGTCA ACATCACGAT CGAAACACAT TCGGATTATG TCGGACTCGT CGAGGCAAGT GATGCCTCGA ATTCCACAGA AATAGACCTC GGCACCATCG AGCACGAAGC GGAGGATGGC ACGATCACTG CCTATCAAGG CGGGGCACTG TTCGAGAGGC AGGGCGATCT CATCGAGATC CTCCAGGAGC CGACGTTCGA TTATCGGGGT GACGCGATCG ATCTCTCCTT CCACTCGGTG GACATCGATC AGATTTCGGA TGCGGAGTCT ATCACGGCAA AACGGCTGCG CAAACAGTCC GAGGACCAGT CCGAAAAGCT TCGGGAGATG ATGCGGCCAC ACTGGAATCT GACGGGCTAT AGCGACATCA TGGCACCGGT CACGATCACG GTCACGATCG AAAGTGAGTA CGCCGACGCC TGGCAGATGT ACGCCCAAAA CCGGATGACG GAGACGCCCA CTGTCAATCG GAATGGAAAC GAAGTCGAAG TCGTTTTTGA CAAGTTCGAC GGGGGGCTTA CGTTCAATAC GAATCAGACC TTCGACGACG ATGTCTTCTA TTCGGGAGAG GCAGCATTGA CCGGCCTGGT CAACGTCTCG AACGCGACGA TCGGTGGTTC GGGCGGGGAA GTCGAGGTCG CCGAAATCGA CGGTCATGGT GCAGCATCCC CGAGCCCACA TTACATACTG GGCGTATACA ACGAAACGGC CAGTGAGTGG ATGATATATA ATACGACTGA TGGTACTGTT CGAACTCTTA CGGGAGATGT CGTGACGAAC CCGGACTTTC TTGATAGTCC ACCTACCCTA AGCAACAACG ACACGTACCA GATCGACCCG AAAGACACGT GGACCTGCGT CGTTGATGGG GATCCGAGTA AAACGGATCA CGAGGAGTTT GTCGACTACG TCGACACATC CGGTGAGGGC TGTCTCTCGG AGCCGCTGGT CGGGGACGCG CCCGATGATT CGGTGGCCAG TCCACATTTC AACGTGTCGT TCGACGACGT CAGGTTGGCC GGGTCGAAGA ACATCAACAC TGACGACGTC GTCGCCGGAC AGGACAAGAT CGAGCTCGAT TGGACTGTCG AAAACGACTA CGTCAGCAAC GGAACGACGC CGGTCGTCCT GCTGTTCAGT GAAAAGGGTT CGAGTGACTG GATCCCGCTT GAGAAGGCGG ACGTCACTCT CAACGGGACC GGGGATACCG CATCGGACAC GTTCACAGTC AACGCGACGG GGAGCGCCAA CGTGACTTTC CAGGTCGCCA CGTTGGATAC GAACGACAAG TCCCAAGAAA TCGAGATCGT CAAACGACCG GAGCGAGGGA AGTTCCAGAT CGACAGTCTC TCCGTGAACA AGAATACACT CACCGCCGGC GAGGATCTCC AAGTCGATGT CGAGATCAAC AACACCGCAT CGATCAGCGA CACCCAACTG GTCGAGTTGC AGTTCGACGA CAACAGTGGT CCGGCGGCAG TCGCCTGGAA GAAAGTGAGC GTGAACGCCG GAACGACGAA GACGGTGTCG ATCAACTGGA CGACTACGAA CGCTTTCAAC ACGTCCAACG GCGAGGTGAT CGCCGAGACG TATTACGACA ACGAATCGGA AACAAACATT ACAATCGAAG AGGACACGGG GGCCAACGCC TCTTTCGACG TCACAATTGG AAACGTCAGT CCCGATCCAG CCACGGACGG AGATACCGTC ACGGTGGCGA CAAACGTCAC GAACGTCGGC AACGAGACGT CTGAACAGGA CATTGCACTT CTGGCAAACG GAAACATTGC CGACGTTAAA CTCAATCAAA ATCTTACCAG CAACCAGTCG AAAACCGTCT CTCTGAAATG GGACAGTATT GGATACGGTG GCGAGAATAT CATTTTGACT GTGGCCAGTG CCGACGATAC GAACCAGACC ACAATCTCAG TTCAGGAACT CGAACCGGCG GAATTCCTGA TCGACAGCCT TACTGTTCCT AAGACGAACC TCACGGCCAG CGAAGACTTG CAGGCCACCG CCGAAATCAA CAACACTGCG AGCCGAAACG AGACGCAGAT CGTCGAGATG CAGTTCGACG GGACGCCGGT CGCCTGGAAC GAAGTGTCCC TCTCAGCTAA CGCAGTTCAG TCGGTGTCTC TCAACTGGAC GACGACAAAC GCCTTCGAGA CCTCCTACGG CACGGTGACC GTCGAGACGT ACGATGACAG CGCCGCGGAA TCGCCGATCG AGATTCAGAA GGGAATGGAG ACGAACGCGA CCTTCGCGGT CGATATCGAG AACGTCACGC AACCGATTCA CGACGGCGAT ATCGTCACCG TGGAAGCCAA CGTCACGAAC GTCGGCAATG AGACGACCGC CCAGGACATC GCGCTCCTCG CGGACGGGAA CATCGCTAAC GCCACGCTCA ACCAATCGCT CTCCCCTGGC CAGTCGACGA CGCTCTCTCT GAAGTGGGAC AGTAGCAATC GCGGCGGTGA GAATGTCACA CTGACTGTGG CGAGTGCTGA CGATACGGAC AGCGAAACGG TGTCGGTCAC GGCGGTCGAT CCACCGGATT ACAATATTGC CATCGACGAC GTGACCGTCG ACGGCGATCC GAACGGCGCG GTCGATCCGG GTAATTCCAC CGTCACTGTC GAGGCGACCG TCTCCAACAG CGGCGACAGC GAATCCCCGA TCGTCTGGCT CGAAGACTTC GAGGATCGCG TCGTGGCGGT CGATGACTCG GTAGGGATAA TCGGGATGCG TGCGAGCGAT CCTGACAGTA CGTCGGTGAC GCTCCAGTGG AACGTTCCGT CAGACGTCAA TACGACCGAC CCGGAGATCA CGGTCGCCGT CGATGGTGAC GAGGACTCAG AGGAACTCGA CATCGAATCC AGGTTTGTGG TTGAGGATAT TCAGACGACA AAGGTGAAGA GCTCGGCGTT CAGTGAACTT CAGATCGAAG ACGACTCAAC TTCTGGAGGG TTTTGGACCT CGGATGAGAG TGAGTTTACC GTCGATTACG AAGTAAATCC GACCTCTGAT TTCGACAATG TCACAATTGT GTTCGATGGT GACGGAGGCG TATATGACGA ACAAGTATCA ACGACTACAG ACGGTACTGT GACTCATTCC CGGAATGGGG AATACGGCTC GACGTACGAC ATTACCGCTC AGATCATGGA CGACGGGTCG GTCGTCGAAA CAGTCACGAG AGGTGGCGAG GTTGCCGACG GTGACGATCC AGACATGTCT GAGCGGTCCG GTAATCTCGT CGACTCCGTC ACTTTCGAGG CTACCATTGA GAATATCGGC GACACGACCG TTTCCGACAC TGTCCAGCTA TTTGATCCAG CTGGAAACGA GATCGGTGAT ACGTCAGTTT CCCTTGATGG AGGTGACGAT ACAACAGTGG TCTTCAATTG GGTTGGTCCT GACCGCACGG GATCAGTGGA GGTCACGACC AAAGACGACT CGGCATCAGA ACGCGTGATA ATCGAACGCG ACGGCCCTGT CTGTGAAGCA GTCTCTTACA CCGAAGATTC GGAGGGGCGC TACGAGATCA GCAACGTCGA TCAACTGCAG TGTATTAACG AGCACGGCCT CGATGAAGAG TACGTCCTCG TCGACGATAT CGACGCTTCA GGAACCGAAC ACTGGAACGA CGGGAAAGGG TTTGACCCGA TCGGCCCCGA GGGGCACGAT TCTATAAGCA TTCCAGAGGA TGACCCGTGG GAAACCTTCA GTGGTGAATT CGACGGGAAC GGACATACTA TTGAAGGACT CTACATCGAT CGACCGAGCG AGAACTACGT TGGGCTCTTT GGCGCAACAG ATCGCCCGTA TGACGATGTA CCAGTTGGAT ACGGTTCGAC AGTTGAGAAC GTAATACTCG AAGACGTTCA CGTCGTCGGA AATATGTACG TGGGCGGACT TGCGGGACAA GCTGGTGGTG AGATTATCAA CGCACGTGCT GAGGGTTACG TCGAAGCTGA CAAACAGCTG GTCGGTGGAC TCATTGGTCT CGGAGCACAC GCTGATATAG ACAACCGGAT CGTTGCTGAT GTGGAAGTTG TCGGCGGAAA TGTACCGAGC CACCTCAGTG GTGTGCAGGA AGGGAACGCA AAGGGGATCG GTGGGTTGAT CGGTCGCGCT GCGTGGGGAA CAGAAATCGC CACGGGCTAC AGCACAGGAG ACGTCACGGG CCCAGTTAAT GTTGGTGGGA TTATGGGTAG TTCTTCACTG GTCGATTCCG AGTTCGAAGA GATGTACTCG GCTTCCACAG TAACGGCAAC TGCTCCTGGT GCTGAGGGCG GTTCAATTGT TGGCATTGTA CAGTCTGACG GTGATCGATT TAACGAGGAC ATGTATTACG ATGAGGCTCT CGAGTCTTCT GCCTGGGGAG AAGCAGAAAG ATGGACTGGA TATTCCATTG TCGACATGGA TATTGATGAG GTTGAGCAGA CTGACTGGAT CGGTCGAACG ACCACCGAAA TGCAGGGTCT TGACGTGAAC GAGCCAGGTC GGTTAGGCAA CCTTGACTTC GAAGAAGACG GTGGCCCCTG GGTAGCAGTC CTCGGTGACT ATCCCCGCTT CCAGTGGGAG CTTGAGGCCG AAGGCCAGGT CGGAGTCGAC ATCGACGAAG ACAACCTCGA TCCCGTCGTG GCAGGTCAGT CACTGACAGT CCCGGTGACA CTCACCAACA GCAACTACGA GAACGTTACC CAGACGATCC GGCTCCTCAG TGACGGGACG CCAGTCGATA GCACGAGTGT GACAATCACA GAGCGTACGC CCGGTGGCAA TTCCACGACC AAAAACGTCT CTCTCACCTG GAACACGAAC GAGGACGACG TCGGACCGAC CTCCCTCGAA GTGCGAAGCG AGGACGATAC CGATCAGGAA ACGATCGAGA TTGACGCGCC CGCTGTCGGC GATTACCTTG TCGACTCCGT CAGCGCCAAC GATCCGACGA CTGCCGGTGA CGTTCTCACG GTCACCGCCG ACATCAGGTA CAACGGCTCA ACGAGTCCCA GTCCTGAACT CGTGACCCTC CAGAACTTCG CCGGTGGCGT CGTCGATAGT ACGAATGTCA CCGGGAACAC GACGGTAGCT CTCACGTGGG ACACGAGTGA GGGATTGATC TCGGGCGGAA ACATCACGGA TTCGATCACT GTCGGGACGA GTGACGACAC CAATACCACC CAGGTGACGA TCGAAGCGGC CGGCGATGGC GAGTTCAACG TGACTGCCGT CTCGACGAAC GCTCCCATCA CGGAAGGCGA GTCCCTGAAC GTCACCGCCG AAATCGAAAA CACCGGTACC GAAACCGGCA CCCAGGAAGT ATTGCTCTGG GACTTCGACG GCAACGCGGT CGATATCGGT GTGGTCAGCC TGAACGCGGG TGACTCGACG ACGGTCGATC TCACCTGGCA GACGGCCGTC GGCGACAACG GAACAGGCAC AATCGAAGTC ATCACCGGCG ACGACACGAG GACAGCGACG GTCGACATTA CTCCGGCCGG CACCAAGCAA TACACCTTCG AACACACCTC AGTCGACGAT CCGACCACCG CCGGTGAAAC GCTAAATGTT ACGGCTGAAG TCTCCTACAA CGGCACAGTC TCACCTCCTC CCTCTGAGCC CGTGACGCTT CAGGACTTCG ACGGCAATGT TGTCGATCAT GAAACAGTCA GCGGAAACAA TACCGTGACC CTCCAGTGGG ATACCAGCGA GAACCTGGTC ACGAGCGGGA GTATCAGCGA CGACATCACT GTCGGGACGA GTGACGATAT CGAAACGGAG GAGGTTACGA TCAACGCCGC AGGCGACGGT GACTTCCAGA TTCAGAACCT CGCCACGAAC GACCCGGTCA CGGAGGGCGA TACGCTGGAC GTCACCGCTA CCGTGGAAAA CGTCGGTTCG GAAAACGGCA CCCAGAACGT CCTCCTCCGG GACCACGGGG ACAACGCAGT CGATATCACA TCCGTCTCGC TGGATGCGGG CAATTCGACG ACCGTCACGC TCAACTGGAA TACGACCGTC GGCGACAACG GTCAGGACGA TATCACGGTA ACCACCATGG ACGACACGGC GACGCAACAG GTCACCATCG AGGAGCAGCC CGAGGAGCAA CTCGACGTTG CAATCACGGA AGTGCGGTAC AATGGCTCGC AGATCACCAG CGGGACGACG GTAGAGGCAG GAGCCACCCT CGAAGCCGAC GTGACAGTCA GCAACGCGAC CCAGCAAGTC ACGAAGCCGG TGTGGCTCGA GTTCGACGGT GACACGGTTG CGTTCAACTC CGGCGTTACT GTCAATGCAG GTTCGAACAC AGACGTAACT CTGACGTGGA GCGTTCACGA GGCGATGAAT GGTTCGGATC TCGTTGCGAA GACAGGCCAC GATAGTGATC TCTTCGACCT GACGATCGAG CGTGTCCAGA CGTCGACGAA GCCGCTGACG TCACCCAGCG GGGAGCCGAT CGACGTCGAT CTGGAGAAAA TCGCGGTCGG GTAA
|
Protein sequence | MGNRAAGGLY RKIRADERSV SEILGTVLIF SFVIFLSVGL IVIGLSAFQG ATAQTEDRLA QDSMQEMGDR LHSLTGSQID TATEFEFPTG TGDDINALDE GVVNITIETH SDYVGLVEAS DASNSTEIDL GTIEHEAEDG TITAYQGGAL FERQGDLIEI LQEPTFDYRG DAIDLSFHSV DIDQISDAES ITAKRLRKQS EDQSEKLREM MRPHWNLTGY SDIMAPVTIT VTIESEYADA WQMYAQNRMT ETPTVNRNGN EVEVVFDKFD GGLTFNTNQT FDDDVFYSGE AALTGLVNVS NATIGGSGGE VEVAEIDGHG AASPSPHYIL GVYNETASEW MIYNTTDGTV RTLTGDVVTN PDFLDSPPTL SNNDTYQIDP KDTWTCVVDG DPSKTDHEEF VDYVDTSGEG CLSEPLVGDA PDDSVASPHF NVSFDDVRLA GSKNINTDDV VAGQDKIELD WTVENDYVSN GTTPVVLLFS EKGSSDWIPL EKADVTLNGT GDTASDTFTV NATGSANVTF QVATLDTNDK SQEIEIVKRP ERGKFQIDSL SVNKNTLTAG EDLQVDVEIN NTASISDTQL VELQFDDNSG PAAVAWKKVS VNAGTTKTVS INWTTTNAFN TSNGEVIAET YYDNESETNI TIEEDTGANA SFDVTIGNVS PDPATDGDTV TVATNVTNVG NETSEQDIAL LANGNIADVK LNQNLTSNQS KTVSLKWDSI GYGGENIILT VASADDTNQT TISVQELEPA EFLIDSLTVP KTNLTASEDL QATAEINNTA SRNETQIVEM QFDGTPVAWN EVSLSANAVQ SVSLNWTTTN AFETSYGTVT VETYDDSAAE SPIEIQKGME TNATFAVDIE NVTQPIHDGD IVTVEANVTN VGNETTAQDI ALLADGNIAN ATLNQSLSPG QSTTLSLKWD SSNRGGENVT LTVASADDTD SETVSVTAVD PPDYNIAIDD VTVDGDPNGA VDPGNSTVTV EATVSNSGDS ESPIVWLEDF EDRVVAVDDS VGIIGMRASD PDSTSVTLQW NVPSDVNTTD PEITVAVDGD EDSEELDIES RFVVEDIQTT KVKSSAFSEL QIEDDSTSGG FWTSDESEFT VDYEVNPTSD FDNVTIVFDG DGGVYDEQVS TTTDGTVTHS RNGEYGSTYD ITAQIMDDGS VVETVTRGGE VADGDDPDMS ERSGNLVDSV TFEATIENIG DTTVSDTVQL FDPAGNEIGD TSVSLDGGDD TTVVFNWVGP DRTGSVEVTT KDDSASERVI IERDGPVCEA VSYTEDSEGR YEISNVDQLQ CINEHGLDEE YVLVDDIDAS GTEHWNDGKG FDPIGPEGHD SISIPEDDPW ETFSGEFDGN GHTIEGLYID RPSENYVGLF GATDRPYDDV PVGYGSTVEN VILEDVHVVG NMYVGGLAGQ AGGEIINARA EGYVEADKQL VGGLIGLGAH ADIDNRIVAD VEVVGGNVPS HLSGVQEGNA KGIGGLIGRA AWGTEIATGY STGDVTGPVN VGGIMGSSSL VDSEFEEMYS ASTVTATAPG AEGGSIVGIV QSDGDRFNED MYYDEALESS AWGEAERWTG YSIVDMDIDE VEQTDWIGRT TTEMQGLDVN EPGRLGNLDF EEDGGPWVAV LGDYPRFQWE LEAEGQVGVD IDEDNLDPVV AGQSLTVPVT LTNSNYENVT QTIRLLSDGT PVDSTSVTIT ERTPGGNSTT KNVSLTWNTN EDDVGPTSLE VRSEDDTDQE TIEIDAPAVG DYLVDSVSAN DPTTAGDVLT VTADIRYNGS TSPSPELVTL QNFAGGVVDS TNVTGNTTVA LTWDTSEGLI SGGNITDSIT VGTSDDTNTT QVTIEAAGDG EFNVTAVSTN APITEGESLN VTAEIENTGT ETGTQEVLLW DFDGNAVDIG VVSLNAGDST TVDLTWQTAV GDNGTGTIEV ITGDDTRTAT VDITPAGTKQ YTFEHTSVDD PTTAGETLNV TAEVSYNGTV SPPPSEPVTL QDFDGNVVDH ETVSGNNTVT LQWDTSENLV TSGSISDDIT VGTSDDIETE EVTINAAGDG DFQIQNLATN DPVTEGDTLD VTATVENVGS ENGTQNVLLR DHGDNAVDIT SVSLDAGNST TVTLNWNTTV GDNGQDDITV TTMDDTATQQ VTIEEQPEEQ LDVAITEVRY NGSQITSGTT VEAGATLEAD VTVSNATQQV TKPVWLEFDG DTVAFNSGVT VNAGSNTDVT LTWSVHEAMN GSDLVAKTGH DSDLFDLTIE RVQTSTKPLT SPSGEPIDVD LEKIAVG
|
| |