Gene Information |
Locus tag | Hore_18610 |
Symbol | |
ID | 7312675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1983804 |
End bp | 1990637 |
Gene Length | 6834 bp |
Protein Length | 2277 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643612308 |
Product | YD repeat protein |
Protein accession | YP_002509605 |
Protein GI | 220932697 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
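The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names and the `check_record` helper are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information table.
start_bp = 1983804
end_bp = 1990637
gene_length_bp = 6834
protein_length_aa = 2277

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
computed_protein_length = computed_length // 3 - 1


def check_record():
    """Return True when the listed lengths agree with the coordinates."""
    return (
        computed_length == gene_length_bp
        and computed_length % 3 == 0
        and computed_protein_length == protein_length_aa
    )


print(computed_length, computed_protein_length)  # prints: 6834 2277
print(check_record())  # prints: True
```

Under these assumptions the listed gene length (6834 bp) and protein length (2277 aa) are consistent with the start and end coordinates.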
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAC TTGTATTGGT ATCTATCATG TTAATTTTTA TTTTGTTATT TCAAATGTTT TTTATTGTCT TTGATAGTGG AGTTATTTTT GCTAACAATG GACAAGATGA AAATGATGAA GAAGAGGAAG ATGATGAAGA AAGTGAAGAA AATGAATGTG ATCATCCTGT TACAAGTACA TACTATCATT GGGAATATAC CAGCAATAGT CACTGGCTCG AATGGGAAGT GTATTGTGTT AATTGTGGTA AAACCCTGAG TACAAATTCC AGTTCACCAG AACCTCATAA TATTACTTCA AGTTATAAAA CAAAGAACTC TTCTAAATCC AGGTGTACAC AGATAGTAAC AGAGTCGTGT AGTACCTGTG GTTATACACA TACCTATGAA AAAACGTATA ATCATACAAG GGGTACAACT TGTTCTAAAT GTGGTACATA TTATCCGACC CACGAAGAAG AGGCTGAACA GAAGACAGGA GAAAGTAATG AAAAAAGTGG GGAAGGAGAA GATGAACTTG ATGAAGGTGA AGAAGCCAAT GATAATGCTG ACCAGGAGAT AAATGATGGG GAACAGAGTA ATACTGATGC AAATAATGAA TATGGAGACA CTGAGTCTGA CCTATCAGAT GCCGGAGAGG AAAATGATAA TGCAGAAACG GATATTAATA ATGCAGATGA AAATATTGGT TCATCAAATG ATTGTACAGC AGAAGCTGAT CAATATAATA CATCTGCTGA AGAGGGTATG GGTAATTCAG AATATTCACA GGGCGAATCA TCAGATAATT TTGACGAAAC ATTAGCATGT GAAGAACAGG CTGAGGGCTA TGAACAGGAT GTTAATGACA TTTACAATGA TTTAGAGGAA ACTGAAGACG GAAACACAGA CCTTATGGAG GGAGAAAATA ATAGTGCAAA CAGTGATACA GCCGGGGATC CGGTTAGATA TACTACCGGT GAATATGTAA GTAAGAGTAC TGATTTACAA ATTAATTCAG TTGAACCCCG GATAAAAGTA GTACGCAGTT ACAGTAACTT TGATACAAGT AGTTTTTCTT TTGGTAGAGG CTGGAATTTT AACTATGATA CCAGGATTAT AATTGGCATC AGGGCTAATT ATATTGAACA GCAGGAAGAA TTACAGGAGC TTGCAGAATT AGTTGACGAA ACATACCAGG AAGCTGTAAT AGCATATCAG GAAGCAATGG ATGCTGCATG TGAGTCAGTT GAATTTGCTA AGGAGGCAGT AGAAGATGCA GAAAAGGCGG TTAATGCTGC CCGTGACGCA AAGGACTATG CTTTAACTGC CAAAAACCTG GCTGAGAGTG GAAAAATAAA TGCCAGTGAG GCTAAATACT ATGCTGACTC TGCTTTTGAC CATTTAAATA AATGCCTTGA ATATGCAGAT AACTCTATTC AATCAGGGAA AGAGGCTGAA AACCAGGCAA AGTTTGCCAG AGAAAAAGCC ACAGAGGCCA GAAATATTGC TGATATAGCC ATTGATTTGG CAGAAAGTGC TGTGTACCAT GCTAACCGGA GTGGTAAGCA GAGTGTTATT GATGCTGCTT ATGCTGCCCT GAACAGGGCG AATACTTTGA AAACTAACGC GTTAAATGAA ATAGATGAGG CAGACAATAA TATTGAAACA GCCAGAAAGA TTAAATCAGA TGCTCAAGAT TTAAAAAATA AAGTAAACAA TTTAAACCTG GCAGATAAAA TTAGACAGGC TAAAAATACT GCAGAAAATA ACTATAAAAC AGCTGAAAAA ACACTGGTAA TTGCCCGGGA TGTACTTGAA 
CTGGCTGAAG ATACTTTTAA TAAAGTTAAT AACACACTGG TATTAGCGGA AGCCCGGCTT AATGAAGCAA GTCAATGGAA AAAGGAACTT AATGACAATT ATACTGTAAT TGAAAATATC GATAAAATAA ATAACAGGGT GCAGGAAATA GCAAGACAGG CATTATTAAA CAGGGAGTAT TCAGAAGCAG GGCGTTATCG TAACCAGTAT AATGTTGACT GGAGTGTAAA CCCGGCAAGA GATGAAATAG GTATCGGTAA AATTGTTTTA ATAGATGATA AGGGCACCCC TCATATCTAT AAAATACTTT CAGAACCTGA CTTTGAGTCT AAAATTACAT TCCCTGATGG TTTAAAGAAC TATTATCCAG AAGGCAGTTT AACAGAACCG GTAACTATAA CTGATGACAG GTTAGAAATA TTACCTGATG GCAAATACCT GCTAACAAAG AAAGATAAAA CAACTTACCT TTATTCGTAT TTTGGCAGGT TGCTTAAAAT TGAAGAACCA AATGGTAGAT ACCTTAAGTT TGGCTACAAC GAAAACGAGC AGCTTGTGTC CATAGAGGAT ACATTCGGGC GGAAAGTAAA AATTGAACGG GTTAACGGAA AAATTGTTAA AATAACCGAT CCGGTTGAAC GTGTTTATAC CTATGACTAT GAAGGCAATA ACCTTGTAGC ATTTACTGAC CCTGAAGGGT ATACACGCCG CTATAGCTAT AATGAGAACG GGATAACTGG CTTAACTTAC CCGGATGGTT CAGGCTGGAA ATATTATTAC ACTGAATTAA ACGGTATAAA AGTCATTGAT TACCAGCAGG ATGCTGCAGG AAATATTATT GATTTTGAAT ACTATCCCGG GACAGGGGAA ACAGTGGTAA TTAACAGGAA GGGAAATAAA ACAACCTACC GGTATAATGA ACGACACCTG ACTGAAGAAG AGATCAATGC AAATTACCAG AGTATTATTA AAAGGTATGA TAATAACAAT AACTTAGTAT CTATAACTAA CCAGCGGGGT TATACAACGT CTTATACATA TGATAAAAAC AATAACATAA CAAGTGTAAC AGATGCTGCC GGTTCAATCT ATTTTACCTA CAATGATTTT AACAAAATTA CAAGTATTAC AGATAAAAAC GGATATACAA CCAATTTTTA CTATGATGAC CGGGGCAACC TGACCCAGAT CGTATATCCT GATGGTTCAA ATAAACAGTA TATTTATAAT GACCTTGGTT TATTAGTTGT TGAAATAGAC CAGCTTGGTA ACAGGATAAA CTATACTTAT GATGGTTATG GAAATATTAC AGAAAAGGTT TATCCAGATG GGAGCAGGGA AAAGTATGAA TATGATAAGG TAGGAAGGTT AATTAAAAGG ATCAAACCAG ATGGTGGACA AATAAGCTAC CATTATGATA ATAATGATAA TATAATCAGG GTAGTTGATG AACTGGGTAA TGAAGAAACT TTTAAGTACA ACTCAAGGGG AAAAATAATT GAAAAAATAG ACCCTCGGGG TAATGTAACA AGATATGAAT ATGATGACAG AAATAATTTA AGCAAAATAA TAGATGCTGA AGGAAATATC AAAGAAATTT TCTATGATGA AGCAGAAAAT ATGACCAGGA AGATACTGGC TGAAAATGTT AGCTATGTTT ACCGTTATGA TAACCTTGAT AGGTTGATTA CAGCGACCCA GGTAGAAACA GGTATTACAA CTTCCTATGA ATATGACCCG GTAGGGAACC TTATTGCTAT TACAGACGGT GAAGGGCGGA CAACCAGGTT TGACTATGAT GGACTAAACC 
GGAGGGTAAG AGAGATAGAC CCGTTAAACA ATACTGTTGT ATACAGGTAT TACCCTAATA ATCAGTTAAA ATCTATTACA GATAAAAACG GGAATACAAC CAACTTTAAA TATGATTGTA TGGGAAGGCT GACCGAGGTT ATAAACCCAA TTGGTGAGAG GGTACAGTAT AAGTACGATG CAGCTGGTAA CCTGGTAGCC GAAATTGATC CAATGGGGAA TACCACCCGC TATCAATATG ACTGTATGAA TAGACTGGTT AAGGAAATAG ATCCTGCAGG AAATGAAATC CAGTATGAGT ATGACCTGTC CGGAAACCTT ATTAAAGTTA CAGACCCTGA AGGAAATACA ACTTCATATA CTTATGACCT GAAAGGAAGG GTTATTAAAG AAACCAATGC CCTGGGATAC AGTAAGACAT ATGAATATGA CGCAGTTGGA AATTTAATAA CCTTCACCAA TGAGGCCGGA GTAAAAACAA CATACAAATA TGACGGACTT AACAGGCTTG TTGAGATTAA GGACGCACTG GGTAACAGTA CGAAAATTGG GTATACCCCA CTCGGGAAAA TAGCCTGGAG GGAAGATGCC CTGGGCAACA GGACAGAATT TACCTATGAT TCTGCTGGAA GGCTTGTCAA AGAAACTGAC CCTGAAGGCA ATACAGTTGT GTATACCTAT GATAAAGCAG GCAACCTGAT TAAGGTAACC GATGAACTGG GCTATACTAC AAATTACTGC TATGATAAAC TCAATCGGTT GATAGAGGTC GTGGATGCCC TGAATAATAA AGTGGAGTAC AGTTATGATC CCAGGGGTAA CCTGACCATG ATGGTTAATG AAATGGGAAA TACCTATCAA TACCATTATG ATGCCCTGGA CAGGCTGGTT AAAGAAATAA ATTATCAGGG GAAAGAACAG ACCTACAGCT ATGATGCCAA CGGCAACCTG ATTGCTAAAA AGGACTTTAA CGGGAACACT ACAACCTATA ATTATGATGA ATTGAACAGG CTACTGGAAG TAATATTTAA CGGCGGGAAC AAGAAGCGGT TTAGTTATAA CAGAAACGGG ATGATGACCA GGGCCCAAAA TGACAACTTA CTTCAGAGGT ACTACTATGA TGGGCTATCA AGGTTGATAA AGGTCGAGGT TGAAGATGAT GAAGGGGAAA ACTATAAGAT AGAGTATCAG TATAATGAAC TTGGCCAGAA AACCAGGGTT ATCTATGATG ACAGTAATAA TTTAAAAGAC AGGGTAACCG GGTATGAATA TGATGAATTA ATGAGATTGA GCAGGGTTGA GCTGCCTGAT GGTGGTGAAA TAAAATATAG ATATGATAAG TTAAACAGGA TAATAACCAG GATAAATAAT AACAGAACCG CGACCAGCTA TACTTATACC CCTGATGGGC AGGTTGAGAC CATTACCCAC TGGAAAGGTT TTATAGGACA CAACCAGAAT ATAATCCAGT CCTATGGTTA TGTTTATAAT GCCCGTGATG AGAGGGTATT ACAGGTAGAA GAAAACGGGG AGATAACAGC TTACCAGTAT GACCCCGCAG GCAGGCTGGC AAAAGTTTAC TACCCCTTCA GTGACCGTAA GAAGATAGAG GATTTAAAAG AAAGATTCTA TTACGGCCTT TTACCCGAAT GGCCGGAGCC CTATAAATAC GGGCTTAATA TTGAAGAACC CCTTTCCTGG GACAAAGAAA ACAACCTTTA TAACCAGCTA AATGAACTGA CCGGAAAGCT GGAAGGAAGC CTGGCAGGTA TGCCTGGAGT TGGTAACAGC ACCCGGGGCC AGAAAGGTAA 
AGGCAGACTG CCGGAATTAA TAATCTCACA GGGTGAGGGA AGTTTGAACT TCACAGATAG AATAGACTTA CCCTATGACG TAAGAAACAG AGTAGAGGAG CTATACAGCA GGATAAAAAA TAACGGCTGG GGACTGGATA TATATGGTGA CAACTTCTGG GTTGAAGAAT TTACCTATGA CCCGGCAGGT AATATTACCG AAAAGAGGAA CGGGTGGGGT AAAATAGAAT ATAAGTATAA TGATGCCAAC CAGTTAGCAA AGGCCGGTAA CCGGCAGTAT GAATATGACA GTAATGGTAA TCTCATCAGG GAAGAGCTGG GACACTATTA TGCCGAATAC CACTATAATT ACGAAAACAG GCTAATAAAA GCAGTCAACA ACAGCCACCC CCATTTCCTG GGAGGCAAAA GTCCCTTTAA AGGTTCAGTA AGTTATACCT ATGGACCTCT TGGCAGGAAG GTTAAAAAGG TTACAGACCC ACAGCATGGA GCTAAAGTAG GTATAACAAA ATATATCTAT GATGGAACGA GAACAAATGT ACTGGTTGAA TATGAGATAG AACGCTTTGG TGGGGACCAC CCCGGAAACA ATAAACCTGG TAAAGGACAT AAGCATAACA ATAATCCATT TAACAATGCA GGAAAAATAA ACCGGATTAA CGAATATTAC TATGGGAATG GTTTAATAGC CATGAATTAC TTAAGCCACC CTGACAGGGG ACGAATCCAC TATGGTAATA ATGTATCCTA TTACCATAAA GATGCCCTGG GATCCATAAT ATTAATGACA GGCAGGAACG GACAGGTAAT CGACAGGTAT GAATATGATG CTTATGGAAA TCCCTACAGT GGCAGGTTTG AACAGGGTAA TAACATGAAC TCATATGGAT TCACCGGACA GAGATATGAA GCCAGGCTTG GAGTCTACAC CTTTGCCTAC AGGACATATA ACCCCAGGGT TATGAGGTGG ATAACTCCTG ATCCGGTAAG AGATGGGATG AACTGGTATA CCTATGTAAA TGGGGATCCG GTAAATTTAT GGGATCCGCT GGGGTTGTGT GATATTGATC CTGATAGCTG GAGAAATATA ATGAAGGAAC AACAGCCCAT GATGCGGGGT GATGATGTAG AACAGGTACA AACCTTTCTA AACCAACAGG GGTATGATGT AACAGTTGAT GGTATATTGG GCCCTGAGAC GGCAGGGGCA GTTAGAGACT ACCAGGAAGA TAAAGGTTTA TCTGTAGATG GTGTTGTAGG GCCCAATACC CGGGAAGAGA TAAAAAAGGA TCTGGGGATA GAGGATGTTA GGCATGAAAT ATATTTTAGT CACGATACAG ATAAAGTTTA CTGGACAGAT AATACCGGAA AAATTATAAA ATCATGGCAA GCTAGTGATG ATATTATAGG ATGA
|
Protein sequence | MRKLVLVSIM LIFILLFQMF FIVFDSGVIF ANNGQDENDE EEEDDEESEE NECDHPVTST YYHWEYTSNS HWLEWEVYCV NCGKTLSTNS SSPEPHNITS SYKTKNSSKS RCTQIVTESC STCGYTHTYE KTYNHTRGTT CSKCGTYYPT HEEEAEQKTG ESNEKSGEGE DELDEGEEAN DNADQEINDG EQSNTDANNE YGDTESDLSD AGEENDNAET DINNADENIG SSNDCTAEAD QYNTSAEEGM GNSEYSQGES SDNFDETLAC EEQAEGYEQD VNDIYNDLEE TEDGNTDLME GENNSANSDT AGDPVRYTTG EYVSKSTDLQ INSVEPRIKV VRSYSNFDTS SFSFGRGWNF NYDTRIIIGI RANYIEQQEE LQELAELVDE TYQEAVIAYQ EAMDAACESV EFAKEAVEDA EKAVNAARDA KDYALTAKNL AESGKINASE AKYYADSAFD HLNKCLEYAD NSIQSGKEAE NQAKFAREKA TEARNIADIA IDLAESAVYH ANRSGKQSVI DAAYAALNRA NTLKTNALNE IDEADNNIET ARKIKSDAQD LKNKVNNLNL ADKIRQAKNT AENNYKTAEK TLVIARDVLE LAEDTFNKVN NTLVLAEARL NEASQWKKEL NDNYTVIENI DKINNRVQEI ARQALLNREY SEAGRYRNQY NVDWSVNPAR DEIGIGKIVL IDDKGTPHIY KILSEPDFES KITFPDGLKN YYPEGSLTEP VTITDDRLEI LPDGKYLLTK KDKTTYLYSY FGRLLKIEEP NGRYLKFGYN ENEQLVSIED TFGRKVKIER VNGKIVKITD PVERVYTYDY EGNNLVAFTD PEGYTRRYSY NENGITGLTY PDGSGWKYYY TELNGIKVID YQQDAAGNII DFEYYPGTGE TVVINRKGNK TTYRYNERHL TEEEINANYQ SIIKRYDNNN NLVSITNQRG YTTSYTYDKN NNITSVTDAA GSIYFTYNDF NKITSITDKN GYTTNFYYDD RGNLTQIVYP DGSNKQYIYN DLGLLVVEID QLGNRINYTY DGYGNITEKV YPDGSREKYE YDKVGRLIKR IKPDGGQISY HYDNNDNIIR VVDELGNEET FKYNSRGKII EKIDPRGNVT RYEYDDRNNL SKIIDAEGNI KEIFYDEAEN MTRKILAENV SYVYRYDNLD RLITATQVET GITTSYEYDP VGNLIAITDG EGRTTRFDYD GLNRRVREID PLNNTVVYRY YPNNQLKSIT DKNGNTTNFK YDCMGRLTEV INPIGERVQY KYDAAGNLVA EIDPMGNTTR YQYDCMNRLV KEIDPAGNEI QYEYDLSGNL IKVTDPEGNT TSYTYDLKGR VIKETNALGY SKTYEYDAVG NLITFTNEAG VKTTYKYDGL NRLVEIKDAL GNSTKIGYTP LGKIAWREDA LGNRTEFTYD SAGRLVKETD PEGNTVVYTY DKAGNLIKVT DELGYTTNYC YDKLNRLIEV VDALNNKVEY SYDPRGNLTM MVNEMGNTYQ YHYDALDRLV KEINYQGKEQ TYSYDANGNL IAKKDFNGNT TTYNYDELNR LLEVIFNGGN KKRFSYNRNG MMTRAQNDNL LQRYYYDGLS RLIKVEVEDD EGENYKIEYQ YNELGQKTRV IYDDSNNLKD RVTGYEYDEL MRLSRVELPD GGEIKYRYDK LNRIITRINN NRTATSYTYT PDGQVETITH WKGFIGHNQN IIQSYGYVYN ARDERVLQVE ENGEITAYQY DPAGRLAKVY YPFSDRKKIE DLKERFYYGL LPEWPEPYKY GLNIEEPLSW DKENNLYNQL NELTGKLEGS LAGMPGVGNS 
TRGQKGKGRL PELIISQGEG SLNFTDRIDL PYDVRNRVEE LYSRIKNNGW GLDIYGDNFW VEEFTYDPAG NITEKRNGWG KIEYKYNDAN QLAKAGNRQY EYDSNGNLIR EELGHYYAEY HYNYENRLIK AVNNSHPHFL GGKSPFKGSV SYTYGPLGRK VKKVTDPQHG AKVGITKYIY DGTRTNVLVE YEIERFGGDH PGNNKPGKGH KHNNNPFNNA GKINRINEYY YGNGLIAMNY LSHPDRGRIH YGNNVSYYHK DALGSIILMT GRNGQVIDRY EYDAYGNPYS GRFEQGNNMN SYGFTGQRYE ARLGVYTFAY RTYNPRVMRW ITPDPVRDGM NWYTYVNGDP VNLWDPLGLC DIDPDSWRNI MKEQQPMMRG DDVEQVQTFL NQQGYDVTVD GILGPETAGA VRDYQEDKGL SVDGVVGPNT REEIKKDLGI EDVRHEIYFS HDTDKVYWTD NTGKIIKSWQ ASDDIIG