Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_20420 |
Symbol | |
ID | 7314366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2200095 |
End bp | 2203841 |
Gene Length | 3747 bp |
Protein Length | 1248 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643612486 |
Product | conserved repeat domain protein |
Protein accession | YP_002509782 |
Protein GI | 220932874 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGGGTA AAAAACCTGG ATGGGATAAA TTTTTCCTCA TACTTATTTT CCTGTTATTT ATAACCTGGG CCTGCCAATT TTCCAGGGTA TTTGCCAGTG ATACCGATGA TTATATAGAT GTTGATTTTT CCAGTTCTAC TGTTAAGTTT GTTAACATTA GTCCTGATGG CAGTGGATAT ATTATCCCAT ATGCTACCAA TCTCAGTATA GTTTCCCCCC TTCGTAAATG GGAGTTATAT ATAGAAGCTA CCGGTGATCT TGTCAATAAT AATGACGGGA CCAGGATACC GATCAGTAAC CTTTCATGGG CTTTACATAG CGACAATGGT AAAACCAACT GGAGGCCCCT TAAAACAAAA CCAACCCTTG TAAGTAAAGG ACTTCCCCGT AAGTCACCGG GTTATGTTGC ATCATTTGAT TATAAACTGG CTATAAATCC GGGAGTTAAA GCTTCTTCTT CCCCGAAAGC AATTTATGAG ACGGCAATAA GGTTTAATTT AACAGCAGGG TCTTTATCTT CAAGCCTTGC TTATCCCAAT CCTTTTTCAC CAAATGGTGA TGGAATTAAG GATACAACTG AAATATCGTT TAATATTACA GAAGAGTTTG TGAATATCCC GGTTTTTGGT TATATAGTTG ATGTTGATGG AAATATGGTA AGTATTTTTG ATGACCTGTG GTTTAAGTAC TTTGATAAAA CGGGGATATA TAAAATATCC TGGGACGGCC AGGGTTTTGA TAATGAAGTC TTGCCTGATG GACAGTATTA TTATATTATT GAGGCTTTTT CCTTGTCCGA TACCTGGCCT TTGCTTGACT GGTTTAAAGT TATCGGGGTA GGAACGCTCG AGATTAGTAA TAATACGGCC CCCGGTGATT CTTCCATTAA AGGTCGGGTT TTTAGTGAAG AAGGTGGTTT TTTACCCGGA GTTAGTATTG ACCTTTACAG TGAAAGTGGT ATCAAGGTAA CAGAGACAAT AACCGATTCT AAGGGCAAGT TTAGAATAAA TGACCTGGCC GGTGGTTATT ATTACCTGGA ACTTTCCCGG GAAAATTATC TGCCGTACCG TACCCCTGTG TTTTACCTTG AGGAAAAATC TAATCTCAAA AAGGATATTA TTCTGAGCCA TAATTCATCC CTTTATATAG ATGTTAAGGC TGATAAGACC CAGGCCAGAC CAGGTGATGT TATTACATAC AGTGTTTACG TTGAAAATAA AGGGACATCC AGAGTCAATA ACACCTGTCT TCAGGAGAGG TTACCGGAAG GGTTGGCCTA TCTGGAAGGA AGTTTTGAAT CAACCCGGGA TCTTTCCGTT AGTGATTACA AGGATAGTCT CAGGTGGGAA TTTGGTAGCC TGGCTCCTGG TGACTTTTTT GAAGTAAAAT ATTCTGTGGT GGTGACTCCG GCTGCCCCGG AAGGTGAATT GAGAAACAGG GTTACTGTCA GTGGCACGGC TGAAGGTGAT TATGGGGGTT ATGTAGCTAG AGAGGGAAGT GCTCCTGTGA TTATAAGCAG AGGAGTATTT GCCAGTTGTG GCGATATTCT TGGTAGAATT ACTGGCTACC GGGAAGACCT GGATGATTCA CCATTAGATA AAAATATTTA CGTAAAATTA AGTAATGGTA GTCTGGCCTC TGTACATGAT GGTAAATACT ACCTTGAGGG TATCCCTGAG GGGAATTACC TGATCTGGAT AGTGCAAAAA CGAAACGGTA GATTTGTTAA TATTACAAAG CCTGAAACTA TCAGGGTAGC AGGTGGTCTG GCTGTAAGGA AAGATTTTTA TATAAGCAAA GGTGATTTTG TAAAAGCTGA TAACAAAGAG CAGTCCTATT ATGGTGTAGC TCAACTTGAC CTTAATTTTA ACAATGGACT CAAAATGAGT GGTGAGCTTG ATTTAGAATA TAAAACAGGG GTGAAAGACT GGGATTTATC GATAATATCA TCAACCAGGA GTCGTAAAAA CAGGGAGCAG GATGGTAATG GTATAGATAT ATTACAGGGT TCCCCGTTAA AAATAACAGC CAGTAGGGAA AAAGATATTG TTGAAATGGG AGAAGTTGAT TCAGGTGTCC TCGGCATGGC CTGGACAAGG GAGCTTGATA ATAAAAAATT AAATGTTTAT GCCGGGGCCG GTGAAGTAGG AGAAGCCCTG GAGGAAATTC CTCTTGATAA TTTCAGTGGT CCCTATCAAC TCGACCATTT CCCGGTGGTT TCCGGAAGTG AAGAATTATA TATATGTGAG TTTGACCAGG ATGGTAATCT GATCAATGCC TTTAAACCTG AATATTACCT TGATTATTAT GAAGGCAAGG TATATTTTAA TAAACATTAT ACTTATATTG ACTACCGTAA TTATAGAAAG GTAATGCGGA TTAAATATCG TTATAAAACT GGTTTTCTGG ATTGCACCAG GTATTATGGA TTTAATATAA CTACTGAAGA TGATAAAGCC AGACTTACGG GAGAGGGTTC GTTAAATGAT TCGGGCCATG CAGTTAATAT AGAAGGTGAA TATCGTAATA ATATAGAAAA AGGTGTCCTG ACCCTTACAG GTGAGGTAGA TTACCGGGCC AATAAGGAAA CAGATGGAAA TGCCATGAAT TTGAAAGGGA ATGGCGCCTA CAATTTCCAG GTTAAGGGGC ATGATTTAAA TCTAAAATTG TGGCATAAAC ATAGCAATAT TGAAGATATG GTTATTGAAC CCCCGGGGGT TTCAGGAAAC GAGACCGGGA TTGGTTTACA GGATTCATGG AATATTTTAC CTCAACTAAA AAATAAAAAC GATATCAGTC TGGTCAGGAA TTCAGAAGGT ATAATTGAGT CATTATTAAA TACAGAGTTT GTTTACACCA GGGATATATA TTCGGCTTCT CTAGCCTGGG AAGGAAGGTC CCATGGAGAG GACAATCTCA TTCTGACCGG TTTGATTGAG ATTGATAGGA TTAAAGACTG GAAGCAGCAT GTTTCCCTAA CTCTTTCTCA GGAAAAAGAC TGGATGCCTT TACTGATGTA TCATGGTGAA GCAGATCTGA AACAGGGCTA TCAGATTGAA AATGAGGTTC AGCTGGGGGC CAAGAATACT GTGGATGTTG GCCTGGACTA TAAGGGTGAT CACGGCATAA ATTATGAGTC TACCATAGCC TATGACTGGG ACCGGAATCA GCTGGGTTTA ACTGCCAGGA TGACTTCCGG ACAGGATAAA TTAAAGACTG ACGGTCAGGT TAACATCAAT TATGATATGG TAGCACATGT CCTGGATCTC TATCTGGAAC AGAAAGGGCT ATATCTACCT GAACAGAATT ATTATTTAAA AGAAAAAATC AGTGCCGATA GAACTTTTGG GAAGCAAAAA CAGGGTGACT TAATCCTTTC AATGTGGGCT GGTAAAGTAT TTGATAAAGG AAATATATTC AGCAAGAATA ATTTAGAGGT GAATGGTGGT CTTATCTGGA GAAATAATAC CGAGAAGGGA TATAATATAA ATACCGCGGC AACTTCTCTG GGATTACAAT ATGACATTAA ACCGGTAATT ATTAGTTTTA AAACCCAGCA CTGGTTACAG GATGTGGTTT CCTTTAATGA ATTGGTTGCT GACCTGGGAG TAATAATACC TGTAAAAGAC TGGTTAGAAC TAAAGGTGGG GTATCGTAAT AATTCCCTCC TTAATGACAA TGCTCTGGTT CCTGTTCTGG AAAAAGGGTT TTACATTAAT ATTACCGGTA AAGATTATAA CTTTTAG
|
Protein sequence | MGGKKPGWDK FFLILIFLLF ITWACQFSRV FASDTDDYID VDFSSSTVKF VNISPDGSGY IIPYATNLSI VSPLRKWELY IEATGDLVNN NDGTRIPISN LSWALHSDNG KTNWRPLKTK PTLVSKGLPR KSPGYVASFD YKLAINPGVK ASSSPKAIYE TAIRFNLTAG SLSSSLAYPN PFSPNGDGIK DTTEISFNIT EEFVNIPVFG YIVDVDGNMV SIFDDLWFKY FDKTGIYKIS WDGQGFDNEV LPDGQYYYII EAFSLSDTWP LLDWFKVIGV GTLEISNNTA PGDSSIKGRV FSEEGGFLPG VSIDLYSESG IKVTETITDS KGKFRINDLA GGYYYLELSR ENYLPYRTPV FYLEEKSNLK KDIILSHNSS LYIDVKADKT QARPGDVITY SVYVENKGTS RVNNTCLQER LPEGLAYLEG SFESTRDLSV SDYKDSLRWE FGSLAPGDFF EVKYSVVVTP AAPEGELRNR VTVSGTAEGD YGGYVAREGS APVIISRGVF ASCGDILGRI TGYREDLDDS PLDKNIYVKL SNGSLASVHD GKYYLEGIPE GNYLIWIVQK RNGRFVNITK PETIRVAGGL AVRKDFYISK GDFVKADNKE QSYYGVAQLD LNFNNGLKMS GELDLEYKTG VKDWDLSIIS STRSRKNREQ DGNGIDILQG SPLKITASRE KDIVEMGEVD SGVLGMAWTR ELDNKKLNVY AGAGEVGEAL EEIPLDNFSG PYQLDHFPVV SGSEELYICE FDQDGNLINA FKPEYYLDYY EGKVYFNKHY TYIDYRNYRK VMRIKYRYKT GFLDCTRYYG FNITTEDDKA RLTGEGSLND SGHAVNIEGE YRNNIEKGVL TLTGEVDYRA NKETDGNAMN LKGNGAYNFQ VKGHDLNLKL WHKHSNIEDM VIEPPGVSGN ETGIGLQDSW NILPQLKNKN DISLVRNSEG IIESLLNTEF VYTRDIYSAS LAWEGRSHGE DNLILTGLIE IDRIKDWKQH VSLTLSQEKD WMPLLMYHGE ADLKQGYQIE NEVQLGAKNT VDVGLDYKGD HGINYESTIA YDWDRNQLGL TARMTSGQDK LKTDGQVNIN YDMVAHVLDL YLEQKGLYLP EQNYYLKEKI SADRTFGKQK QGDLILSMWA GKVFDKGNIF SKNNLEVNGG LIWRNNTEKG YNINTAATSL GLQYDIKPVI ISFKTQHWLQ DVVSFNELVA DLGVIIPVKD WLELKVGYRN NSLLNDNALV PVLEKGFYIN ITGKDYNF
|
| |