Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0009 |
Symbol | |
ID | 4808822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 12555 |
End bp | 18077 |
Gene Length | 5523 bp |
Protein Length | 1840 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105419 |
Product | YD repeat-containing protein |
Protein accession | YP_001036444 |
Protein GI | 125972534 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.886586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATAA AAATAACACC TGAAGAAATG ACAAGGATAG CGGGAAATAT AAGAGAAGTT TCAGATAAGT TTAATGATAT AGCACTGGAA GTAAAAAGGA TAATAAATTC TATTGATTGG GAGCTTAGAA GCAGGAAGGG AATAAACGAA AAATCGTCAA TAGCGCAAGC AACGGCAATA AAAATATCCG ATAGACTGAA TAAAATGTCA CAGAGCCTTA TAAGGGCAAG AGACAAGATG ATAGAGGCTG ACAATAAGGC TTCAGCCACA GCCAAAAAGA TGAAGACTGT AAGCTTTAAA AGGACGGTTG CTGCAGCAAC GTTGTCAAAT TATCCGTCAG ATGTAGAATA TGTTGATCCG TTGTGGAACC ATGTTGTAGG TCCCGGAACT GCAAACTGCC CTAATACATA TGTTGGAGAC CCTATAAATG TGGCATCGGG AAATTTCTAT TTAACAAGGA GAGATATAGC TATACCTTCA AGGGGGATGG GGCTTGAAAT AACCAGATAT TATAATTCGA TGGATAATAC GCAAGGTATA TTCGGTAAAG GTTGGAGGAT AGATTATGAA ACGTGCCTTA AGAAGAAGGA AGACAGTGAA GACATAATAG TAGTATACCC GGAAGGAAAT ATAAGAGTAT TTGAGTATAC CGATACAGGA GATTTCAAAT CTTCCAAGGG TGTGTACGAC ACGCTTTTTA AGGCGGAAGA CGGTACATAC ATATTAAAAG TTCAAAAAGG AATTACATAC AAATATGACC AGGCAGGAAG CCTTATATCA ATTTCCGATT CAAATAACAA CGAGATACAA CTCAAATACA ATCGCGAGGG ATTGCTGTCT ACGGTAATAT CAACACGCAG AAAACTTTTG ATGTTTTCTT ATGAAGGCTA CAGGGTTAGC AGCATAACTG ATCATACGGG AAGGAAGCTG AAATATAAAT ACGATGAAAA AGGGAATCTG ACACAGGTAA TATACCCTGA CGGAGGGAAA ATTACCTATG CCTACGATGA CGTAGGGTTA ATTTCAGTAA CCGACCAAAA CGGCAATACC TATGTCCAAA ACACCTATGA TAAAAACGGA AGGGTAATAA AGCAGCTTGA CCATCAGGAG AATGAATTGA TTATAGAATA TGACGATGAG AATCGTGAAA ACACCTTCAA ATGGTTAAAG AGCGGTATAA CCCGTGTATA CAAATACAAC ACGGACATGC TTGTTACTGA AGTAAGATAT GATGATGGAA GTGTGCAGAA ATACACCTAT GATGAAAATC TTAACAGAAA CAGCGAGACC GACAGAAACG GTAACACAAC ATGCAAAAAA TATGATGATA AGGGCAATTT AATAGAAGTA ACCTCACCGG AACCTTTCTG CTACAAAACA AAGTACAGCT ATGACGAGAA TGGCAGACTG ATAAAAACAG TGTCACCGGG CGGCGGAGAA GTAACATTTG AATATGATGA TAGGGGCAAT CTTTTAAAAC GTACCTTAAA AAACGGAAGC AGAAGTTATT CGGAGTGGTC GTATACCTAT GACCAATACG GTAGGATAAC AAGTTCAAGA GATGCTGAGA ACAATATAAC AAGCTTTGAG TATGGAGAAG AAGACGTAAA CAAGCCAACA TTAATAAAAG ATGCGGTGGG GAATACATTC AGATATGAAT TTGACAAAGT TGGACGAGTG ACAGCAATAA CGACAGGTTA TGGCACAGTG AGGTTAAGAT ATGATGACTG CGACCGGATA ACCCACATAA CCGATGCGGA AGGAAATATT ATAAGACTTA ATTATGATAA AGCAGGAAAT ATGACAAAGA TTATAGCGCC GAAGCAGTAT GCTGAAAAAG GCGAAAACGG GGCAGGATAT ACTTTTGAAT ATGATGCAAT GGATAAGCTG ATAAAGACTA TTGACCCATT GGGTAATGTC TTTGCGGTAA AATATGATGA AAACGGTAAC AAGATAAAAG AAATCAACCC GAACTACTAT AATTCTCAGA AAGATGACGG TATAGGAATA GAATACAAAT ACGACACCAA CCACCGCAGG ATAAGCACAA TATTCCCGGA CGGAAGCATG TCAAGAACGA AATATGACCC TCAGGGCAAT ATAATAAAGA CTATATCATG GAAGGACTAT GACAAGGATT TGGATGACGG ACCGGGAATG GAGTATACTT ATGATGAAAT GAACAGACTT ACGCAAATAA TAGACCCTGA AGGAAATGTA ATAAAGAAAT ACATATATGA CGAAGAGGGA AGGATCGTAA AGGAAATAGA TGCAAAAGGC TACAGCAGTG CAGATAATGA CGAAGAACGT TGGGGAACAA TATATAAATA CAACCTTGCC GGCTGGCTTG TTGAAAAGAG GACACCGCTA CGGCAGAAAG ACGGCAAAAT ATACTATAAC ATAATAGAAT ATGTTCATGA TAAAAACGGA AGAGTAGTGA AGGAGAAAAG ATCTCCGGAA TATGTGACCA AATCGGAATA TCCGAAGAAA TGGAACGTAA TAAACTACAA ATATGATCCG AACGGGAATT TAATAGAGGT AACAGACGGC CTTGGAGCCT CTATAACCTA TGAGTATGAC AGCCTTGGAA AAAAGACTTT GGAAAAGATG AAAATAAACG ATAAAAAACA AAAAGTAACG CGATATGTGT ATAACAGTGT GGGAAGACTG GTAAAAGTAA TACAAGAGCT GGATGGAGAA GACCTTTCAG GCTACAGTAA TGAGAAGGTT TTAGCAGAGA CAATATACAA TTACGATCCA AACGGCAATC TTATAGAAGT AATCTCTCCG GAAGGATATC TTACAGTATT TAAATACGAT GATGCAAACC GTAGGATAAA GAGTATATTG TATCAACCGC AAAACGGTGT GAAACTGATC GGCAGTGCGT ATTGTGCCCT TTTAAATACA AAGGCGAGGA GCATAAGCTA TGAGTATGAC CGGGCAGGGA ACCTTGTAAA ACAGATATTG CCTAACGGTG GCGTCATAAT AAACGAATAT GACGAGATGA ACAGAAAGAT AAAAGTTACT GATCCTGACG GAAACACCAC AAGAATTTTC TATGACAATA CAGGAAACAT TGTAAAATAT GTTGAGCCGG GAAATTACAA TCCGGAGAAA GATGACGGAG TGGGTACTAC ATATCTTTAT GACTCAATGA ACCGTCTTAT ACAGATAGTA AATGCTGCAG GAATAGTGGT GGAAAAGAAT ATATACAATA CTGCGGGAGA AATAATCAAG AAAATTGACT CAGTAGGTTA TAAATCCGCA GATAATGACA ATGACAGGCA TGGAGTTGAA TTTAGTTATG ACCTGGCAGG ACGTTTGGTG GAGATAACAA CGCCGGAAGC GAAGATTCAC GGACGAAAGA GCCAGCAATA CACTTATGAC GCAGAAGGAA ACATAACGGG AATAGTTGAC GGAAACGGAA ACAGCACAAG GTACAGTTTG GACTTATGGG GTAAGATAAT AAACATAACG GAACCTGACG GAACCAATAT AAAATACGAT TACGACTATG CGGGAAACCT TGTGTCCACT ACTGACGGTA ACCGAAACAC CACTCGGTAT ACATACAACA GCTTGAACCT TCTGTCGGAG ATAATAGACC CTGACGGAAG GAAAATAATC TTCAAGTATG ACAGACAGGG AAGAATGGTG CAAAGGATAG GGAAAGACGG ACACAGCACA TATTACACCT ACAATGCGGA CAACAATATA ACCGGCTGTT GGGAAGAAGA AGGACAGATG GAGAGGTACG AGTATAATAT TGACGGAAGT CTGGCTGCGT CAATAAGCGG TACCACTATA CATACTTATA CCTATACCTT GGCAGGGAGA CTGAAAAGTA AGACAACCAA CGGGCGGAAA GTGTTGGAGT ATGATTATAA TAAAAATGGG TTTGTATCCA GGCTTGTTGA CATAAGTGGA ACACCGGTGG AGTATACATA TGACGTGCTC GGAAGAGTAG CTGCGATAAC AAATGGAGGA AAGATTTCCG CAAAATATGA ATACAATATT GACAACACGA TAGCGCAGGT CTTGTATGGG AGCGGAATAT GCGCGAGATA TGAATATGAC CTGGATAAGA ATATAACGAG GCTTTTGAAT ACAGACCCGA CAGGAAAAGA GATGTTTGTG TACAAATATG CCTATGACGG AAATGGAAAT CCGATTTTGA AGGAAGAGAA CGGAAAAGTA ACGGCCTACG GCTATGATGC ACTAAACCGC TTGCAGGAAG TGATATACCC TGGAAATGTA AAAGAGAGAT TTACGTATGA TGCGAACGGC AACAGGCTCA AGAGAGAGTT CAGAGACATA TTGGAGCAAT ATGAGTATGA TAATTGCAAT AGATTGATTC AAAGGATAAA AAATGGACTG GTAACGGAAT ATGAATATGA CGAAAAAGGA AATTTGATAA AAGAAAAAGA AGGTGAGTTG ACTAAATTAT ACAGCTATGA CGGATTTGAC AGACTGATAC GTGTGCAGAA TCCCGACGGA ACATATATGG AGAATATATA TGATGCCGAG GATTTGAGAA CGATATCGAT AGAAAACGGA AGATACAACA GATATATATA CAGCGGAAGA AATATAGCGT GTGAAGTAGA CGAGGACTGG AGACTGAAAG ACAGAATAGT GCGAGGGCAT ACGATATTGC AAAAAGAGGA CGGCAATAAG AATGCTTACT ATTATATTCA CAATTCTCAT GGAGATATTA CCGCTCTTAC TGATGATAAA GGGAAAATAG CAAACGCTTA CAGTTATGAT GCTTTTGGAA ATATATTGGA CAGTGTTGAG AGAGTAGATA ATAGATTCAA GTATTCAGGC GAAGTGTATG ATTCTGTAGC AGGTCAATAT TACCTGAGGG CAAGATACTA TAATCCGAGT ATAGGGAGAT TTATACAGGA AGACCCATTC AGAGGCGATG GACTTAACTT ATATGCCTAT GTTGCCAATA ATCCTGTAAG GTACATTGAC CCGACCGGTA ACTGCAAAGA AGGTGTTAAC TTTAATAACA ACCTTGAGGA AACGACAAAA GTGATTGTTG AAAAAAATGA AAGTAAAGGC AATTCGGTTT GGGATATAGT AAAGGATATT TTTGTAGATA CAATTGAGGG TGCTGTAGAC GAGATAATAG CAAAGAAAGT TATTGAGAGA TTACCTAATT ATTCAGCAAA AACATTGATG GGGCATGCAA AATTTGGTTA TGCTTTTGAG GCAACAAAGG TTAAACCTAT ACCTGGGGTT AAGGGACTAA AATCATTTTC AAAAGTTGCA GGCCCTGTTG CAGTTATGGT GTATGGATAC GAAGTTTATC AGGATATAAA AAAATACGAT GGCTGGGATG CGGTTAAGGC TATTACGGTA ACTACAGTAG CAACGGCAGT AACATTTTTG GCTGGCTTTA TTTTAGCACA AGCTGCTCTT CCGGTAGTTG CAACTCTTAT TGTGAGTGCT GCAGTGGGTA TTGCTATTGG ATTTTTTGCA GATTGGGTTA AGAAAAAATG GATAGGTTAT TAA
|
Protein sequence | MQIKITPEEM TRIAGNIREV SDKFNDIALE VKRIINSIDW ELRSRKGINE KSSIAQATAI KISDRLNKMS QSLIRARDKM IEADNKASAT AKKMKTVSFK RTVAAATLSN YPSDVEYVDP LWNHVVGPGT ANCPNTYVGD PINVASGNFY LTRRDIAIPS RGMGLEITRY YNSMDNTQGI FGKGWRIDYE TCLKKKEDSE DIIVVYPEGN IRVFEYTDTG DFKSSKGVYD TLFKAEDGTY ILKVQKGITY KYDQAGSLIS ISDSNNNEIQ LKYNREGLLS TVISTRRKLL MFSYEGYRVS SITDHTGRKL KYKYDEKGNL TQVIYPDGGK ITYAYDDVGL ISVTDQNGNT YVQNTYDKNG RVIKQLDHQE NELIIEYDDE NRENTFKWLK SGITRVYKYN TDMLVTEVRY DDGSVQKYTY DENLNRNSET DRNGNTTCKK YDDKGNLIEV TSPEPFCYKT KYSYDENGRL IKTVSPGGGE VTFEYDDRGN LLKRTLKNGS RSYSEWSYTY DQYGRITSSR DAENNITSFE YGEEDVNKPT LIKDAVGNTF RYEFDKVGRV TAITTGYGTV RLRYDDCDRI THITDAEGNI IRLNYDKAGN MTKIIAPKQY AEKGENGAGY TFEYDAMDKL IKTIDPLGNV FAVKYDENGN KIKEINPNYY NSQKDDGIGI EYKYDTNHRR ISTIFPDGSM SRTKYDPQGN IIKTISWKDY DKDLDDGPGM EYTYDEMNRL TQIIDPEGNV IKKYIYDEEG RIVKEIDAKG YSSADNDEER WGTIYKYNLA GWLVEKRTPL RQKDGKIYYN IIEYVHDKNG RVVKEKRSPE YVTKSEYPKK WNVINYKYDP NGNLIEVTDG LGASITYEYD SLGKKTLEKM KINDKKQKVT RYVYNSVGRL VKVIQELDGE DLSGYSNEKV LAETIYNYDP NGNLIEVISP EGYLTVFKYD DANRRIKSIL YQPQNGVKLI GSAYCALLNT KARSISYEYD RAGNLVKQIL PNGGVIINEY DEMNRKIKVT DPDGNTTRIF YDNTGNIVKY VEPGNYNPEK DDGVGTTYLY DSMNRLIQIV NAAGIVVEKN IYNTAGEIIK KIDSVGYKSA DNDNDRHGVE FSYDLAGRLV EITTPEAKIH GRKSQQYTYD AEGNITGIVD GNGNSTRYSL DLWGKIINIT EPDGTNIKYD YDYAGNLVST TDGNRNTTRY TYNSLNLLSE IIDPDGRKII FKYDRQGRMV QRIGKDGHST YYTYNADNNI TGCWEEEGQM ERYEYNIDGS LAASISGTTI HTYTYTLAGR LKSKTTNGRK VLEYDYNKNG FVSRLVDISG TPVEYTYDVL GRVAAITNGG KISAKYEYNI DNTIAQVLYG SGICARYEYD LDKNITRLLN TDPTGKEMFV YKYAYDGNGN PILKEENGKV TAYGYDALNR LQEVIYPGNV KERFTYDANG NRLKREFRDI LEQYEYDNCN RLIQRIKNGL VTEYEYDEKG NLIKEKEGEL TKLYSYDGFD RLIRVQNPDG TYMENIYDAE DLRTISIENG RYNRYIYSGR NIACEVDEDW RLKDRIVRGH TILQKEDGNK NAYYYIHNSH GDITALTDDK GKIANAYSYD AFGNILDSVE RVDNRFKYSG EVYDSVAGQY YLRARYYNPS IGRFIQEDPF RGDGLNLYAY VANNPVRYID PTGNCKEGVN FNNNLEETTK VIVEKNESKG NSVWDIVKDI FVDTIEGAVD EIIAKKVIER LPNYSAKTLM GHAKFGYAFE ATKVKPIPGV KGLKSFSKVA GPVAVMVYGY EVYQDIKKYD GWDAVKAITV TTVATAVTFL AGFILAQAAL PVVATLIVSA AVGIAIGFFA DWVKKKWIGY
|
| |