Gene Cthe_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0009 
Symbol 
ID4808822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp12555 
End bp18077 
Gene Length5523 bp 
Protein Length1840 aa 
Translation table11 
GC content39% 
IMG OID640105419 
ProductYD repeat-containing protein 
Protein accessionYP_001036444 
Protein GI125972534 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.886586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATAA AAATAACACC TGAAGAAATG ACAAGGATAG CGGGAAATAT AAGAGAAGTT 
TCAGATAAGT TTAATGATAT AGCACTGGAA GTAAAAAGGA TAATAAATTC TATTGATTGG
GAGCTTAGAA GCAGGAAGGG AATAAACGAA AAATCGTCAA TAGCGCAAGC AACGGCAATA
AAAATATCCG ATAGACTGAA TAAAATGTCA CAGAGCCTTA TAAGGGCAAG AGACAAGATG
ATAGAGGCTG ACAATAAGGC TTCAGCCACA GCCAAAAAGA TGAAGACTGT AAGCTTTAAA
AGGACGGTTG CTGCAGCAAC GTTGTCAAAT TATCCGTCAG ATGTAGAATA TGTTGATCCG
TTGTGGAACC ATGTTGTAGG TCCCGGAACT GCAAACTGCC CTAATACATA TGTTGGAGAC
CCTATAAATG TGGCATCGGG AAATTTCTAT TTAACAAGGA GAGATATAGC TATACCTTCA
AGGGGGATGG GGCTTGAAAT AACCAGATAT TATAATTCGA TGGATAATAC GCAAGGTATA
TTCGGTAAAG GTTGGAGGAT AGATTATGAA ACGTGCCTTA AGAAGAAGGA AGACAGTGAA
GACATAATAG TAGTATACCC GGAAGGAAAT ATAAGAGTAT TTGAGTATAC CGATACAGGA
GATTTCAAAT CTTCCAAGGG TGTGTACGAC ACGCTTTTTA AGGCGGAAGA CGGTACATAC
ATATTAAAAG TTCAAAAAGG AATTACATAC AAATATGACC AGGCAGGAAG CCTTATATCA
ATTTCCGATT CAAATAACAA CGAGATACAA CTCAAATACA ATCGCGAGGG ATTGCTGTCT
ACGGTAATAT CAACACGCAG AAAACTTTTG ATGTTTTCTT ATGAAGGCTA CAGGGTTAGC
AGCATAACTG ATCATACGGG AAGGAAGCTG AAATATAAAT ACGATGAAAA AGGGAATCTG
ACACAGGTAA TATACCCTGA CGGAGGGAAA ATTACCTATG CCTACGATGA CGTAGGGTTA
ATTTCAGTAA CCGACCAAAA CGGCAATACC TATGTCCAAA ACACCTATGA TAAAAACGGA
AGGGTAATAA AGCAGCTTGA CCATCAGGAG AATGAATTGA TTATAGAATA TGACGATGAG
AATCGTGAAA ACACCTTCAA ATGGTTAAAG AGCGGTATAA CCCGTGTATA CAAATACAAC
ACGGACATGC TTGTTACTGA AGTAAGATAT GATGATGGAA GTGTGCAGAA ATACACCTAT
GATGAAAATC TTAACAGAAA CAGCGAGACC GACAGAAACG GTAACACAAC ATGCAAAAAA
TATGATGATA AGGGCAATTT AATAGAAGTA ACCTCACCGG AACCTTTCTG CTACAAAACA
AAGTACAGCT ATGACGAGAA TGGCAGACTG ATAAAAACAG TGTCACCGGG CGGCGGAGAA
GTAACATTTG AATATGATGA TAGGGGCAAT CTTTTAAAAC GTACCTTAAA AAACGGAAGC
AGAAGTTATT CGGAGTGGTC GTATACCTAT GACCAATACG GTAGGATAAC AAGTTCAAGA
GATGCTGAGA ACAATATAAC AAGCTTTGAG TATGGAGAAG AAGACGTAAA CAAGCCAACA
TTAATAAAAG ATGCGGTGGG GAATACATTC AGATATGAAT TTGACAAAGT TGGACGAGTG
ACAGCAATAA CGACAGGTTA TGGCACAGTG AGGTTAAGAT ATGATGACTG CGACCGGATA
ACCCACATAA CCGATGCGGA AGGAAATATT ATAAGACTTA ATTATGATAA AGCAGGAAAT
ATGACAAAGA TTATAGCGCC GAAGCAGTAT GCTGAAAAAG GCGAAAACGG GGCAGGATAT
ACTTTTGAAT ATGATGCAAT GGATAAGCTG ATAAAGACTA TTGACCCATT GGGTAATGTC
TTTGCGGTAA AATATGATGA AAACGGTAAC AAGATAAAAG AAATCAACCC GAACTACTAT
AATTCTCAGA AAGATGACGG TATAGGAATA GAATACAAAT ACGACACCAA CCACCGCAGG
ATAAGCACAA TATTCCCGGA CGGAAGCATG TCAAGAACGA AATATGACCC TCAGGGCAAT
ATAATAAAGA CTATATCATG GAAGGACTAT GACAAGGATT TGGATGACGG ACCGGGAATG
GAGTATACTT ATGATGAAAT GAACAGACTT ACGCAAATAA TAGACCCTGA AGGAAATGTA
ATAAAGAAAT ACATATATGA CGAAGAGGGA AGGATCGTAA AGGAAATAGA TGCAAAAGGC
TACAGCAGTG CAGATAATGA CGAAGAACGT TGGGGAACAA TATATAAATA CAACCTTGCC
GGCTGGCTTG TTGAAAAGAG GACACCGCTA CGGCAGAAAG ACGGCAAAAT ATACTATAAC
ATAATAGAAT ATGTTCATGA TAAAAACGGA AGAGTAGTGA AGGAGAAAAG ATCTCCGGAA
TATGTGACCA AATCGGAATA TCCGAAGAAA TGGAACGTAA TAAACTACAA ATATGATCCG
AACGGGAATT TAATAGAGGT AACAGACGGC CTTGGAGCCT CTATAACCTA TGAGTATGAC
AGCCTTGGAA AAAAGACTTT GGAAAAGATG AAAATAAACG ATAAAAAACA AAAAGTAACG
CGATATGTGT ATAACAGTGT GGGAAGACTG GTAAAAGTAA TACAAGAGCT GGATGGAGAA
GACCTTTCAG GCTACAGTAA TGAGAAGGTT TTAGCAGAGA CAATATACAA TTACGATCCA
AACGGCAATC TTATAGAAGT AATCTCTCCG GAAGGATATC TTACAGTATT TAAATACGAT
GATGCAAACC GTAGGATAAA GAGTATATTG TATCAACCGC AAAACGGTGT GAAACTGATC
GGCAGTGCGT ATTGTGCCCT TTTAAATACA AAGGCGAGGA GCATAAGCTA TGAGTATGAC
CGGGCAGGGA ACCTTGTAAA ACAGATATTG CCTAACGGTG GCGTCATAAT AAACGAATAT
GACGAGATGA ACAGAAAGAT AAAAGTTACT GATCCTGACG GAAACACCAC AAGAATTTTC
TATGACAATA CAGGAAACAT TGTAAAATAT GTTGAGCCGG GAAATTACAA TCCGGAGAAA
GATGACGGAG TGGGTACTAC ATATCTTTAT GACTCAATGA ACCGTCTTAT ACAGATAGTA
AATGCTGCAG GAATAGTGGT GGAAAAGAAT ATATACAATA CTGCGGGAGA AATAATCAAG
AAAATTGACT CAGTAGGTTA TAAATCCGCA GATAATGACA ATGACAGGCA TGGAGTTGAA
TTTAGTTATG ACCTGGCAGG ACGTTTGGTG GAGATAACAA CGCCGGAAGC GAAGATTCAC
GGACGAAAGA GCCAGCAATA CACTTATGAC GCAGAAGGAA ACATAACGGG AATAGTTGAC
GGAAACGGAA ACAGCACAAG GTACAGTTTG GACTTATGGG GTAAGATAAT AAACATAACG
GAACCTGACG GAACCAATAT AAAATACGAT TACGACTATG CGGGAAACCT TGTGTCCACT
ACTGACGGTA ACCGAAACAC CACTCGGTAT ACATACAACA GCTTGAACCT TCTGTCGGAG
ATAATAGACC CTGACGGAAG GAAAATAATC TTCAAGTATG ACAGACAGGG AAGAATGGTG
CAAAGGATAG GGAAAGACGG ACACAGCACA TATTACACCT ACAATGCGGA CAACAATATA
ACCGGCTGTT GGGAAGAAGA AGGACAGATG GAGAGGTACG AGTATAATAT TGACGGAAGT
CTGGCTGCGT CAATAAGCGG TACCACTATA CATACTTATA CCTATACCTT GGCAGGGAGA
CTGAAAAGTA AGACAACCAA CGGGCGGAAA GTGTTGGAGT ATGATTATAA TAAAAATGGG
TTTGTATCCA GGCTTGTTGA CATAAGTGGA ACACCGGTGG AGTATACATA TGACGTGCTC
GGAAGAGTAG CTGCGATAAC AAATGGAGGA AAGATTTCCG CAAAATATGA ATACAATATT
GACAACACGA TAGCGCAGGT CTTGTATGGG AGCGGAATAT GCGCGAGATA TGAATATGAC
CTGGATAAGA ATATAACGAG GCTTTTGAAT ACAGACCCGA CAGGAAAAGA GATGTTTGTG
TACAAATATG CCTATGACGG AAATGGAAAT CCGATTTTGA AGGAAGAGAA CGGAAAAGTA
ACGGCCTACG GCTATGATGC ACTAAACCGC TTGCAGGAAG TGATATACCC TGGAAATGTA
AAAGAGAGAT TTACGTATGA TGCGAACGGC AACAGGCTCA AGAGAGAGTT CAGAGACATA
TTGGAGCAAT ATGAGTATGA TAATTGCAAT AGATTGATTC AAAGGATAAA AAATGGACTG
GTAACGGAAT ATGAATATGA CGAAAAAGGA AATTTGATAA AAGAAAAAGA AGGTGAGTTG
ACTAAATTAT ACAGCTATGA CGGATTTGAC AGACTGATAC GTGTGCAGAA TCCCGACGGA
ACATATATGG AGAATATATA TGATGCCGAG GATTTGAGAA CGATATCGAT AGAAAACGGA
AGATACAACA GATATATATA CAGCGGAAGA AATATAGCGT GTGAAGTAGA CGAGGACTGG
AGACTGAAAG ACAGAATAGT GCGAGGGCAT ACGATATTGC AAAAAGAGGA CGGCAATAAG
AATGCTTACT ATTATATTCA CAATTCTCAT GGAGATATTA CCGCTCTTAC TGATGATAAA
GGGAAAATAG CAAACGCTTA CAGTTATGAT GCTTTTGGAA ATATATTGGA CAGTGTTGAG
AGAGTAGATA ATAGATTCAA GTATTCAGGC GAAGTGTATG ATTCTGTAGC AGGTCAATAT
TACCTGAGGG CAAGATACTA TAATCCGAGT ATAGGGAGAT TTATACAGGA AGACCCATTC
AGAGGCGATG GACTTAACTT ATATGCCTAT GTTGCCAATA ATCCTGTAAG GTACATTGAC
CCGACCGGTA ACTGCAAAGA AGGTGTTAAC TTTAATAACA ACCTTGAGGA AACGACAAAA
GTGATTGTTG AAAAAAATGA AAGTAAAGGC AATTCGGTTT GGGATATAGT AAAGGATATT
TTTGTAGATA CAATTGAGGG TGCTGTAGAC GAGATAATAG CAAAGAAAGT TATTGAGAGA
TTACCTAATT ATTCAGCAAA AACATTGATG GGGCATGCAA AATTTGGTTA TGCTTTTGAG
GCAACAAAGG TTAAACCTAT ACCTGGGGTT AAGGGACTAA AATCATTTTC AAAAGTTGCA
GGCCCTGTTG CAGTTATGGT GTATGGATAC GAAGTTTATC AGGATATAAA AAAATACGAT
GGCTGGGATG CGGTTAAGGC TATTACGGTA ACTACAGTAG CAACGGCAGT AACATTTTTG
GCTGGCTTTA TTTTAGCACA AGCTGCTCTT CCGGTAGTTG CAACTCTTAT TGTGAGTGCT
GCAGTGGGTA TTGCTATTGG ATTTTTTGCA GATTGGGTTA AGAAAAAATG GATAGGTTAT
TAA
 
Protein sequence
MQIKITPEEM TRIAGNIREV SDKFNDIALE VKRIINSIDW ELRSRKGINE KSSIAQATAI 
KISDRLNKMS QSLIRARDKM IEADNKASAT AKKMKTVSFK RTVAAATLSN YPSDVEYVDP
LWNHVVGPGT ANCPNTYVGD PINVASGNFY LTRRDIAIPS RGMGLEITRY YNSMDNTQGI
FGKGWRIDYE TCLKKKEDSE DIIVVYPEGN IRVFEYTDTG DFKSSKGVYD TLFKAEDGTY
ILKVQKGITY KYDQAGSLIS ISDSNNNEIQ LKYNREGLLS TVISTRRKLL MFSYEGYRVS
SITDHTGRKL KYKYDEKGNL TQVIYPDGGK ITYAYDDVGL ISVTDQNGNT YVQNTYDKNG
RVIKQLDHQE NELIIEYDDE NRENTFKWLK SGITRVYKYN TDMLVTEVRY DDGSVQKYTY
DENLNRNSET DRNGNTTCKK YDDKGNLIEV TSPEPFCYKT KYSYDENGRL IKTVSPGGGE
VTFEYDDRGN LLKRTLKNGS RSYSEWSYTY DQYGRITSSR DAENNITSFE YGEEDVNKPT
LIKDAVGNTF RYEFDKVGRV TAITTGYGTV RLRYDDCDRI THITDAEGNI IRLNYDKAGN
MTKIIAPKQY AEKGENGAGY TFEYDAMDKL IKTIDPLGNV FAVKYDENGN KIKEINPNYY
NSQKDDGIGI EYKYDTNHRR ISTIFPDGSM SRTKYDPQGN IIKTISWKDY DKDLDDGPGM
EYTYDEMNRL TQIIDPEGNV IKKYIYDEEG RIVKEIDAKG YSSADNDEER WGTIYKYNLA
GWLVEKRTPL RQKDGKIYYN IIEYVHDKNG RVVKEKRSPE YVTKSEYPKK WNVINYKYDP
NGNLIEVTDG LGASITYEYD SLGKKTLEKM KINDKKQKVT RYVYNSVGRL VKVIQELDGE
DLSGYSNEKV LAETIYNYDP NGNLIEVISP EGYLTVFKYD DANRRIKSIL YQPQNGVKLI
GSAYCALLNT KARSISYEYD RAGNLVKQIL PNGGVIINEY DEMNRKIKVT DPDGNTTRIF
YDNTGNIVKY VEPGNYNPEK DDGVGTTYLY DSMNRLIQIV NAAGIVVEKN IYNTAGEIIK
KIDSVGYKSA DNDNDRHGVE FSYDLAGRLV EITTPEAKIH GRKSQQYTYD AEGNITGIVD
GNGNSTRYSL DLWGKIINIT EPDGTNIKYD YDYAGNLVST TDGNRNTTRY TYNSLNLLSE
IIDPDGRKII FKYDRQGRMV QRIGKDGHST YYTYNADNNI TGCWEEEGQM ERYEYNIDGS
LAASISGTTI HTYTYTLAGR LKSKTTNGRK VLEYDYNKNG FVSRLVDISG TPVEYTYDVL
GRVAAITNGG KISAKYEYNI DNTIAQVLYG SGICARYEYD LDKNITRLLN TDPTGKEMFV
YKYAYDGNGN PILKEENGKV TAYGYDALNR LQEVIYPGNV KERFTYDANG NRLKREFRDI
LEQYEYDNCN RLIQRIKNGL VTEYEYDEKG NLIKEKEGEL TKLYSYDGFD RLIRVQNPDG
TYMENIYDAE DLRTISIENG RYNRYIYSGR NIACEVDEDW RLKDRIVRGH TILQKEDGNK
NAYYYIHNSH GDITALTDDK GKIANAYSYD AFGNILDSVE RVDNRFKYSG EVYDSVAGQY
YLRARYYNPS IGRFIQEDPF RGDGLNLYAY VANNPVRYID PTGNCKEGVN FNNNLEETTK
VIVEKNESKG NSVWDIVKDI FVDTIEGAVD EIIAKKVIER LPNYSAKTLM GHAKFGYAFE
ATKVKPIPGV KGLKSFSKVA GPVAVMVYGY EVYQDIKKYD GWDAVKAITV TTVATAVTFL
AGFILAQAAL PVVATLIVSA AVGIAIGFFA DWVKKKWIGY