Gene Cthe_1373 details


Gene Information

Locus tag: Cthe_1373
Symbol:
ID: 4809368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1671219
End bp: 1676972
Gene Length: 5754 bp
Protein Length: 1917 aa
Translation table: 11
GC content: 40%
IMG OID: 640106797
Product: YD repeat-containing protein
Protein accession: YP_001037798
Protein GI: 125973888
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
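
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1671219
END_BP = 1676972
GENE_LENGTH_BP = 5754
PROTEIN_LENGTH_AA = 1917


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length matches the coordinate span, and the listed protein length matches the codon count minus the stop codon.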


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATAA AAATAACTCC CGAAGAGATG ACAAGGGTAG CTGAGAGAAT AAGGCAATTA 
TCAGACAAGT TTGATGATAT GGCACAGGAT ATAAAAAGGA TAATAAGTTC TATTGACTGG
GAACTTAGAA GCAGAGAGGG AGTAGATCAA AAAGCGTCGT TGGCTCGTGC AGCTGCAATA
AATATAGCAA CCGGATTAAA AGAAATGTCA CAGGACCTTA TAGAGGCAAG GGACCAAATG
ATAGAGGCTG ACAATAAAGC ATCGGCTGCG GCAAGAAAGA TGAAAACTGC AGATTTTATA
AGCAGAATTG CTTCAGCAAT ACTGACTGGT AACGGTATTG CATCAGCCAT AGTAGCGGGT
TATTCGTTAT GGAACCGGTT GATAGGCCCG GGAACTGCAA ACTGTCCGAA CACGTTTGCC
GGAGACCCTG TAAACGTGGT GTCGGGAAAT TTTTATTTAA AAAGGAGAGA TATAACCATA
CCTTCAAGAG GTATGGCTCT TGAAATAACC AGATATTATA ACTCAATGGA TAATACGGAA
GGAATATTCG GCAAAGGCTG GAAAATAGAT TATGAAACGT GCCTGAAGAA AAAGGAAGAC
AGTGAAGACA TAATAGTGGC ATATCCGGAT GGGAATATAA GAGTATTTGA GTATACCGAT
ACAGGGAGTT TCAAATCTCC AAAGGGAGTA TATGACACTC TTTTAAAAAC AGAGGACGGT
ACATACATAT TAAAAGTTCA AAAAGGAATT ACTTACAAAT ATGACCAGGC AGGAAACCTT
ATATCAATTT CCGATTCGAA CAGCAATGAG ATACAGTTTA AATACAACCG GGAGGGATTG
CTGTCTTCAG TAATGTCACC GGGAGGAAAA CTTTTGATGT TTTCCTATGA AGGTGGCAGG
ATTGTCAGTA TAACCGACCA TACGGGAAGG AATTTGAAAT ATAAATATGA TGAAAAAGGA
AATCTGACAC AGGTAGTATA CCCTGACGGA GGGAAGATTA CCTATGCGTA CGATAACATA
GGACTGATTT CAATAACCGA CCAGAACGGC AACACCTATG TCCAAAACAC CTATGATGAA
AAGGGCAGAG TAGTAAGGCA GCTTGACCAT GAGAACAATG AATTGATTAT AGAATATGAC
GAAGAAAATC GTGAAAACAC CTTCAAATGG ATAAAGAGCG GCATAACCCG TGTATATAAA
TACAATACAG ACATGCTTCT TACTGAAATA AGGTATGATG ACGGAAGCGT GCAAAAGTAT
ACCTATGATG AAAATCTCAA CAGAAACAGT GAAACTGACA GAAATGGCAA CACAACATAT
AAAAAGTATG ATGACAAGGG CAATTTAATA GAAGTAATCT CACCGGAGCC TTTCTGCTAC
AAAACAAAAT ACAGCTATAA TGAGGAAGGC AGACTGATAA AGGTAGTGTC GCCTGGCGGA
GGAGAAGTGT CTTTTGAATA TGACGAAAGG GGAAACCTTT TAAAACGCAT TGTAAAGACC
GGAAGCAGAA GTTATTCGGA GTGGGCGTAT ACCTATGATC AATATGGAAG AATGACAACA
TCAAAAGATG CGGAAAACAA CACGAAGACC TTTGAGTATG GAGAAGAAGA TGTAAACAAA
CCGACATTGA TAAAAGATGC GGTAGGGAAC ATATTTAAAT ATGAATTTGA CAAAGTGGGT
CGGGTAGTGG CCACAACCAC ACATTACGGA ACAGTAAGAA TGAAGTACAA CGAATGTGAC
CGGATAACCC ACATAACCGA TACAGAAGGG AACACAACAA GAATCTGCTA TGACAAAGCA
GGAAACATGA CAAAGGTTAT AGCGCCGAAG CAGTATGCGG AGAAAGGCGA AAACGGGGCG
GGATATGCAT TTGAATATAA TGCAATGGAC AAACTTATAA GGACAATTGA CCCGTTGGGC
AATGTTTTTG CGGTAAAATA CGATGAGAAC GGCAACAAGA TAAAAGAGAT CAACCCGAAC
TACTATAGTT CTGAGAAAGA TGACGGTATA GGAATAGAAT ACAAATATGA CACCAACCAC
CGCAGGATAA ACACAATATT CCCGGACGGA AGCATGTCAA GGATAAAGTA TGACGCGGAA
GGCAATATAA TAAAGACAAT ATCCTGGAAG GATTATAACA AGGATTTGGA TGACGGGCCG
GGGATGGAGT ATACCTATGA TGAAATGAAC AGGCTTACGC AAATAATAGA CCCGGAAGGG
AATGTGATAA AGAAATACAT ATACGATGAA GACGGAAGAA TCGTGAAAGA AATAGATGCA
AAAGGATATA GCAGTGCAGA TAACGATGAA GAACGTTGGG GGACAATATA TAAATACAAC
CTTGCCGGAT GGCTTGTTGA GAAGAGGACA CCGTTACAGC AGAAAAATGG TGAAATATAT
TACAACATAA TAGAATATGT GTATGACAGA AACGGAAGGG TAGTACAGGA GAAAAGATCT
CCGGAATATG TGACCAGGAC AGGATATCCA AAGAAATGGA ACATAATAAA CTATAAATAT
GATCCCAACG GAAATCTGAT AGAGGTAACC GACAGCCTTG GAGCAGTGAT AACCTATGAA
TATGACTGCT TTGGCAAGAG AACACTGGAG AGAATGAAAA TAAACGACAG GAAGCAAAGA
GTAATCAGAT ATGAATATAA TGGAGTGGGA AAATTAACGA GAGTAATACG TGAATTGGAC
GGAGAAGACC TTTCAGGATA CAGTGAAGAT AAGGTTTTGG CGGAGACAAT ATACAATTAC
GACCCAAACG GCAATCTTAT AGAGGTGATT TCTCCTGAAG GGTATTTGAC TGTATTTAAA
TACGATGATG CAAACCGTAG GATAAAGAGT ATATTGTATC AACCGCAGAA CGGTGTGAAA
CTGAGCGGCA GTGCGTATTG TGCCCTTTTA AATACAAAGT CGAGGAGCAT AAGTTATGAG
TATGATCGGG CAGGGAACCT TGTAAGAGAG ATATTGCCCA ACGGTGGCGT CATAATAAAC
GAATATGATG AAATGAACAG AAGAATAAGA GTTACCGACC CTGACGGAAA CACCAGAAGG
ATTTTCTATG ACAATTCAGG GAACGTCGTA AAATATGTTA ATCCGGAGAA TTACGATCCG
GAGAAAGATG ACGGAACAGG TACCACATAC CTTTATGACT CAATGAACCG TCTTATAGAA
ATAGTAAATG CAGCGGGTAT AGTAGTGGAA AGGAATATAT ACAACACAGC GGGAGAGATA
ATCAAGAGGA TAGACTCAGT TGGTTATAGT TCCGCAGATA ATGACAATGA CAGGCATGGA
GTTGAATTTA GTTATGACCT GGCGGGACGT TTGGTGGAGA TAACAACGCC GGAAGCGAAG
ATTCATGGCC GAAAGAGTCA GCAATACACT TATGACGCAG AAGGAAACAT AACAGGAGTA
GTTGACGGAA ACGGAAACAG CACAAGGTAC AGTTTGGACT TATGGGGTAA GATAATAAAC
ATAACGGAAC CTGACGGAAC CAATATAAAA TACGATTATG ACTATGCGGG AAATCTTGTA
TCCACTACTG ACGGTAACGG AAACACTACC CGTTATACAT ACAACAGCTT GAACCTTCTG
TCGGAGATAA TAAATCCTGA CGGAAGGAAA ATAACCTTCA AGTATGACAG ACAGGGAAGA
GTGGTGCAGA GGATAGGGAA AGACGGACGC AGTACATATT ACACCTACAA TGCGGATAAC
AATATAACCG GCCGTTGGGA AGAAGAAGGA CAGATGGAGA GGTACGAGTA TAATATTGAC
GGAAGTCTGG CTGCGTCAAT AAGCGGTACC ACTATACATA CTTATACCTA TACCTTGGCA
GGGAGACTGA AAAGCAAGAC GACTAACGGG CGGAAAGTAT TGGAGTATGA TTACAATAAA
AATGGGCTTG TATCGAAAGT TACCGATATA AGTGGAACTC CGGTGGAGTA TACGTATGAC
GTACTGGGTA GGTTAACTGC GGTAACAAAC GGAGGAAAGA TTTCCGCAAA ATATGAATAC
AATATTGACA ACACGATAGC GCAGGTGCTG TACGGAAGCG GAGTATGCGC AAGATATGAA
TACGACATGG ATAAGAATAT AACGAGGCTT TTAAATATAG ATCCGACAGG AAAAGAGATG
TTTGCGTACA GGTATGCGTA TGACGGGAAC GGCAACCAGA TTTTGAAGGA AGAGAACGAG
AAAGTAACGG CCTACAGTTA TGATGCGCTG AACCGTTTAA AAGAAGTAAT ATACCCGGGG
GATATAAGAG AGAGGTTTGA ATATGATGCG AACGGCAACA GGCTCAAGAG AGAATATGGA
GACATATTGG AGCAATATGA GTATGATAAT TGCAATAGAT TAATTCAAAA GATAAAGAAC
GGCTTAGTAA CTGAGTATGA ATATGACGAA AGAGGAAATC TCATAAGAGA AAGAGAAGGA
GAGTTAAGCA AGCTATACAG CTATGACGGG TTTAACCGAT TGGTACGTGT ACAGAATCCG
GACGGAACAT ATATGGAAAA CATATATGAT GCCGAGAATT TGAGAACGAT ATCAATAGAA
AACGGAAAAT ACAACAGATA TATATACAAC GGAAGGGACA TAGCATGCGA AGTAGGCGAG
GATTGGAGCT TGAAAGACAG GGTGGTTCGA GGACATACGA TATTGCAGAA AGAGGACGGC
AATAAGAATG CTTACTATTA TATCCACAAT GTCCATGGAG ACATTACTGC TCTTACCGAT
GGTAGAGGAG AGATAGTAAA CAACTACAGC TACGATGCGT TTGGAAATAT ATTAGATAGT
GTTGAAAAGG TAGAGAACAG ATTCAAGTAT TCGGGAGAAG TGCTTGATCC CCTGACGGGA
CAATACTACC TGAGGGCAAG ATACTATAAC CCAAGTATAG GAAGGTTTAT GCAGGAAGAC
ACGTTTAGGG GTGATGGACT TAACTTATAT ACCTATGTTG CTAATAATCC GATAAAATAC
GTTGACCCAA CCGGACATTG CAAAGAAGGA ATTGAGTTTA CTGATTCAAA CAGCATTATT
TTAGATAAAG CGCAGACGGG AAATGGTACA GGAAATAATC CTATAAATGG AATTGATACT
ACTAGTAGTA CGTTTGATGT AATGATTTCG TATATAGCAG GAAAGAATGG TGGCACAGCT
GTCTATTCTG AAAAGAGTAT ATTTGGAATA ATTGTATGGG ATAAAAAAGT AACAGTCACT
ATTGGTGATA CTACTAAAGT ATATACTATC GGTAAAAAGG ACTCTGTTGT GAGAATTGTT
AATGGAAAAG CTGTAATTAA TAATAGTGTG TTAATGAATG ATTTTGGATT AGATGAAGTT
TCTTCGACTC ATCAAGTAGG GGATGCTTTC GATAGTGCAG ACCATGCAGC AATGGCATGG
GGATTTATGT ACAATGGAAA GTCTATAACG GACAATATAG AATATGCATC AGCAATATAT
GAGGATGGAG GTAAATACAA ATACACTGCT ATAATTGGAG GACAAAATGC TAACGTGGGA
GTTCCAACTG CACCTTACGG GAAAACACTG AGTGCTATAA TACATGCACA TGCGGCGTAC
GATAAAAGGT ATGATAATGA GAATTTTTCC GGCCAGGATA AAAGTGTAGC AAAATTTTTT
GAAGTTCCTG TATATGTTAC TACACCGGGA GGAAAATTGA AAAAGTATGA TCCTGATATT
GATAAGACAA CAACTATTAA CAAGGATATG CCAAAAGACC CGAATTCACC ATGA
 
Protein sequence
MQIKITPEEM TRVAERIRQL SDKFDDMAQD IKRIISSIDW ELRSREGVDQ KASLARAAAI 
NIATGLKEMS QDLIEARDQM IEADNKASAA ARKMKTADFI SRIASAILTG NGIASAIVAG
YSLWNRLIGP GTANCPNTFA GDPVNVVSGN FYLKRRDITI PSRGMALEIT RYYNSMDNTE
GIFGKGWKID YETCLKKKED SEDIIVAYPD GNIRVFEYTD TGSFKSPKGV YDTLLKTEDG
TYILKVQKGI TYKYDQAGNL ISISDSNSNE IQFKYNREGL LSSVMSPGGK LLMFSYEGGR
IVSITDHTGR NLKYKYDEKG NLTQVVYPDG GKITYAYDNI GLISITDQNG NTYVQNTYDE
KGRVVRQLDH ENNELIIEYD EENRENTFKW IKSGITRVYK YNTDMLLTEI RYDDGSVQKY
TYDENLNRNS ETDRNGNTTY KKYDDKGNLI EVISPEPFCY KTKYSYNEEG RLIKVVSPGG
GEVSFEYDER GNLLKRIVKT GSRSYSEWAY TYDQYGRMTT SKDAENNTKT FEYGEEDVNK
PTLIKDAVGN IFKYEFDKVG RVVATTTHYG TVRMKYNECD RITHITDTEG NTTRICYDKA
GNMTKVIAPK QYAEKGENGA GYAFEYNAMD KLIRTIDPLG NVFAVKYDEN GNKIKEINPN
YYSSEKDDGI GIEYKYDTNH RRINTIFPDG SMSRIKYDAE GNIIKTISWK DYNKDLDDGP
GMEYTYDEMN RLTQIIDPEG NVIKKYIYDE DGRIVKEIDA KGYSSADNDE ERWGTIYKYN
LAGWLVEKRT PLQQKNGEIY YNIIEYVYDR NGRVVQEKRS PEYVTRTGYP KKWNIINYKY
DPNGNLIEVT DSLGAVITYE YDCFGKRTLE RMKINDRKQR VIRYEYNGVG KLTRVIRELD
GEDLSGYSED KVLAETIYNY DPNGNLIEVI SPEGYLTVFK YDDANRRIKS ILYQPQNGVK
LSGSAYCALL NTKSRSISYE YDRAGNLVRE ILPNGGVIIN EYDEMNRRIR VTDPDGNTRR
IFYDNSGNVV KYVNPENYDP EKDDGTGTTY LYDSMNRLIE IVNAAGIVVE RNIYNTAGEI
IKRIDSVGYS SADNDNDRHG VEFSYDLAGR LVEITTPEAK IHGRKSQQYT YDAEGNITGV
VDGNGNSTRY SLDLWGKIIN ITEPDGTNIK YDYDYAGNLV STTDGNGNTT RYTYNSLNLL
SEIINPDGRK ITFKYDRQGR VVQRIGKDGR STYYTYNADN NITGRWEEEG QMERYEYNID
GSLAASISGT TIHTYTYTLA GRLKSKTTNG RKVLEYDYNK NGLVSKVTDI SGTPVEYTYD
VLGRLTAVTN GGKISAKYEY NIDNTIAQVL YGSGVCARYE YDMDKNITRL LNIDPTGKEM
FAYRYAYDGN GNQILKEENE KVTAYSYDAL NRLKEVIYPG DIRERFEYDA NGNRLKREYG
DILEQYEYDN CNRLIQKIKN GLVTEYEYDE RGNLIREREG ELSKLYSYDG FNRLVRVQNP
DGTYMENIYD AENLRTISIE NGKYNRYIYN GRDIACEVGE DWSLKDRVVR GHTILQKEDG
NKNAYYYIHN VHGDITALTD GRGEIVNNYS YDAFGNILDS VEKVENRFKY SGEVLDPLTG
QYYLRARYYN PSIGRFMQED TFRGDGLNLY TYVANNPIKY VDPTGHCKEG IEFTDSNSII
LDKAQTGNGT GNNPINGIDT TSSTFDVMIS YIAGKNGGTA VYSEKSIFGI IVWDKKVTVT
IGDTTKVYTI GKKDSVVRIV NGKAVINNSV LMNDFGLDEV SSTHQVGDAF DSADHAAMAW
GFMYNGKSIT DNIEYASAIY EDGGKYKYTA IIGGQNANVG VPTAPYGKTL SAIIHAHAAY
DKRYDNENFS GQDKSVAKFF EVPVYVTTPG GKLKKYDPDI DKTTTINKDM PKDPNSP
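
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal example, not the database's own method: it hard-codes the standard codon assignments (translation table 11 uses these for elongation and differs mainly in which start codons are permitted) and translates only the first and last lines of the gene sequence as printed in this record. The `translate` function and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order per position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGCAGATAA AAATAACTCC CGAAGAGATG ACAAGGGTAG CTGAGAGAAT AAGGCAATTA"
LAST_LINE = "GATAAGACAA CAACTATTAA CAAGGATATG CCAAAAGACC CGAATTCACC ATGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # compare with the start of the protein sequence
    print(translate(LAST_LINE))   # compare with the end of the protein sequence
```

Each full line of the gene sequence holds 60 nucleotides, so every line starts in frame and can be translated independently. For a whole-sequence check, an established library such as Biopython would be a more robust choice than this hand-written table.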