Gene Cthe_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1371 
Symbol 
ID4809366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1664451 
End bp1670330 
Gene Length5880 bp 
Protein Length1959 aa 
Translation table11 
GC content40% 
IMG OID640106795 
ProductYD repeat-containing protein 
Protein accessionYP_001037796 
Protein GI125973886 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAA AAGTATTGCC GGAAAAGATG ACTGAAATTG CGAGCAGTTT AAAGCGCTTG 
TCCGATGAAT TTGATGGTAT AATACGGGAT ATAAACAGTA TAGTAAAGTC AATTGACTGG
GAACTGAGAA GCAAAGAAGG GGTAGATCAG AAAGCGTCGG ATGCGATTAG AGTTGCAAAG
AAAATATCCG GAAGTCTTGA GTCAATGGCA AAAGACCTTG AGTTTGCCCG AGACAGAATG
ATTGAAGAAG ATAAAAAAGC ATCAAACATT GCCGGGAAAA TGAAGAGTGC GGTCATCGGA
GCTTCTGTGG CTACAGCTTC CGCCGGAAAT GCAAAAGTTG CGGATGTGAA TTATACATCG
GATTACAAAA ACTTAGGTCC CGGACGAGCA AACTGCCCGA ATACCTTTGT CGGTGACCCT
GTAAACGTAA CAACGGGAAA CTTCTATGTG ACTAAAAAGG ATATCCAAAT CCCCACAAGA
GGTATTCCCC TTGAAATAAG GCGATATTAC AATTCCATTG ACCAAACCTG CGGAATTTTG
GGAAAGGGCT GGAGAATAGG ATATGAAACC GGCCTTATGA GAGTGGAAGA CAGTCAGGAT
ATCCTGACGG TGTATCCTGA CGGAAGCATA GGCGTATTTG AGTTTGATGA AAAAGACAAT
AAATATATTG CCCCACGGGG AATTTTTGAT ATACTACAAA AGAACGAAGA TGAAAGCTTT
ACGTTGAAAC TTCATGACGG AACTACCTAC AGATATGACA AATCCGGAAA TCTCACATCA
ATAAGCGACC TTAACGGTAA TGCAATTTCC ATAAAATATA ACGGCCAAGG TATGATTTCA
TCGGTAATAT CACCGGGCGG AAAATTTCTG GATTTTTCAT ATGAAGATTT AAAACTGAGA
AAAATAACTG ACCATACCGG TCGGGAGATA ATTTACTCCT ATGATAAATC CGGCAACCTT
ATTCAGGTAA AATATCCTGA CGGAGGAATC ATTAAATACG GATATGACAA TAAAGGCATG
ATATCTATCA CCGACCAGAA CGGAAACACA TATGTCCAAA ATACATACGA CGAAGCAGGC
CGGGTTGTAA AACAACTTGA CCATGAAGGA AACGAGCTGG TTATAGAATA CTGCCCGGGA
GAGTGCAAGA ATATATTTAA ATGGCAGAAA AGCGGTATAA CCCGCATATA CAAATACAAC
GAGGAAAAGT TGCTGACGGA AATCATATAT GATGACGGTA CCAAAGAAGT ATACACCTAT
GATGAGGATA AAAACAGAAA CAGCATAACC GACAGAAATG GCAGGACAAG AAGGTTTAAG
TATGATGAAA GAGGGAATCT CATAGCCGAA ATAATGCCGG AACCTTTCAA CTATACCGTA
CGATACAGCT ATGACGGAAA CAACAGAAGG ACAAAAATCA GCACTCCGGC AGGCGGTCTT
GCAAGATTTG AGTATGACGA AAAAGGAAAT CTCTTAAAAC GTATTGTGAA AATAAACAGC
AATACCAATT CCGAAACTGT GTATACATAT GATGAATTCG GAAGAGTTTT GACAATTACC
GATGCCGAGA ACAACACCAC ATCCTTTGAA TATAATGATG ACGATATAAA CAAGCCCTCT
GCAATAATTG ACCCGGAAGG AAACAGATTT ACCTATGATT TTGATGCCAT TGGAAGAGTT
GTGGCGATAA CAACGGGTTA TGGAACTGTA AAAATAGAAT ATAATGAGCG TGATAAGATA
ACAGGTTTGA TTGATGCGGA AAATAACAAG ATCCGAATAA AATATGATGC CGTCGGAAAC
ATGGTTGAAG TTATTGCTCC GGAACAATAT GCGCAAAAAG GCGATAAAGC TCAAAGCTAT
ACCTTTGCTT ATGATGCAAT GGACCGTATG ATAAAGCAGA TTGACCCCTT GGGAAACGTA
TTTATGGTAA AATACGATGA GCACGGAAAT AAAATCAAGG AGGTCAATCC CAATTACTAC
AATGAAGAGG AAGACGACGG CATAGGAATG GTCTATAAAT ATGACTCCAG CCATAGGCAA
ATAAACACAA TTTTCCCTGA CGGAAGCAAA TCCAGAATAA AGTATGACCC TCAAGGGAAT
ATAATAAAAA CCATATTGCC GGGCGATTAT AACGAAGAAA CCGATGACGG TCCGGGAATG
CAGTTTACGT ATGATGAAAT GGACCGGCTT GACAAGATCA TTGACCCGAA TGGAAATGTA
ATTGCAAAAT ATTTGTATGA TGAGGACGGC AGAGTAATAA AGGAAATAAA TGCAAAAGGG
TACAGTAGCG CGGACAATGA TGAAGAACGC TGGGGCACAT TGTACAAATA TAATCTTGCC
GGCTGGCTTG TTGAAAAAAG GACTCCCCTT GAAAGCATAA ACGGGCAAAT CTTTTACAAT
GTAACTGAAA ATGTCTATGA CAGAAACGGC CGGCTTGTAC AGCAAAAAAT TTCTCCTGAA
TATGTGACCA AAACCGGCTA TCCAAAGACA TGGAATATCA TAAGCTATAA GTATGACAAA
AACGGAAGAA TTATAGAAGT ATCGGACAGC ATTGGTTCAT GTGTGCAGTA TGAGTACGAC
TGTCTTGGTA AGAAGACTTT GGAAAAGAAA AAAATAAATG ACAATACATA TAAAATAACC
AGATTTGAAT ACAGCAGTGC CGGAAGACTC AAAAAAGTAA TTGAAGAGTA TGATGGAAAA
GATTTGGCGG GCGAAGAAAA AGGAACCGTG AAAGCAGAGA CCTTGTTTGA GTATGACAGA
AACGGCAATA TTACAGCGGT TATTTCCCCG GAAGGGTACA GAAAAAGATT CATTTATGAT
GCGGCAAACA GAAGAATAGG TGTTGAAGAA TATTTGCCTC CGGAAGGAGT GAAAATATCA
GGAAATGCCG TCAATGCACT GTTAAAAACT ATAGTCAGAA AGACACTGTA CAGCTATGAT
AAAGCCGGAA ACCTTGTACA GCAGAGGCTT CCCAACGGAA GAACCGTTGA AACCGAATAT
GACGAAATGA ACAGAAGAAT CAGGATAAAA GATGCGGAGG GAAATATAAC AAGGCTGTTT
TATGATGCGT CAGGCAATTT AATAAAATAT GTGGAACCGG AAAATTACGA TCCAAAAATC
GATGACGGTC CGGGAACATC CTATTTTTAT GATTCAATGA ATCGGCTTTT ACAGGTGACA
AACGCGGCGG GTATTGTTGT GGAAAGAAAT ATATACAACA CGGCCGGAGA AATAATAAAG
AAAATTGACG CAAGAGGATA TCTGGCTGCA GCGAATGATA ATGACAGATA TGGTGTGGAA
TACGGATATG ATCCGGGAGG AAGACTTAGA TATATTACAA CGCCGGAGGC AAAAGCCAAA
GGAATTGTAA GCCAGCAATA CAACTATAAC TCTTTGGGGT ATATTACGGA AATTATCGAC
GGAAAAGGAA ACAAAACCGA ATACACTCTT GATTTATGGG GAAAAATCAG GGAAGTTCAC
GAGGCTACCG GTTCTGTTTT CAGATACGAA TATGACTATG CAGGAAATCT CACAGCAGTA
ATTGACGGAA ACGGAAATGT GACGCGGTAC AATTACAACA GTCTCAATAT TCTTTCAGAG
ATTATTGATC CACTGGGCGG CAGGATCTCT TATAAATATG ACAGGCAGGG AAGGATGGTT
TGGACTTCTG ACAGGGAAGG AAGGGTAACA CATTACAGAT ACAACTTTGA CGATAAACTT
GTGAGCATTT GGAGTGAAAA CGGAATATTT GAAAAATATG AATATAATCT TGACGGAAGT
CTTGCAGCTT CGATAAGTGA CAGGACGATT CATTCATATA CCTATACTCC GTCGGGAAGA
CTCAAGAAAA AGAACACAAA TGGTGTTACG GTTTTGGATT GCGAGTATGA CAAAAGCGGA
CGGGTGACAA AACTTACCGA TGTAAGCGGA AAAACGATAG AGTATACTTA TGATATTCTT
GGCAGACTCA CGAATGTAAT AAATGAAGGA AGAAAAACAG CTCAATACGA ATATAATCCG
GACAACACCA TATCCAGGAT TCTATATGGC AGCGGCGTTT TTGCAAAATA TGACTATGAC
ATGGATAAAA AGATAATTGG AATTTTGAAT GTTGATCCTT TTGGCCGGGA ACTCTTTAAC
GGAAAGTATT TCTATGATAC CAACGGTAGC CAAATAAAGA AGGAAGAAAA CGGCAGGGTA
ACGTTATACG GCTATGACAG CGTAAACAGA CTTGAGAAAG TTTCTTATCC TGAAGGAATC
GATGAGAAAT TTGCTTATGA TAATGCAGGA AACAGGATTG CCCGGGAATT TGGCAAGTTA
CTCGAGAATT ATAAATATGA CAAAAGCAAC AGATTGATAC AGAAAGTGTC CAATGGCATA
GTGACAGATT ATGAATATGA TGCCGGGGGC AACCTTGTAA AGGAAATTGA AGGAGAAAGT
GTAAGAAGAT TTGAGTATGA TGACTTTAAA AGGCTTGTAA AAGTAATAAA TCCTGACGGT
ACATATATGG AAAACATATA TGATGCCGAG GGAATGAGAG TGCAGACGGT AGAAAACGGT
GAATACCGAA GATTCATATT TGACGGAAAT AATGCAATAG CCGAGGTTGG AGAAGATTGG
AGTTTAAAGG GCAGGAACGT TAGAGGACAT GCATTGTTGG AATTGGAAGA CGAAAACAAT
AACACATATC ACTATTTGCA CAATGCACAT GGCGACGTGG CCAATCTTGC GGACAGTTCG
GGAAAGATTG TAAACACTTA TGATTATGAT GCTTTCGGTA ACACTTTAAG CGTAAAGGAA
ACAATACATA ACAGATTTCG CTATGCGGGA GAACAGTATG ATGATTTTAC AGGTCAGTAT
TATTTGAGGG CAAGGTTCTA TAGTCCGTCT TTAGGGCGGT TCACCCAGGA AGATACATGG
AGAGGTTTCA CATATAATCC TGCAAGTTTG AACCTATACA CATATGTTGA AAACAATCCT
GTAATGTTTG TCGACCCTAC AGGACATTGG CCCAAATTTA TTGACAATGC GTTGGATTGG
GTCGGTAATA AAGTCAATGA AGCTGCCGAC TGGGTGGGAA ACAGAGTAAA TGATGTTGTT
GACTGGGCAG GCGACAGAAT CAATGATGCA CGCAATTTCA TAACAAATAC GGCAACAGGC
GTTAAAAACT GGTGGGTTGA AAACAATGTC GGAGCATATG TTGTCGGAGG TTTGAAAATA
TTAGGCGGGA TTGCCATAGG TGCGGGAGCA ATTGCAGTTA CTGTTTTAAC CTGTGGAGCA
AGTGCACCGC TGACCGGTGC ATTGATTGGA GCGGGTATAA GTTTGGCGGC AACTTATGCA
ACGGATGTTG TCGGCAACTT TATGAAAAAT GACTGGAAAT GGTCCGCATC CAATTTCCTG
CCGTCATCCT CACCGGGTGA ATATATAGCA AATATGGTCT CGGGTGCTGT TGGTGGAATG
CTGCCGGGAG GTTTTGGCTT CTGGAAGAGT TTGGGGCTAT TGGGTGTGGA TGCTGTAATG
GCTGCAGGTA TATCCACGTT GATAGATTCA GGAATTGAAG ATATACGTAC AGGAAAGATA
AATGAAATTG ATATAGAAAA AGGCTTCGAG GACGCAATGC TGAATTTTGC CGGAAATATT
GTCGGAGCTG TAGCGGCAGG ACTTGTTCCG GATGTTTATG TACCTGCGTC AAGACAAAAA
GCAAGGTATA TATTGAGAAA AGCAAATCAA GCATACAATG GTCCTGCTGC CAGAACTTTT
ATTAAGAAAA TGCAATTCCG GAAACAAATA TTTGACTTAT TGGGAAATAT TATTGATGAG
CTGGTAGGTG GAATAACATC GGAAATAATT GAAGGACACG GTAAGTGTAC TGCAAATTAG
 
Protein sequence
MRIKVLPEKM TEIASSLKRL SDEFDGIIRD INSIVKSIDW ELRSKEGVDQ KASDAIRVAK 
KISGSLESMA KDLEFARDRM IEEDKKASNI AGKMKSAVIG ASVATASAGN AKVADVNYTS
DYKNLGPGRA NCPNTFVGDP VNVTTGNFYV TKKDIQIPTR GIPLEIRRYY NSIDQTCGIL
GKGWRIGYET GLMRVEDSQD ILTVYPDGSI GVFEFDEKDN KYIAPRGIFD ILQKNEDESF
TLKLHDGTTY RYDKSGNLTS ISDLNGNAIS IKYNGQGMIS SVISPGGKFL DFSYEDLKLR
KITDHTGREI IYSYDKSGNL IQVKYPDGGI IKYGYDNKGM ISITDQNGNT YVQNTYDEAG
RVVKQLDHEG NELVIEYCPG ECKNIFKWQK SGITRIYKYN EEKLLTEIIY DDGTKEVYTY
DEDKNRNSIT DRNGRTRRFK YDERGNLIAE IMPEPFNYTV RYSYDGNNRR TKISTPAGGL
ARFEYDEKGN LLKRIVKINS NTNSETVYTY DEFGRVLTIT DAENNTTSFE YNDDDINKPS
AIIDPEGNRF TYDFDAIGRV VAITTGYGTV KIEYNERDKI TGLIDAENNK IRIKYDAVGN
MVEVIAPEQY AQKGDKAQSY TFAYDAMDRM IKQIDPLGNV FMVKYDEHGN KIKEVNPNYY
NEEEDDGIGM VYKYDSSHRQ INTIFPDGSK SRIKYDPQGN IIKTILPGDY NEETDDGPGM
QFTYDEMDRL DKIIDPNGNV IAKYLYDEDG RVIKEINAKG YSSADNDEER WGTLYKYNLA
GWLVEKRTPL ESINGQIFYN VTENVYDRNG RLVQQKISPE YVTKTGYPKT WNIISYKYDK
NGRIIEVSDS IGSCVQYEYD CLGKKTLEKK KINDNTYKIT RFEYSSAGRL KKVIEEYDGK
DLAGEEKGTV KAETLFEYDR NGNITAVISP EGYRKRFIYD AANRRIGVEE YLPPEGVKIS
GNAVNALLKT IVRKTLYSYD KAGNLVQQRL PNGRTVETEY DEMNRRIRIK DAEGNITRLF
YDASGNLIKY VEPENYDPKI DDGPGTSYFY DSMNRLLQVT NAAGIVVERN IYNTAGEIIK
KIDARGYLAA ANDNDRYGVE YGYDPGGRLR YITTPEAKAK GIVSQQYNYN SLGYITEIID
GKGNKTEYTL DLWGKIREVH EATGSVFRYE YDYAGNLTAV IDGNGNVTRY NYNSLNILSE
IIDPLGGRIS YKYDRQGRMV WTSDREGRVT HYRYNFDDKL VSIWSENGIF EKYEYNLDGS
LAASISDRTI HSYTYTPSGR LKKKNTNGVT VLDCEYDKSG RVTKLTDVSG KTIEYTYDIL
GRLTNVINEG RKTAQYEYNP DNTISRILYG SGVFAKYDYD MDKKIIGILN VDPFGRELFN
GKYFYDTNGS QIKKEENGRV TLYGYDSVNR LEKVSYPEGI DEKFAYDNAG NRIAREFGKL
LENYKYDKSN RLIQKVSNGI VTDYEYDAGG NLVKEIEGES VRRFEYDDFK RLVKVINPDG
TYMENIYDAE GMRVQTVENG EYRRFIFDGN NAIAEVGEDW SLKGRNVRGH ALLELEDENN
NTYHYLHNAH GDVANLADSS GKIVNTYDYD AFGNTLSVKE TIHNRFRYAG EQYDDFTGQY
YLRARFYSPS LGRFTQEDTW RGFTYNPASL NLYTYVENNP VMFVDPTGHW PKFIDNALDW
VGNKVNEAAD WVGNRVNDVV DWAGDRINDA RNFITNTATG VKNWWVENNV GAYVVGGLKI
LGGIAIGAGA IAVTVLTCGA SAPLTGALIG AGISLAATYA TDVVGNFMKN DWKWSASNFL
PSSSPGEYIA NMVSGAVGGM LPGGFGFWKS LGLLGVDAVM AAGISTLIDS GIEDIRTGKI
NEIDIEKGFE DAMLNFAGNI VGAVAAGLVP DVYVPASRQK ARYILRKANQ AYNGPAARTF
IKKMQFRKQI FDLLGNIIDE LVGGITSEII EGHGKCTAN