Gene Cthe_0327 details

Gene Information

Locus tag: Cthe_0327
Symbol:
ID: 4808476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 413675
End bp: 416248
Gene Length: 2574 bp
Protein Length: 857 aa
Translation table: 11
GC content: 40%
IMG OID: 640105741
Product: S-layer-like domain-containing protein
Protein accession: YP_001036758
Protein GI: 125972848
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
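
The start, end, gene length, and protein length values in this record are consistent with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions match the numbers shown here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(413675, 416248)
print(length)                     # 2574
print(protein_length_aa(length))  # 857
```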


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA TTTGGGGCAG AAAAGTTTTG GGATTTGCGG TTGGTGTATG CTTACTTTTA 
ACTATGATTT GCCAAAATGT GGCTTTTGTA TCGGCAGAAC CTGAGGAAAA AACTGTTGTC
GTAAATTTCG AAGAAAGCTT GAATGAAACT ATGTCAAAAA CCATTGAAAT ACTAAATCTT
TTTGACATTT CCGAGATTGT TGTCGATAGC GGAAAAGTGT CTTACAGCAG AGAAGGAGAC
AAAGTAACGG TTACTGTTTC GGAAGGCGTT TATAGAGTAG GCCCGCATAC ACAAAATGTT
ACTTTAACTA TAGAGGATGA TAAAGGTATT TTTGATGAGA AAACATCCTA TAATGTTGAC
GGCTATGAAG GAATCCTTAC GAAAATAGAT TCTGGTTATG ATGAGGCAAA AGGACTGTAC
TGGGTAAAAT ATCAAGGTAG TGTAACTAAA GAAAACTGCA AATTTTACCA ATATACAGTA
ACAATCAAAT ATACGGAAAA TGTTCGTCCA GAAGTGTATC TTACTGAGCC GAAACGAGGC
ATGATTACTA ATGGTAAGAT TAATGTTCAG GGTTATGTCA GAGATGAAAA CATAGGTGAT
GAGCTGAAAT TGTTCTTTAG CTTTGACAGT TACGATGAAA GCATGACAGG AAATCTTCTG
AATGAGAATT CCATTATATC CGACGGAACC TGGCAGGAGA TAAGCGGAAC TATAGATTTA
TCACCTTTCA ATCTTAAAGA CGGCGATCAC GACTTTTATT TTTGGGCAGT GGATAAAAGG
GGAGTAAGAT CGGTTGGCGA GATCATAAGG TTTACGCTTG ATACAGTACC GCCTGAAGCG
CCTGTTCTTA CTCCGGATAA AACAGAGTCT ACGAATCAGA GTGTTGTAGT GTCAGTATAT
TATCCACCGG ATGCTGTAGG CAGGGAGATA AAGATAAATG ACGGACCTTG GATGCCGATA
ACCGATATAA CCAAAAATGA TCAGATAATA ATGGATGAGA ACGGCAAGAT AGAGGCAAGG
GCAATTGATG AAGCGGGAAA TATTTCAGAG GTGGCGGAAC TTGAGATAAA GAATATTGAC
AAAATTCCTC CGACAGCACC GACAATCAAT ACAAGTGCTG ATGAAACTAC AGAGCAACCG
ATTAAGGCGA CGATAGTGCC GGGAGTTGAT AATGAGTCAG GTGTGGATCG AACCGAATAT
TGTTTAAGAG GAGCAAGTAC AAAAGATTGG GAGAAATACG ATGAAGGAAC CGAAATAACA
ATAACTGCGT TGGGAGAAAC AGAAATTTGT GCAAGAACAA TTGACAATGC CGGAAACATC
TCCGCTGAAA CAGTTAAGAA AGTTACAATA AAGAAGAAAG AGGACAGTGG CGGTAACAAC
GGTGGAAGTG GCGGCACAGG CGGAAACAGC GGTAACAACG GTGGCAGCGG TGGCACAGGC
GGAAGCGGTA GTAGCGGTGG AAGCGGAAGT AACAGTGGTG GCGGAAATAA TGACGGAAAT
GGCAACGACG GGAAAAAAGA CGATGAAATA CTGCAGCCCG AACCCAATAT TCCAGGCGCA
GGAGGCAGTC CTGTGGATTT GTCCGTGTTT ATAAGTGCGG ATAAATCAAA ATATGAAGAA
GGTGAAGTAA TTACTTTCAA TATTACATAC AAAAACAAAA CCAATGTTCA GGCAAACAAC
GTTATTGTGA AAGCAGGAAT ACCGGCAAAC ACAACTGTTG AGGATATAGC CGGAGGTACT
CAAAATGGAA ATGACATTGA ATGGAAAATT GAATCGCTTA AAGCAAACTC TTCAGGCAAG
ATTCAATACA AAGTCAAGGT GAATTTGCTT GAGGTGCCGG AAATAAGTTC TTCTGCTACT
GCTTCAATAA CTGCAAGTGG AACTCTTATT AACAAGGATG ACGATGAATC AAGAACTATA
TTCCTTCTTT ATTCGAACCG TTTTGGTGAA AACTTTCACG GCAAATATAT TACAGGCTAT
GAGGACAATA CATTCAGACC GTTGAATAAT ATAACAAGAG CTGAAGTGGC AACAATTATG
ACCAACATTT TGGGATTGAA GCAGGAGGTT GCAGGAGGCA AAACATATAC AGATTTGTCA
AAGAGCCATT GGGCATATAA CAATATAATT GCGGTAACCG AAAAAGGTTT GTTCACAGGA
TATGAAGACG GTTCGTTCCG TCCGGACAAC TTTATCACAA GGGCGGAATT TGCTACGGTG
CTGGCTAATT ATTTGGGACT TAAGAATGTT GAGCATGATG AGTTGAACTT TGCGGATATC
GAAAATCACT GGGCTAAGAA CTTTATAGAG GAAATATACA GAGTAAGATT GATAGAAGGT
TATCTGGAAA ATGGCTTAAG ACTGTTTAAG CCTGACAACT ACATAACCAG AAGTGAAGCG
GTGACAATAA TAAACAAGAT GCTGTTCAGA GGTCCGCTTG AAGGAGCAAA GGTGCCGTTT
ACCGATGTTG AGGAAGGATA CTGGGCTTAC GGACATATAT TGGAAAGCTC TATAGATCAT
TACTACGTAA GAAATAAAGA TCAGAGCGAA ACAATAGTAA ACAAGAAACA GTAA
 
Protein sequence
MKKIWGRKVL GFAVGVCLLL TMICQNVAFV SAEPEEKTVV VNFEESLNET MSKTIEILNL 
FDISEIVVDS GKVSYSREGD KVTVTVSEGV YRVGPHTQNV TLTIEDDKGI FDEKTSYNVD
GYEGILTKID SGYDEAKGLY WVKYQGSVTK ENCKFYQYTV TIKYTENVRP EVYLTEPKRG
MITNGKINVQ GYVRDENIGD ELKLFFSFDS YDESMTGNLL NENSIISDGT WQEISGTIDL
SPFNLKDGDH DFYFWAVDKR GVRSVGEIIR FTLDTVPPEA PVLTPDKTES TNQSVVVSVY
YPPDAVGREI KINDGPWMPI TDITKNDQII MDENGKIEAR AIDEAGNISE VAELEIKNID
KIPPTAPTIN TSADETTEQP IKATIVPGVD NESGVDRTEY CLRGASTKDW EKYDEGTEIT
ITALGETEIC ARTIDNAGNI SAETVKKVTI KKKEDSGGNN GGSGGTGGNS GNNGGSGGTG
GSGSSGGSGS NSGGGNNDGN GNDGKKDDEI LQPEPNIPGA GGSPVDLSVF ISADKSKYEE
GEVITFNITY KNKTNVQANN VIVKAGIPAN TTVEDIAGGT QNGNDIEWKI ESLKANSSGK
IQYKVKVNLL EVPEISSSAT ASITASGTLI NKDDDESRTI FLLYSNRFGE NFHGKYITGY
EDNTFRPLNN ITRAEVATIM TNILGLKQEV AGGKTYTDLS KSHWAYNNII AVTEKGLFTG
YEDGSFRPDN FITRAEFATV LANYLGLKNV EHDELNFADI ENHWAKNFIE EIYRVRLIEG
YLENGLRLFK PDNYITRSEA VTIINKMLFR GPLEGAKVPF TDVEEGYWAY GHILESSIDH
YYVRNKDQSE TIVNKKQ
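
Both sequences above are displayed in blocks of 10 characters separated by spaces, so they need cleaning before reuse. The following is a minimal sketch, not the database's own tooling, that strips the whitespace and translates the first row of the gene sequence. It uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAGAAAA TTTGGGGCAG AAAAGTTTTG GGATTTGCGG TTGGTGTATG CTTACTTTTA"
print(translate(first_row))  # MKKIWGRKVLGFAVGVCLLL
```

The printed residues match the first 20 residues of the protein sequence listed above (MKKIWGRKVL GFAVGVCLLL).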