Gene Cthe_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3171 
Symbol 
ID4809621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3746814 
End bp3749261 
Gene Length2448 bp 
Protein Length815 aa 
Translation table11 
GC content40% 
IMG OID640108604 
ProductS-layer-like domain-containing protein 
Protein accessionYP_001039559 
Protein GI125975649 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.412777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA TTAAGATTGT GAATATTCTG ATACTTTTGG TTCTTTTAAT CAGTTTTAAT 
TTGGAGACAG TCAATGTGTT TGCCGGTCCA AATGATATTA TAATTACAAA GGTAGTGCAA
AAAGAGGAGA ATGTTTCAGC CGGTAAAAGT TTTAAACTCG AGGTTTACTA TAAAAATGTG
TTGGGTGTAC CTTTGAAAGA TGTTTATATT TCCGTAGATA AAAGTTCTTC TTTTTATATT
GATAATGATC ATTATCAAAC CGAGTACCTG AAAGACATGG CTGTTGGGGA CGGAGAAGAA
CCTATAATAT TATATCTGGT CTATAAAGGT ACAGGAAACG AATTAACTTT GATATTTGAT
TATTTAAAAG AAGGTGCAAC CGATCGAGAA CAACTTTCAC AAACCCTGTT TCTTAGCGTT
AAAAAAGAAA AAGAACAAAC TTCCGGTGGT TCACAAACCA ACACGGCCGA ATACAAGCCG
AATTTCAGAA TTGTGGGAAA GATAAGCAGC AAACAGGAAG GAAAAAATGT TTCTGTTGAA
TTTCCCATTA AAAATGTGTC CAATTTTACC GCCAAAGACA TTCAAATAAC CATGTCAGCC
GATTCGGCGG ACTCACCTTT TTCGGCGCCG ATGGGGCATC TTTCCGTATC CGTTGACGAG
ATTAAACCCG ATGCCGAAAA AAAGATAAAG CTTGACCTTG CTGTAAAACC GAATACCAAA
AGCGGTATAT ATCCCTTGAA ACTTGAGTTT AAATACGGCA ATTTGTATGG TGATTCATTC
TCTTCATCGG AAGTTATATA TGTTGACATT GAAAACAATG ATAAAAGCCC AAGCCTCATT
TTAAAAGGTG TGGAAATGCT TCCCCAAAAA CCTGCACCGG GGGACAGGTT CAGTGCTTCC
ATAGAACTTG AAAACCTCGG AACTCTTGGA GCAAAAGACG TAAAGGTCAC CTTGAAGGGA
TTAACGGTTG ACGGGATTTA TTCTGAGCTT GTGGGAGTCA ATTATTTGAA AACCATTGAG
GGAGGCAGGA CGGGCAAGCT GAATTTCAGC CTTGTTGCGT CCAATAAAAT AAATGTTCAA
AGTTTTCCGC TTGAAATAGC CGTTGACTAT AAAGACGAAT TTGGCAATTC ATATGCCGAA
AGCTTTATAT ATTATGTGCC CATAAAGCAA AAAAGCGAAG GAAAAGCTTC ATTGAAGATT
GACAACATTA CTTCTCCGGC GACAGTTGTT GCACCGGATG AGGATTTTAA AGTTGGCTTT
GATATTGTGA ATGACGGAAC AAAAGAACTT TCCGACTTAA AGGTGTCGGT AACGGCTGAA
AACGGCATTA TCTGCAAATC ACAAAGCATA ACTGTTGTTG ATTCCTTGAA AGTTGGGGAA
AAGAAAAGCT TTGAGTTTCT TTTCACTGCC TTGACCGATG CAGTCACTAA AAACTACCCT
ATTGCCATAA ATGTGGAATA CGATGATGAA GGTTCTTCGG GAGGAACAAA AGAAAAGCGG
ACTGTTACAC AATATGTTGG AGTTTATGTT GAAAATCCAA AGGAAGAGGA GAAAAAAGAA
AATACTTCCA CTCCGAGGCT TATAATTGAC CGGTACAGTA TATCCACGGG GCAGGCGATA
GCGGGAAAGA GCTTTGAAAT TGAGCTTGGC ATATTGAACA CCCATAAAAA TATGAATGTT
GAAAATATTG CTGTTTCTTT CCTTGCGGAT GAAGGAGTGT TTTTGCCGGC GGCGGAAAGC
GGCAGCACCA TATTCATTGA CCAAATAAAA GCCGGAGAAA GAGTTGTTAA AAAGATGACT
TTTGCGACCA AATATGATGC TGTGCCAAAG AGTTATTTGC TCAATATAAA CTTTGAATAC
GAAGACGAAC AGAACAAGGC ATATACCTTG AAAGAAAGCA TCAGCATACC TGTTATTCAG
GAACAGAGGC TTGAGATAAG TGAAATACAG ACGGGAATGG ATGCAGTTGT GGGACAGCCG
GTTTCTGTAA ATTTGAATTT TTATAACATG GGCAAGTCAA CATTGAACAA TCTTATGGTA
AGGTGCAAGG GAGATTTTGA GCTGCAGCCC AGTTCAGAAT ATTTTGCGGG TAATTTTGAA
CCCGGCAGAA GCGACTATTA TGAAGCATAT ATTGTACCCA ACAAGGAAGG ACAGGTAAAG
GGAAGCATTA TTTTCACATT TGAAGATAAC AACGGCGAGG TTAAAGAGAT TGAGAAAGAG
TTCGAGATTT TTGTCCAGGG TCAGCCTTCA GTAATGAAAG GTGATGTTAC GATAGTGGAG
CCCGGCATGG CGGAAGCGGG AATGAAGTTT GGAAAAGCAG GTTTTCCCGT GCGCAGACTG
TTAATCCTTG CAGGTGTTTC AGTGCCGGTG ATAGCAGGTG TTGTGGTTCT CATAATAATT
CTGGCAAAAA GAAAGAAAGC GAGAGCTGAT TTGTATGAGA ATATCTGA
 
Protein sequence
MSKIKIVNIL ILLVLLISFN LETVNVFAGP NDIIITKVVQ KEENVSAGKS FKLEVYYKNV 
LGVPLKDVYI SVDKSSSFYI DNDHYQTEYL KDMAVGDGEE PIILYLVYKG TGNELTLIFD
YLKEGATDRE QLSQTLFLSV KKEKEQTSGG SQTNTAEYKP NFRIVGKISS KQEGKNVSVE
FPIKNVSNFT AKDIQITMSA DSADSPFSAP MGHLSVSVDE IKPDAEKKIK LDLAVKPNTK
SGIYPLKLEF KYGNLYGDSF SSSEVIYVDI ENNDKSPSLI LKGVEMLPQK PAPGDRFSAS
IELENLGTLG AKDVKVTLKG LTVDGIYSEL VGVNYLKTIE GGRTGKLNFS LVASNKINVQ
SFPLEIAVDY KDEFGNSYAE SFIYYVPIKQ KSEGKASLKI DNITSPATVV APDEDFKVGF
DIVNDGTKEL SDLKVSVTAE NGIICKSQSI TVVDSLKVGE KKSFEFLFTA LTDAVTKNYP
IAINVEYDDE GSSGGTKEKR TVTQYVGVYV ENPKEEEKKE NTSTPRLIID RYSISTGQAI
AGKSFEIELG ILNTHKNMNV ENIAVSFLAD EGVFLPAAES GSTIFIDQIK AGERVVKKMT
FATKYDAVPK SYLLNINFEY EDEQNKAYTL KESISIPVIQ EQRLEISEIQ TGMDAVVGQP
VSVNLNFYNM GKSTLNNLMV RCKGDFELQP SSEYFAGNFE PGRSDYYEAY IVPNKEGQVK
GSIIFTFEDN NGEVKEIEKE FEIFVQGQPS VMKGDVTIVE PGMAEAGMKF GKAGFPVRRL
LILAGVSVPV IAGVVVLIII LAKRKKARAD LYENI