Gene Cthe_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2402 
Symbol 
ID4811054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2868273 
End bp2869841 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content42% 
IMG OID640107815 
Productpeptidoglycan-binding LysM 
Protein accessionYP_001038797 
Protein GI125974887 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTTG AACTTGTTAG AGAGTCCACG AAAGTCAATT ACGTTGTTGG TGAGGAATTG 
GCGCAGACAA TTGTTGAACA TGATATAATT GTACCTGACG TTAATCCGGA CGTGGCAAGG
ATTCTCCTTA TTGACGGTGA GATAAGAGAA GGGGATTCTG AGGCTTCACA GGATAGAATA
CATGTGGACG GTACTATTTA TTACAAAATT CTCTATGTTT CCGACGATCC GGAACAGTCG
GTAAAAAGTA TAAACACTTC CAGTGATTTT TCATATACCG TTGATGTGGC AAACAGCCGC
GCGGGAATGA AAGCAAAAGT AAAATGCGAC ATAGAACATA TTGATTATGA GATACTGAAC
GGAAGAAAAA TCAACGCAAA GACCATTCTC AGGATTTACG CAAAGGCTGT AAATGACGTT
GAACAGGAAT TTGTAAGTGA TTTGAGAGGA ATTGAAGACA TACAGGTTTT AAAGGACAAC
GTGGATATAT ACTGTTATCT TGGAGAGAAC ACTGTTAATT GCACTTCCGA GGAAATGCTG
GAGGTTCCTG CGGGAAAACC GGCCATAAAA GAAATACTGA GAAACGATGT AAAGATAGTG
GGCAAAGACT ACAGAATTTC TGATGACAAG ATTATTGCCA AAGGTGACAT AAACATACTT
ACGTTGTACA TAGGGGACAA TGAGGAGAGA AGCATTCAGT TTATGGAGCA TGAAATCTCC
TTTACGCAGT TTATTGACCT TCCCGGCATA AGTGAAAGTT CCGAGTGCGA GGTGGATTAC
AGGATAAAGA ACGCAACCTT TATCCCTCAG GAGGACAGCG ACGGTGAATT AAGAATTTTA
AAAGCCGAGG TGACAGTGGG TCTTACGGCT GAGGCAACTG ACAAGAAAAA TGTGGAGATT
GTGTCCGATG CATATGGCTT AAGATCAAAT ATTGAGAATG AGAGGCAGGC ATTTAAAATC
AACAGGGTTG TGGCAAGAAA CAGAAGTCAG GTAACCTTAA AAGAGGTAGT GGAGTTTGAC
GGTAACAGTC CCGATATATC AGAAGTGTTC AATGTTTTGT GCAAACCTGG CCTGCTTGAA
TGCAGTGCAG GGGATGGATA TGTCAATGTG GAGGGAATTG TTAAAAACAA CATCCTCTAT
GTGGCAAACA ATACAGAACA GCCGGTGTTT GCCTATAGCT ATGAAATGCC TTTCAGCCAG
AGAATTGAGC TTGAAGGCGC AAAACCCGGA ATGAGGTGTG ACGTGGATTT GGAAGTTGAC
CACTATAGCT ACAGCGTGAT TTCGGCCAAT GAAGTCGAGC TAAGGGTTGT GGTGGATATT
AACGCCCGAT TGATGGAACA GACGGATGTA TCCCTTATAA CAAATGTTAC TGAAACTCCT
GCCGAAGACA GAAGCAATGA GCAGTATCCC AGCATAACCA TATATTTCTC CCAGCCCGGT
GACACTCTTT GGAAAATAGC AAAAAAATAC CGCACAACGG TGGAGGATAT TTTAAGAGTT
AACGAGTTTG GTGCGGATGA TGTTATCGGG GTGGGACAGC AAATAATAGT TCCAAGAAAA
ATAAGTTAA
 
Protein sequence
MSLELVREST KVNYVVGEEL AQTIVEHDII VPDVNPDVAR ILLIDGEIRE GDSEASQDRI 
HVDGTIYYKI LYVSDDPEQS VKSINTSSDF SYTVDVANSR AGMKAKVKCD IEHIDYEILN
GRKINAKTIL RIYAKAVNDV EQEFVSDLRG IEDIQVLKDN VDIYCYLGEN TVNCTSEEML
EVPAGKPAIK EILRNDVKIV GKDYRISDDK IIAKGDINIL TLYIGDNEER SIQFMEHEIS
FTQFIDLPGI SESSECEVDY RIKNATFIPQ EDSDGELRIL KAEVTVGLTA EATDKKNVEI
VSDAYGLRSN IENERQAFKI NRVVARNRSQ VTLKEVVEFD GNSPDISEVF NVLCKPGLLE
CSAGDGYVNV EGIVKNNILY VANNTEQPVF AYSYEMPFSQ RIELEGAKPG MRCDVDLEVD
HYSYSVISAN EVELRVVVDI NARLMEQTDV SLITNVTETP AEDRSNEQYP SITIYFSQPG
DTLWKIAKKY RTTVEDILRV NEFGADDVIG VGQQIIVPRK IS