Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3005 |
Symbol | |
ID | 4811153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3528781 |
End bp | 3530052 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108426 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001039394 |
Protein GI | 125975484 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1388] FOG: LysM repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000564074 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTA AGAAAACTGT AATTTGTGGG GTTTTGAGCA TAGGTATCAT GGCAGGAAGT TCAGGATTTG CTTTTGCGCA GAATGTTAAC TACAAAGTAC AGTCCGGTGA TACGTTCTGG AAGATAGGGC AAAAGTATAA TATTTCGACA GCCGCCCTTT TAAAGGCAAA CAATGCTAAT GAGAATACGG TGTTGTATCC GGGACAGACA ATAGTGCTTC CGATAAAAGA CGAATCGGTG TATATAGTTC AATCCGGAGA TACTTATTGG AATATCAGCC AGAAATACGG GATAAACTTT AAAGAGTTAT TGGCACTGAA CAATGCCAAT GAAAATTCGA TGCTGAATGT AGGGGATAAA GTAATTCTTC CCGCTACGGC AAATTATACC GTGCAAAAAG GAGACACTTA CTGGACAATA AGCCAGAAAT TCAAAGTAAA CTTTACGGAA CTTTTGAAAC TGAACGGTGC CAATGAAAAA AGTTATCTTG ACATAGGCCA GGTAATAAAA ATACCTGTAA CCAGCATGTC ACAGGTTCCG GCAACTTCTT CAAATCAAAA CAATAATTCA ACCAATACTT CAAATAACAA TTCAGGCAAC AACTTAAGCG GTCCGTACAT TACTTATACA AGCTATACAG TTCAAAAAGG CGATACCGCG TGGAGTATTG CAGAAAAGTT CGGAATTTCG ATGTATGAGC TTATGGAAGC AAATAATATC AATTCCTCGA CGGTGCTTAA TATTGGACAG AAATTAAAGA TACCGGTACA TAATGTACCT GTAAAAAGCA CTCCTGGTGA AAAATACGGG GAACTGCTGG ACTGGTGGAC GGAAGCACAG TATTTGATTC CAAGGGGAAG CACTTTTGAG GTGGTGGATT TCTACACCGG CAAATCCTTT TTTGTAAAAC GTACGGGCGG TTCAAACCAT GCCGACTGTG AAACTCTTAC TGTAAAAGAC ATTAATATAA TGAAGGAAAT ATGGGGTGGC TTCAGCTGGG TAAGAAGACC TGTGATAATC AAATATAACG GAAGAAAAAT TGCCGCCAGC ATGACGGCAA TGCCGCATGC GGGAAATGAC AGCGCCCCGG GTGGTGTCTG GACATCGTGG AGGAGCGGAG ATTACGGCGC GGGCACAAAC TATGACTATA TTAAAGGCAA TGGTATAGAC GGCCACTTTG ATATACATTT TTACAACAGT ACAAGGCATA AGGACGGAAA ACTGGATCCA AACCATCAGC AGTGCATAAA AATATCGGCA GGGGTGCAAT AA
|
Protein sequence | MKLKKTVICG VLSIGIMAGS SGFAFAQNVN YKVQSGDTFW KIGQKYNIST AALLKANNAN ENTVLYPGQT IVLPIKDESV YIVQSGDTYW NISQKYGINF KELLALNNAN ENSMLNVGDK VILPATANYT VQKGDTYWTI SQKFKVNFTE LLKLNGANEK SYLDIGQVIK IPVTSMSQVP ATSSNQNNNS TNTSNNNSGN NLSGPYITYT SYTVQKGDTA WSIAEKFGIS MYELMEANNI NSSTVLNIGQ KLKIPVHNVP VKSTPGEKYG ELLDWWTEAQ YLIPRGSTFE VVDFYTGKSF FVKRTGGSNH ADCETLTVKD INIMKEIWGG FSWVRRPVII KYNGRKIAAS MTAMPHAGND SAPGGVWTSW RSGDYGAGTN YDYIKGNGID GHFDIHFYNS TRHKDGKLDP NHQQCIKISA GVQ
|
| |