Gene Information |
Locus tag | Cthe_1611 |
Symbol | |
ID | 4809601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1940837 |
End bp | 1941841 |
Gene length | 1005 bp |
Protein length | 334 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107027 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001038028 |
Protein GI | 125974118 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
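
The coordinate and length fields above are related by simple arithmetic: the start and end positions are inclusive, and the protein length excludes the stop codon. A minimal Python sketch of that consistency check follows; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(1940837, 1941841)
print(length_bp, protein_length(length_bp))  # 1005 334
```

Both results agree with the Gene length and Protein length fields listed for Cthe_1611.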
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000139366 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTT TTACAAAATA CATGACGCGA AACGATTGCT ATACAGCAGG TCGGAAAATC ACGCCTAAAG GAATCATGGT ACATTCGACG GCTGTGCCGG GTGTAATGGC GGCTGAGTGG TTTTCCCGTT GGAACAAATC TTACAAGGCC GGCGAAATAA ATAGGCAGGT ATGTGTTCAC GCTTTTGTAG ACGATAAAGA GGTTTGGCAA TACCTGCCTT GGGATCATCG CGGGTGGCAT GCGGGAGGAG CAGCAAACAA TACCCATATT GGCTTTGAAA TCTGTGAGCC TGCTGGGTTT TCGTATAAAT CCGGGTCGGT GATGGTGGGT TATGATGCAG CAAAGCAGGA AGATTATTTC TTTAAAGCGT GGCAGAATGC GGTTGAACTC TGCGTTATGC TCTGCAGGGA GTACGGACTT GACGAGAATG ACATCATCTG CCACTCCGAA GGATACAAGC TCGGTATTGC CAGCAACCAT GCTGATGTGA TGCACTGGTT CCCCAAGCAT GGGGAGAATA TGGACACTTT CCGAAAAGCA GTGAAAAAAG CGCTGGAGAA CAGTACAGAT AACAATACTG ATATTGGAAT TGGAGATATG GTGGAGTTTA AGGTCAGTGT AAAGAATTAC TACCCCGGCA GTGTGGAAGT TCCAACGTGG GTCAAAAATG ACTATTACCA CAGGGTCACA CAGACTTTAT ACAAAGGCAA GCCGGTCATA AAAGGCGGCA AAGAATGTGT ATTGCTTGGC AAAAAGGTTA AGAAATCCGG CGGTCAAGAA ATCGCAGGCA TAAACACTTG GGTAGCAAAA GAAAACCTTG TAATTGTAAA CAGCATTCCT GATAACAAGG GCAATAGAAC CTATACAGTG CAAAAAGGCG ATACCTTATG GAGAATAGCG GAAAAAGAAC TTGGTAGAGG AACAAGATAT CCGGAGATTA AGAAACTCAA TGGCCTGACT TCAGATACTA TTTACCCCGG ACAAGTTCTG AAATTGCCGG AATAA
Protein sequence | MKLFTKYMTR NDCYTAGRKI TPKGIMVHST AVPGVMAAEW FSRWNKSYKA GEINRQVCVH AFVDDKEVWQ YLPWDHRGWH AGGAANNTHI GFEICEPAGF SYKSGSVMVG YDAAKQEDYF FKAWQNAVEL CVMLCREYGL DENDIICHSE GYKLGIASNH ADVMHWFPKH GENMDTFRKA VKKALENSTD NNTDIGIGDM VEFKVSVKNY YPGSVEVPTW VKNDYYHRVT QTLYKGKPVI KGGKECVLLG KKVKKSGGQE IAGINTWVAK ENLVIVNSIP DNKGNRTYTV QKGDTLWRIA EKELGRGTRY PEIKKLNGLT SDTIYPGQVL KLPE
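
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration using only the Python standard library, under two assumptions: the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and the sense-codon assignments of translation table 11 match those of the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Alternative start codons are not special-cased here; this record starts with ATG.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-letter blocks of the gene sequence from the record
print(translate("ATGAAGCTTT TTACAAAATA CATGACGCGA"))  # MKLFTKYMTR
# Last five codons of the gene sequence, including the TAA stop
print(translate("AAATTGCCGGAATAA"))  # KLPE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the Protein sequence and GC content fields of the record.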