Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1800 |
Symbol | |
ID | 4809784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2126418 |
End bp | 2127929 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640107214 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001038214 |
Protein GI | 125974304 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.199953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTATA CCGTACAGCC GGGAGATTCT CTTTATACCA TTTCCCAAAG ATTTGGAGTA ACAATTGCAC AGATAAAAAG TGCCAACCAA CTTACAAGCG ATATTATCTA TGTAGGTCAG CGTCTATACA TTCCTATAGG AATTCAAGCA CCTGTAGTAT ACACCGTAAG ACCCGGTGAT ACACTGTATC TGATAGCCCG AAGATACAAC ACCACCGTGG ACAGTCTTAT GGCTCTTAAC AATCTTAGCA GTACTGAGCT GAGAATCGGC CAGCAGCTAA CCATCCCCCT TTATACCGAA GCAGTGGTCA ATGTGGGTAC CGCAAACATC CGCAGAGGCC CGGGAACCAA CTTTGGCATA ATTACTCGCA TGACAAACGG TGCAAGGCTT CCGGTTATCG GTTTTAGCAA CAACTGGTAC CAGGTACGTC TTTACAACGG AAGAGAAGGC TGGATTTCGG GAAGCATCGT CACCCGCAAT GTTTACAGCG GACGCAGGCC TATAACAGGA GTACTGGGAT TCTACACCCT TGAGGAAGGT CCCACCCTCC CAAGCTCTTT TACATCCTTT GCAAACAATA CAGGGCAGCT TTCCTCAACC GCATTGTTCA TGTTCAGAAT CAGCGCCGCC AACCCAACCA CCATCGAAAA ATTTGGAGAG TTTACCGACC AGGATGTTCG CAACCTGGTG GCAATTGCCC ACAGGAACAA TGTAAAAATT ATGCCTGTGG TTCACAACCT GCTGTACAGA CCCGGGGGAA CCACTCTTGC CAAAAACGTT GTAAAAACTT TGGTCTCAGA CCCAAGAAAC AGAAATGCCT TTGCGCTGAA CCTTGTAAAT CTCATAGAAA GATACGGCTT TGACGGTGTA AATATTGACA TCGAGGATGT GTTTATAGAA GACAGCGACA ATCTTTCCCT CCTGTACACC GAGATATCCG AAGTCCTGAG GCCAAGAGGA TATTTCTTCT CTGCATCGGT TCCTTCAAGG GTAAGCGATG AACCCTTCAA TCCTTTCTCC GATCCGTTTA ACTACAGTGT GATCGGAAGA GCGGTGGACG AATTTGTCGT AATGCTATAC AACGAATTCG GATGGCCGGG AAGCCCGCCG GGACCTGCGG TCACCATAGG TTGGATGGAA CGCGTGCTAA GATACACCAT GAGCAAAATG CCGAGGGATA AGATTATGGC AGCCGTGTCC GTGTTTGGAT TCGACTTTAA TCTCACCACA GGCCGAAACA CCTATGTGAC TTACCAGTCG GCGATCAACC TTGCCCGAAG GTACAACAGT GAAATTATTT TCAACGAGGA AAGACAGACG CCCATGTTTA CCTACAGAGA CGCACAGGGA AATCAGCACG AAGTATGGTT CGAAGATGCC CGAAGCCTCA GATCCAAAAT TCAGCTGGCC TGGGAACTCG GCATAAAGGG CGTTGCTTTG TGGAGACTTG GGATGGAAGA CCCAAACATC TGGCCAATGC TTCGAAATGA AGTGGTGGTA AGGAAGTTTT AA
|
Protein sequence | MWYTVQPGDS LYTISQRFGV TIAQIKSANQ LTSDIIYVGQ RLYIPIGIQA PVVYTVRPGD TLYLIARRYN TTVDSLMALN NLSSTELRIG QQLTIPLYTE AVVNVGTANI RRGPGTNFGI ITRMTNGARL PVIGFSNNWY QVRLYNGREG WISGSIVTRN VYSGRRPITG VLGFYTLEEG PTLPSSFTSF ANNTGQLSST ALFMFRISAA NPTTIEKFGE FTDQDVRNLV AIAHRNNVKI MPVVHNLLYR PGGTTLAKNV VKTLVSDPRN RNAFALNLVN LIERYGFDGV NIDIEDVFIE DSDNLSLLYT EISEVLRPRG YFFSASVPSR VSDEPFNPFS DPFNYSVIGR AVDEFVVMLY NEFGWPGSPP GPAVTIGWME RVLRYTMSKM PRDKIMAAVS VFGFDFNLTT GRNTYVTYQS AINLARRYNS EIIFNEERQT PMFTYRDAQG NQHEVWFEDA RSLRSKIQLA WELGIKGVAL WRLGMEDPNI WPMLRNEVVV RKF
|
| |