Gene Information |
Locus tag | Cthe_1016 |
Symbol | |
ID | 4811310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1216468 |
End bp | 1217526 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106434 |
Product | cell wall hydrolase/autolysin |
Protein accession | YP_001037441 |
Protein GI | 125973531 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0860] N-acetylmuramoyl-L-alanine amidase |
TIGRFAM ID | |
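The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1216468
end_bp = 1217526
gene_length_bp = 1059
protein_length_aa = 352


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under those assumptions, 1217526 - 1216468 + 1 gives 1059 bp, and 1059 / 3 - 1 gives 352 aa, matching the listed values.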
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000634005 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAGC CAGTAATTTT ATTAACTGTA TTAATCCTTG TGATACTCAT GACAGTTCCC GGCTTTGCCG CAAACGGCCA GTATTATTCG GGATATTGGG GAGTAACTTC CAATGCTTTG ATAGCTGTGA ACGGAAAGGT TTTAACTCTC GAACGAAACC CGGTAATTAT CGAGGGAAGA ATTTTAGTAC CGGCGAGAAC TACTTTTCAA GTTTTGGGAG TAACCACCGA ATGGTATTCC TCTTCCGGAG TTGTAAAGAT GGTGAAAGGA AATAAGGCCG TTAAAATGAC TGTTGGAAGT AAAGTGGCGC ACTCGGGAGG TTCTGTAAAG TACATGGAAG CTCCGCCTAT TCTCGTGAAC GGCACAGTTA TGGTTCCAAT GAGATTTGTT GCTGAAACCT TCGGCGAGAA TGTAGGCTGG GATGCAAAAA ACGAAATGGC TTATATCGGA AACAAACCGG CGGAAATACC TTCAAGAAGC GGTCTTAAAT CCAATAGAAC ATACAAGGTC GTTATTGATG CCGGTCACGG AGGGAGCCAA TCGGGCGCTG TGTACGGAGG AGTAAAAGAA AAAGATTTGA ACTTGGATAT TGCCAAGCGG CTGAATACCT TGCTTAAAGC TGAAGGCATA AAAACATATA TGACCCGGGA AAAAGACATA ACAGTGGGGC TTTATACTAG ATCTGACCTT GCGAACAAAG AAAAAGCCGA TTTGTTTGTC AGCATTCATA ACAATGCCGG GAACAGCAAA ACTTCAGGGT CCATGACTTT GTATCATCCT GACAGCGGCA AAAAAAAGGG AAATCTTACC GCATATGAAT TTGCTCAAAT AGTGCAAAAA AATTTGAACA AGACCCTGGG GTCAAAGAAT ATGGGAGTTA TTCAAAGACC AAATCTTGCA GTATTAAGGA CGACAAACAT GCCCGCGGTT ATAGCGGAAA TAGGTTATAT GTCCAACAGT GCGGAGCTTG CAAAATTAAA AACCGATTCA TACAGGCAGA AAGCTGCCGA AGCGTTGAGA GATGCCGTTA TTGAAAGTCT TGAAAAAATG TATAAGTAA
|
Protein sequence | MKKPVILLTV LILVILMTVP GFAANGQYYS GYWGVTSNAL IAVNGKVLTL ERNPVIIEGR ILVPARTTFQ VLGVTTEWYS SSGVVKMVKG NKAVKMTVGS KVAHSGGSVK YMEAPPILVN GTVMVPMRFV AETFGENVGW DAKNEMAYIG NKPAEIPSRS GLKSNRTYKV VIDAGHGGSQ SGAVYGGVKE KDLNLDIAKR LNTLLKAEGI KTYMTREKDI TVGLYTRSDL ANKEKADLFV SIHNNAGNSK TSGSMTLYHP DSGKKKGNLT AYEFAQIVQK NLNKTLGSKN MGVIQRPNLA VLRTTNMPAV IAEIGYMSNS AELAKLKTDS YRQKAAEALR DAVIESLEKM YK
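The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the alternative start codons read as methionine at initiation. A minimal sketch of that relationship follows, assuming the gene sequence is shown in coding orientation; `translate_cds` is a hypothetical helper written for illustration, not part of any database tool.

```python
# NCBI codon ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Codons that translation table 11 reads as methionine when they initiate a CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS given in coding orientation; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon always yields methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 15 bases and last 9 bases of the gene sequence above.
n_terminus = translate_cds("GTGAAAAAGC CAGTA")
c_terminus = translate_cds("TATAAGTAA")
print(n_terminus, c_terminus)
```

The two fragments translate to MKKPV and YK, which match the first five and last two residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the whole record.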