Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1231 |
Symbol | |
ID | 4809923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1473277 |
End bp | 1474575 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106654 |
Product | Serine-type D-Ala-D-Ala carboxypeptidase |
Protein accession | YP_001037656 |
Protein GI | 125973746 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGC TATTGTTACT TTTAATCATG ACAATCATTC TGTTTTCAAA TATTGTGACT TTAAAAGCAC AACCATTGGA TATTAATGCA CAAGCATACA TTTTGATAGA TTCCAAAACT GGTCAGGTTC TTGCTGAACA CAATCCGGAT CTTAGAACTT ATCCTGCCAG CACCACTAAA ATAATGACAG CAATACTGGC ACTCGAACTT GGAGATCTCA ATCAAATAAT GACTGTCAGC CAGTCTGCCA TAGATGACAT AGGTCCTGGC GGCATGCATA TCGGTTTGCT GCCGGGCGAA CAACTGGAGC TCAGATACTT ACTGGATGCT CTCCTGGTGA GATCAGCCAA TGAGACTGCT TATGTTATTG CCGAAAACCT CTGCTCCTCC CGCGAGGAAT TTTACAGACT TATGAACGAA AAGGCAAGGG AGCTTGGGGC TACCAATACA AATTTTGTAA ATCCCTGCGG TATTGACAAT GGAGAAAAGG GAAAAAATCA TCTTACCACG GCAAGAGATC TCGCCAAAAT AGCGCAGTAT GCAATGACGA TACCGGAATT TAGGGAAATC GTTCAAAAAA CTATTATCAA AATACCTCCT ACAAACAAGC ATGCTGAAGA GGTTATTGTC GGTACTACCA ATAAATTGCT GCTCTACAGC AACTCAAAAT ACAAATCGGA ACACTATACA AAAATAACCG GTATAAAAAC GGGTTATACC GACAGGGCCC TTAACAACTT GGTTTCTTCC GCCGTCAACG ATGAAGGAAC GGAATTGATT GCTGTGGTTC TCGGCGTTGA GAATTATGAC ATGGTGTTCG AATATTCCAA AATGCTGTTG GAATACGGTT TCAAAAACTA CTCCGTTCAG CCTGTTATTG CACCGAACTC GTATATAACC TCTGTACCCG TTTTAAAAGC AGCGGGAAAT CACAACTTGG ATATTCTGGC ATCGCCGGAG GGACTCAAAT GCCTGCTGCC CAACAATTCA ACTAAAAATG ATTATGAAAT TGAACAACAT ATTTTAGAAA ACATAGAAGC TCCGGTAAAA AAAGGGGATG TTCTCGGATA CATCGAGGTT AAAAAAGACG GTGTCACCAT CGGAAAAATA GATGCAGTCG CTTCAAGGGA TGTTGAAAAA CTTGAGCCGC CGGTTGAACC TCAAAACATA ATTATTAAAA CAGCAAACGA TCCGATTTTG AAAAAAGTTA CAACAGGAGC ATTGATCTTC CTGTTAATGT TCCTTATGTT AAGATTTACT TTGCGCAGAA TTTCACGAAG CCTTCATTCA AAAAGATAA
|
Protein sequence | MKRLLLLLIM TIILFSNIVT LKAQPLDINA QAYILIDSKT GQVLAEHNPD LRTYPASTTK IMTAILALEL GDLNQIMTVS QSAIDDIGPG GMHIGLLPGE QLELRYLLDA LLVRSANETA YVIAENLCSS REEFYRLMNE KARELGATNT NFVNPCGIDN GEKGKNHLTT ARDLAKIAQY AMTIPEFREI VQKTIIKIPP TNKHAEEVIV GTTNKLLLYS NSKYKSEHYT KITGIKTGYT DRALNNLVSS AVNDEGTELI AVVLGVENYD MVFEYSKMLL EYGFKNYSVQ PVIAPNSYIT SVPVLKAAGN HNLDILASPE GLKCLLPNNS TKNDYEIEQH ILENIEAPVK KGDVLGYIEV KKDGVTIGKI DAVASRDVEK LEPPVEPQNI IIKTANDPIL KKVTTGALIF LLMFLMLRFT LRRISRSLHS KR
|
| |