Gene Information |
Locus tag | Cthe_0679 |
Symbol | |
ID | 4810297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 835293 |
End bp | 836507 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106096 |
Product | Serine-type D-Ala-D-Ala carboxypeptidase |
Protein accession | YP_001037107 |
Protein GI | 125973197 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
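The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the values in this record) and that the CDS is unspliced and ends in a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(835293, 836507)
print(length)                     # 1215, matching the Gene Length field
print(protein_length_aa(length))  # 404, matching the Protein Length field
```

Both results agree with the record: 405 codons, of which the last is the stop codon.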
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.877856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGAGAA AGGTTATAAT CTGTTTTACA GCGGCGGTAT TTTTGCTGAA CATAATGGCA ATATCCGTTC TTTCAGCTCC AGCTGTCGGC GAGCCGGTAT ACGATGTCGA AACCGTTGCT GATATAATGG CAAATACCAA TGTATTTGAA TTGCAGGCAA AGAGCTATGT TTTGATGGAT GCCGAAACCG GTCAGATTCT TTTGGAAAAC AGAAGCCACG AAAGATTGCC CATAGCAAGT ATAACAAAAA TCATGTCCAT GTTGTTGGTA ATGGAAGCAA TTGATTCGGG GAAAATCAAC ATGGATGATA TAGTGACGGC TTCAGAGTAT GCCGCAAGCA TGGGAGGTTC CCAGGCTTAT ATTGAGCCGG GCGAACAGTA TACGGTAAGA GATGCTCTAA AAGCAGTTGC AACCCACTCA TCCAATGACG TAACCGTTTC CCTGGCCGAA CTTGTGGCCG GAAGCGAACA GGTGTTTGTG GTGCTTATGA ACGAAAAGGC AAAAGAACTT GGCATGAACA ACACAAACTT CCTTGATTGT ACCGGGCTTA CGGATGAAGG CCATTACAGC ACCGCATATG ATGTTGCGCT TATGTCGAGA GAGCTTATTG TAAAACACCC GAAAATTCTT GAGTTTACCT CCATCTGGAT GGATACCTTC AGAAACGGAG AATTTCAGTT GGTAAACACC AACAAACTTG TTCATTTCTA TGAAGGATGT GACGGACTGA AAACAGGGTT TACCCGTGCA GCAGGTCACT GTCTTTCGGC AACCGCCAAA AGAAATGACA TGAGACTTAT TTCCGTGGTG TTGGGTGAGC CGGATTCAAA CACCAGATTT GCAGAAACGA GAAAACTTTT AGACTACGGA TTTGCCAATT ATGAATCAAA ACAGGTAAAC AAAAAAGGTG AAGTGGTCAA CGAAATTGAA GTAAAAAGAG CATTGATTCC CAAAATAAAG GCCCTTTACG GCGATGATGT AAAACTCCTT TTTGCAAGAA GTGACAAAGG CAAGGTAGTA AGAGAAGTAC GATTGAAAAG CTTTCTTACA GCCCCGGTGG CAAAAGGCGA GAAAGTGGGC GAAGTGGTAT ATAAAATAGG TGAAAAGGAG ATTGCAAAAG TGGATCTTGT TTCCGACAGG GATGTAGAGA AGGCTTCTTT TGGAAAATTG TTTATAAACA TGCTTTCAAG CTGGTTCAGT CTGGGAAGAA GTTAA
|
Protein sequence | MLRKVIICFT AAVFLLNIMA ISVLSAPAVG EPVYDVETVA DIMANTNVFE LQAKSYVLMD AETGQILLEN RSHERLPIAS ITKIMSMLLV MEAIDSGKIN MDDIVTASEY AASMGGSQAY IEPGEQYTVR DALKAVATHS SNDVTVSLAE LVAGSEQVFV VLMNEKAKEL GMNNTNFLDC TGLTDEGHYS TAYDVALMSR ELIVKHPKIL EFTSIWMDTF RNGEFQLVNT NKLVHFYEGC DGLKTGFTRA AGHCLSATAK RNDMRLISVV LGEPDSNTRF AETRKLLDYG FANYESKQVN KKGEVVNEIE VKRALIPKIK ALYGDDVKLL FARSDKGKVV REVRLKSFLT APVAKGEKVG EVVYKIGEKE IAKVDLVSDR DVEKASFGKL FINMLSSWFS LGRS
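The sequences are printed in space-separated blocks of ten, so they need cleaning before any check. The gene begins with GTG while the protein begins with M; under translation table 11, GTG is an accepted start codon and is read as methionine at initiation. A minimal sketch for cleaning a block-formatted sequence, computing its GC fraction, and sanity-checking the CDS frame follows. The helper names are hypothetical, and the start-codon set lists only the common bacterial starts, which is a simplification of table 11.

```python
# Common bacterial start codons (assumption: table 11 permits a few more).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons of the standard bacterial code (translation table 11).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def check_cds(raw: str) -> dict:
    """Basic frame checks: whole codons, plausible start, stop at the end."""
    seq = clean_sequence(raw)
    return {
        "length_bp": len(seq),
        "whole_codons": len(seq) % 3 == 0,
        "start_ok": seq[:3] in START_CODONS,
        "stop_ok": seq[-3:] in STOP_CODONS,
    }


# First two blocks of the gene sequence above, used as a small example.
head = "GTGTTGAGAA AGGTTATAAT"
print(gc_fraction(head))         # 0.3 for these 20 bases (6 G or C)
print(check_cds("GTG AAA TAA"))  # toy CDS: GTG start, one codon, TAA stop
```

Passing the full Gene sequence field to `gc_fraction` and `check_cds` gives values to compare with the GC content and Gene Length fields of the record.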