Gene Information |
Locus tag | Cthe_2387 |
Symbol | |
ID | 4811039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2853597 |
End bp | 2854868 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107800 |
Product | gamma-D-glutamyl-{L}-meso-diaminopimelate peptidase I. metallo peptidase. MEROPS family M14C |
Protein accession | YP_001038782 |
Protein GI | 125974872 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2866] Predicted carboxypeptidase |
TIGRFAM ID | |
|
|
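The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch, not part of the original record, assuming that Start bp and End bp are 1-based inclusive coordinates (consistent with 2854868 - 2853597 + 1 = 1272) and that Protein Length excludes a single terminal stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring the whitespace used to group bases."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(2853597, 2854868)
print(length)                  # 1272
print(protein_length(length))  # 423
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the rounded GC content reported above.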
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAGTAC TAAGACTTGG CTCCGTCGGA CCTGATGTAA AACTGGCTCA AAGCCTGCTT AATAAAATTG GCTATCCGGT CGGGGCTGTG GACGGAATAT ACGGAACACG AACCCGGCAG GCGGTAATTG CTTTTCAGAG AAACAACGGA CTTGTGGCCG ACGGAATTGT GGGACCGGCA ACGTGGAGCG TCTTTGAACA ATTTCTGAGA GGATACGCAA TTTACTATGT CAGACCCGGA GATACTTTAT ATAATATTGC CGGAAGGTTT TATACATCGG TCAACTCCAT AGTAACAGCC AATCCGGGAA TAAACCCCAA TGTCATAAAT ATAGGGCAAA GGTTGGTGGT TCCCTACGGA ATAGATGTGG TATTTACCGA CATTGACTAT ACTTATGAAA TAATGGAAAG GGATATCCAG GGTCTTAAAG CAAGATATCC GTTCCTGGAG ACAGGAGTCG CGGGCACCAG TGTCCTTGGA CGAAATCTTT ATTATCTGAG ACTCGGAACC GGTCCAAGAC AGGTATTTTA CAATGCCGCC CATCATGCCA TTGAATGGAT AACCACCGTG CTTCTTATGA AATTTGCGGA AAACTTCCTG AAAGCATATT CCACCGGCAG CAGGATTCGT GGTTATAATG TAAGGGAAAT ATGGAATCAA AGCAGTATAT ACATTGTACC CATGGTAAAT CCGGACGGAG TGGATCTTGT CCTTAACGGA CTTAGCCCGA CAAACCCGTA TTATGCGGAC CTGCTGCGCT GGAATACCAC AGGCAGACCG TTTTCCCAGG TGTGGAGTGC CAATATCAGG GGAGTTGATT TAAACAGAAA TTATCCGGCA AGTTGGGAAG AAGCAAAAGC GCAGGAAGAA GCATTGGGTA TATTCGGTCC TGGCCCCACA AGATACGGAG GACCGTATCC TCTTTCAGAG CCCGAGTCAT CCGCCATGGT GAGCTTTACA AGAACTCATG ATTTCAGGCT TGCCCTGGCA TACCATTCGC AGGGAAGAGT AATATACTGG AACTATTTAA ATCTTGCTCC ACCTGAGTCC CTGACAATTG CAAATGCTTT TGCAAGGGTA AGCGGATACA TTGTTTTGGA TGTCCCTTAC GAGGCTGCCT ATGCCGGATA CAAGGATTGG TTTATACAGG AGTACAGAAG ACCCGGATTC ACTATTGAAG TGGGATTGGG GCAAAATCCT CTTCCCATAT CCCAATTTAA TACTATTTAT AATGATAATG AAGAAATTCT GCTTCTTGCA TCTTTAATTT AA
|
Protein sequence | MQVLRLGSVG PDVKLAQSLL NKIGYPVGAV DGIYGTRTRQ AVIAFQRNNG LVADGIVGPA TWSVFEQFLR GYAIYYVRPG DTLYNIAGRF YTSVNSIVTA NPGINPNVIN IGQRLVVPYG IDVVFTDIDY TYEIMERDIQ GLKARYPFLE TGVAGTSVLG RNLYYLRLGT GPRQVFYNAA HHAIEWITTV LLMKFAENFL KAYSTGSRIR GYNVREIWNQ SSIYIVPMVN PDGVDLVLNG LSPTNPYYAD LLRWNTTGRP FSQVWSANIR GVDLNRNYPA SWEEAKAQEE ALGIFGPGPT RYGGPYPLSE PESSAMVSFT RTHDFRLALA YHSQGRVIYW NYLNLAPPES LTIANAFARV SGYIVLDVPY EAAYAGYKDW FIQEYRRPGF TIEVGLGQNP LPISQFNTIY NDNEEILLLA SLI
|
| |
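The Gene sequence and Protein sequence fields can be checked against each other by translating the CDS with translation table 11. The sketch below is an illustration only and makes these assumptions: the gene sequence is already given in coding orientation (it begins with GTG and ends with TAA even though Strand is "-"), the amino acid assignments of table 11 match the standard code, and any table 11 start codon in the first position is read as M. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon such as GTG still initiates with Met
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate_cds("GTGCAAGTAC TAAGACTTGG C"))  # MQVLRLG
```

The output matches the first seven residues of the Protein sequence field (MQVLRLG). Applying the same function to the full gene sequence should reproduce the remaining residues if the assumptions hold.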