Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2207 |
Symbol | |
ID | 4811072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2635725 |
End bp | 2636900 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107613 |
Product | isoaspartyl dipeptidase |
Protein accession | YP_001038602 |
Protein GI | 125974692 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01975] isoaspartyl dipeptidase IadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA AGATATTTAA ACTTCTAAAA AATGGTACAT GCTATATTCC GGAGTTTATT GGGAAAAAAG ATATTTTGGT GGTTCAAAAT AAAATTTACA AAATTGAAAA CCATATAGAT GAGTCCGTTG TTCCGGACCT TGAAATTATT GATTGCAGCG GAAAAATAGT ATGTCCCGGA CTCATTGACC AGCATATCCA TATTACAGGA GGTGGAGGAG AAGAGGGGCC GGGCAGCAGA ATACCTGAAA TTATGCTGTC CGAGTGCCTT ACGGCCGGGA TAACAACGGC TGTGGGGGTG TTGGGAGCCG ACAGTATAAC CAGGAACATA TCGGGGCTTT TGGCGAAAGC AAGGGCGCTG GAGGAAGAAG GGCTTAACAC ATATATATAT ACCGGAAGCT ACAGGATTCC TACGGCCACG CTTACAGGAA GCGTGATGTC CGATATAGCC CTGATTGACA AGATTATTGG GGTCGGCGAA ATAGCTGTTT CGGACCACAG GTCTTCCCAC CCTTCCCTTG ACATGCTCAA GGCTGTAGCC AGTGAAGCCA GGATGGGGGG ATTGGTCGGA AAAAAGGCCG GAGTCGTTCA TATACATGTG GGAGACGGGA AAAAAGGGCT TGAGCCGGTT ATTGAACTTG TGGCAAATTC TGATTTTCCC ATAGAGATGT TTGTACCCAC CCATTTAAAC AGAAACAGAA CTTTGTTTTT TCAGGCAATA GAATATGCCA AAATGGGGGG AAACATAGAT TTGACGGCCG GGGAGACCAA TGAGACAGGG TATGCGGTGC CGGATGCCTT AAAACTCCTT GCAGATGCAG GGGTGGATAT GGACAGGGTA ACTGTGTCAT CGGATGCCAA CGGCAGCATT CCGGCAAAAG AAGGATGCGG TCCGGGGGTT GGCAGGGCGG ACGAGCTTAT TAATGACATA AGGAGCAGTA TTTTAAGCGG TAAATTGACT GTGGAGCAGG CTTTAAAAAC TGTCACCGTG AATGTGGCAA AAGTACTTAA ACTTTACCCG AAAAAAGGTG TTATTAGACC GGGAAGTGAT GCGGACATAC TTGTCTTTGG AAAAGAAGAC CTGAAGCTGG ACAAGGTGTT TGTAAACGGT GAGCAGTTTG TGAATAACGG CAAGGTGCAA AAATGGGGAC GGTATGAAGA AAAATGGCAT GGATAA
|
Protein sequence | MEKKIFKLLK NGTCYIPEFI GKKDILVVQN KIYKIENHID ESVVPDLEII DCSGKIVCPG LIDQHIHITG GGGEEGPGSR IPEIMLSECL TAGITTAVGV LGADSITRNI SGLLAKARAL EEEGLNTYIY TGSYRIPTAT LTGSVMSDIA LIDKIIGVGE IAVSDHRSSH PSLDMLKAVA SEARMGGLVG KKAGVVHIHV GDGKKGLEPV IELVANSDFP IEMFVPTHLN RNRTLFFQAI EYAKMGGNID LTAGETNETG YAVPDALKLL ADAGVDMDRV TVSSDANGSI PAKEGCGPGV GRADELINDI RSSILSGKLT VEQALKTVTV NVAKVLKLYP KKGVIRPGSD ADILVFGKED LKLDKVFVNG EQFVNNGKVQ KWGRYEEKWH G
|
| |