Gene Information |
Locus tag | Cthe_0756 |
Symbol | |
ID | 4810374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 922460 |
End bp | 923395 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106173 |
Product | thermostable dipeptidase |
Protein accession | YP_001037184 |
Protein GI | 125973274 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
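The length fields above are consistent with the listed coordinates, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(922460, 923395)
print(gene_len, protein_len)  # 936 311
```

The result matches the Gene Length (936 bp) and Protein Length (311 aa) fields of this record.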
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATAG TTGATGCACA CTGTGATACA ATAACAAAAA TAATGGAGAA GGGTACACAA CTTCGTAAGA ATGACTGCCA TGTTGACATA GATAGGCTTA AAGCAAAAGG AAACTATGTT CAGTTTTTTG CAGCATTTAT AGACCCTGCT TACTGTCAGG CATACGCATT AAAAAGAGCT TTGCAGATAA TTGATGAGTT TTACAGACAG ATTGAAGTTA ATAAAGACGA CATTATGATA TGTTGTAATT ACAATGATAT TGAAGAGGCT GTAAAGGCTA ATAAGATTGC TGCAGTGCTT TCAATAGAAG GCGGTGAGGC CCTGCAGGGA GACCTTGGTG TTTTAAGGAT GCTTTACAGA CTTGGTGTAA GGAGCATTTG CCTGACATGG AATCACCGCA ATGAAATAGC CGACGGGGTC AAAGACGAAT CTTCGGGAGG CGGCCTTACG CCTTTTGGAA GAGAAGTGGT AAAAGAAATG AACCGGCTGG GAATGCTTAT TGACCTTTCC CATATATCAA AAACGGGCTT TTGGGATGTA TTGGAGTGTA CTTCGGCTCC GGTCATTGTA TCCCATTCAA ATGCCCAAAG GCTTTGTGCG CACAGGAGGA ACCTCACAGA CAAACAGATA ATGGCCGTAA AAGATAATGG CGGAGTAATT GGAATAAACC TGTATCCGGA ATTTTTAAAC AACTCCAAGG AAGCTACGAT AAAGGATATT ATCAATCATA TTGAGTACAT AGCAAGCCTT GCCGGTCCTG ACCATATTGG GCTTGGAGCT GATTTTGACG GTGTTGACGG TTTGCCGGCA GGAATAAATG GAGTACAGGA TATTGAAAAG ATATTTAATG AGCTTGCAAA ATTAAATTAT TCCAGTGAAA ATATAGAAAA ATTTGCCGGA AAGAACTTTC TCAGGGTAAT TCAAAATGTT CTGTAA
Protein sequence | MIIVDAHCDT ITKIMEKGTQ LRKNDCHVDI DRLKAKGNYV QFFAAFIDPA YCQAYALKRA LQIIDEFYRQ IEVNKDDIMI CCNYNDIEEA VKANKIAAVL SIEGGEALQG DLGVLRMLYR LGVRSICLTW NHRNEIADGV KDESSGGGLT PFGREVVKEM NRLGMLIDLS HISKTGFWDV LECTSAPVIV SHSNAQRLCA HRRNLTDKQI MAVKDNGGVI GINLYPEFLN NSKEATIKDI INHIEYIASL AGPDHIGLGA DFDGVDGLPA GINGVQDIEK IFNELAKLNY SSENIEKFAG KNFLRVIQNV L
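The relationship between the two sequences above can be reproduced with a short script. This is an illustrative sketch, not the pipeline that produced this record: it strips the 10-character grouping, computes a GC fraction, and translates codons using the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons, which this sketch ignores). The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand; the sketch assumes no reverse complement is needed. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding-orientation DNA sequence up to the first stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.split("*")[0]


# First 30 bases of the gene sequence listed above.
print(translate("ATGATTATAG TTGATGCACA CTGTGATACA"))  # MIIVDAHCDT
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) is a quick consistency check for the record; `gc_fraction` can be compared with the GC content field in the same way.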