Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3149 |
Symbol | |
ID | 4809712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3721116 |
End bp | 3722546 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108582 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_001039537 |
Protein GI | 125975627 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.35499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACAA TTCACACAAT CGAGCCGCGC AAAGTGTTTC ACTGGTTTTA TCAAATTAAC CAGATTCCGC GATGTTCCGG CAATGAAAAA AGAATCAGCG ATTTTTTGGT GAATTTCGCC AGAGAAAGAA ATCTGGAAGT TTATCAGGAC GAACTTTACA ACGTAATCAT CAAAAAGCCT GCAACTCCGG GCTATGAAAA TGCGCCTGCC GTCATTATAC AAGGCCACAG TGACATGGTC TGCATAAAAG GTGAAGGTTC CAACCATAAT TTTGACACGG ATCCTATCGA AATGATTGTG GAAGGTGACA TTTTAAGAGC GAACAACACA ACCCTTGGCG GAGACGATGG GATTGCCGTT GCTTACGGTT TGGCAATTTT GGATTCCGAT GATTTAAAAC ATCCTGCCAT TGAACTTTTG GTCACGACGA GGGAAGAGAC GGGCATGGAC GGGGCGATGG CTCTGACCGG TGAACATTTA AGCGGGAAAA TACTGCTTAA CATTGATTCA GACGAGGAAG GTGTTTTTTT AGTCAGCTGT GCCGGCGGTG CAAACCAGAT TGTTACCTTC CCGTTAAAAA AGGAGAAGAA AAGAGGCACG GGCCTTAAAA TTAAAGTTTC CGGCCTTAAA GGCGGCCACT CCGGAATGGA GATTGTCAAG CAAAGGGCAA ACGCAATTAA ACTTTTGGCC CGTATTTTGG ACCAATGCAG GGACAAGGTT ACTTTGGCAA AGATTACGGG TGGCAGCAAA CATAATGCAA TTCCAAAGGA AGCGGAAGCG GTTGTTTTGA CAGAAGATTT GGAAGGCACG GTACGTATAG TAGAGTCCCT GGCAAAGGAA TTGAAAGAAG AATACCGGGT GGAAGACAGC GGACTTACTG TTACCGTAAC GGAAGTTGGA GTTGAAGAAG TTTTCTTAAA GCAAATATCC AACGATGTAA TTGATTTTCT GATGATGACG CCGGACGGCG TTCAGTATAT GTCAAAGGAT ATCGAAGGTT TGGTTCAGAC GAGTGTCAAC AATGCCGTGG TGGAAGAAAA AGAAGGGCGG CTCGTTGTTA CAATATCTCT CCGTTCTTCA TCGGAAAGTT CTTTAAGAGA AATGTTAAAC CGTGTAGCGC TGATTGCAAA AAGGACAAAC GGGATGGCCA AAGAAAGCAA TTTTTATCCG GCATGGGAGT ATGATGACAA GTCTGAAATA AGAAAAACGG CAGTCAGGGT TTATGAAGAG ATGTTTGACA AAAAAGCAAA ATTGACTGCG GTTCATGCGG GACTTGAATG CGGTGTGCTC AAGAAAAAAT TGCCTGATGT TGATATGATT AGTTTCGGGC CAAATTTATA TGACATTCAT ACGGAAAAAG AGCATCTTAG CATCTCATCG GTGGAAAGAG TTTGGAGATT TCTAATTCGT TTGCTTGAAG AGATAAAATA A
|
Protein sequence | MSTIHTIEPR KVFHWFYQIN QIPRCSGNEK RISDFLVNFA RERNLEVYQD ELYNVIIKKP ATPGYENAPA VIIQGHSDMV CIKGEGSNHN FDTDPIEMIV EGDILRANNT TLGGDDGIAV AYGLAILDSD DLKHPAIELL VTTREETGMD GAMALTGEHL SGKILLNIDS DEEGVFLVSC AGGANQIVTF PLKKEKKRGT GLKIKVSGLK GGHSGMEIVK QRANAIKLLA RILDQCRDKV TLAKITGGSK HNAIPKEAEA VVLTEDLEGT VRIVESLAKE LKEEYRVEDS GLTVTVTEVG VEEVFLKQIS NDVIDFLMMT PDGVQYMSKD IEGLVQTSVN NAVVEEKEGR LVVTISLRSS SESSLREMLN RVALIAKRTN GMAKESNFYP AWEYDDKSEI RKTAVRVYEE MFDKKAKLTA VHAGLECGVL KKKLPDVDMI SFGPNLYDIH TEKEHLSISS VERVWRFLIR LLEEIK
|
| |