Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0985 |
Symbol | |
ID | 4811279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1177642 |
End bp | 1178925 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106403 |
Product | peptidase M16-like protein |
Protein accession | YP_001037410 |
Protein GI | 125973500 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATGA AAGTTGTTGA ATATAAAAAC ATCGATGAAA CGGTATATGT TCATGAACAT TCAAGCGGTC TTAAATCCTT TGTGGTGCCC AAAAAAGGTT ATTCAAAAAA ATATGCGAAT TTTGCAACCC ATTACGGTTC CATCAATAAT GAGTTTGTCG TGCCCGGGGA AAAAGATTCC ATCAGAGTTC CCGACGGAAT AGCCCACTTT TTGGAGCATA AGCTTTTTGA ACAAAAAGAC GGAAGCGTTA TGGATAAGTT TTCACAGCTT GGTTCAAATC CCAATGCATA TACAAGCTTT GCCCAGACGG TTTATCTTTT TTCCTGCACT GACAGGTTCG AGGACAATTT CCGACTCCTT TTGGATTTTG TTCAAAATCC TTTTATAACA GAGGAAAGCG TGGAAAAAGA AAAGGACATA ATAGCTCAGG AAATCAGGAT GTATGAAGAT GATCCGAACT GGAGAGTTTT CTTCAACTTG CTGGATGCAT TTTATGTAAA TAATCCTGTA AAAATTGACA TAGCAGGAAC GGTTGAAAGT ATAAGCAAAA TCAACCGAGA CATTTTGTAC AAGTGCTATA ATACTTTCTA CCATCCTTCC AATATGATGA TTCTGGTAGT TGGAGATGTT GAACCGAAAG AAGTGTTCGG ACAGATTGAG GAAAGTATAG ATGCAAAGAG CAGCAAGCCT GAAATAAAGA GGATTTTTCC CGAGGAACCC AAAACAATCA ACCGGGACTA TGTTGAACAG AAGCTTGCGG TTGCCATGCC CATGTTTCAA ATGGGATTTA AAGACAATGA TTTTAATTCA AAGGGAATTG AGTGCTTAAA GAGGGAAGTT GCGGTAAAGC TCATACTTGA AATGATAATG GGCAGAAGTT CAAGCCTTTA TAACGAGTTG TACAACGAGG GTCTTATCAA CAACACCTTT GATTTTGATT ACACCATTGA AGAGAATTAT GCATATTCGG CTTTCGGTGG CGAATCCAAA GATCCCTTGA TGGTAAAAGA AAGAGTCGTG GATGAAATCA GGAAAATACA GGCGAACGGG CTTGACAAAA ACAGCTACGA ACGGATTAAA AGAGCCATGA AAGGAAGATT CATAAAGCAG CTCAACTCGG TGGAGAGAAT TTCGCACATG TTTATATCGG TGTATTTTAA AGATGTAAGC ATGTTTGACT ATCCGGATGT TTATGACAAT ATGACTTTTG ATTATGTAAA AGAGGTTTTT GAGAATCATT TCAATTTGGA TAATCTGGCT GTATCGGTGG TAAACCCGGT ATAG
|
Protein sequence | MNMKVVEYKN IDETVYVHEH SSGLKSFVVP KKGYSKKYAN FATHYGSINN EFVVPGEKDS IRVPDGIAHF LEHKLFEQKD GSVMDKFSQL GSNPNAYTSF AQTVYLFSCT DRFEDNFRLL LDFVQNPFIT EESVEKEKDI IAQEIRMYED DPNWRVFFNL LDAFYVNNPV KIDIAGTVES ISKINRDILY KCYNTFYHPS NMMILVVGDV EPKEVFGQIE ESIDAKSSKP EIKRIFPEEP KTINRDYVEQ KLAVAMPMFQ MGFKDNDFNS KGIECLKREV AVKLILEMIM GRSSSLYNEL YNEGLINNTF DFDYTIEENY AYSAFGGESK DPLMVKERVV DEIRKIQANG LDKNSYERIK RAMKGRFIKQ LNSVERISHM FISVYFKDVS MFDYPDVYDN MTFDYVKEVF ENHFNLDNLA VSVVNPV
|
| |