Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0419 |
Symbol | |
ID | 4808422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 527446 |
End bp | 528705 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640105833 |
Product | peptidase M16-like protein |
Protein accession | YP_001036850 |
Protein GI | 125972940 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAAC GTATAAAATT AGAAAACGGT GTCAGAGTTG TTTGCGAAAA AATTCCCTAT CTCAGATCTG TTTCTATCGG TATCTGGGTT GGAACCGGTT CAAGGAATGA AAGCCAATCA AACAACGGAA TATCTCACTT TATTGAACAT ATGCTTTTTA AAGGGACTGA CAACAGAAGT GCCCGGGAGA TTGCCGACAG CATTGACAGC ATTGGGGGTC AACTTAATGC GTTTACAGGA AAGGAATGTA CCTGTTATTA TACAAAGACT TTGGATTCCC ATGCTGATAT TGCCTTGGAC GTTCTTTCGG ATATGTTTTT TAATTCAAGG TTTGAAGAAA AAGATATAGA AGTTGAAAAG AAAGTTATTT TGGAAGAAAT AGGCATGTAT GAGGATTCTC CGGAGGAGCT GGTGCATGAT ATCCTGTCTG AAACCGTATG GGAGGATAAT TCACTTGGAC TTCCCATTTT GGGAACACGG GAAACCCTTT TGAATATCAA CAAAGATAAA ATCAAAGCTT ACATTAATGA GAGATATTTG CCGCAGAATA CGGTTATAGC TGTGGCCGGG AATTTTGAAG AGGACAGAAT AATTGATGTT ATAAAAGAAA AATTCGGCGG ATGGAATGCC AGCGGAAAAG ACAGTAAAAC TATTGAAGAT GCAAAGTTTA AGGTGAATTC CAAAATCAAG GTGAAAGATA CGGAACAAAT ACACATATGT ATGGGATTTG AAGGAGTTGC GCACGGAAGT GATGAGTTGT ATCCTCTGCT TGCCGTAAAC AATGTATTGG GCGGCGGAAT GAGCTCCAGA ATGTTTCAGA AAATCAGGGA AGAAAAAGGA TTGGTGTATT CCATATACTC ATACCCGTCA TCTTATAAAA ATGCCGGTTT GTTTACGATA TATGCAGGAA TGAATGCAGA GCATTTGGAA AAGGTTGTGG AGCTTATAAT AAAAGAAATA AAGATACTTT TAAAAGAAGG ACTTTCTAAG GATGAACTTG AAAAATCCAA AGAACAGCTT AAGGGAAGCT ATATACTCGG ACTTGAGAGT ACCAGCAGCA GAATGAACAG TATGGGAAAA TCGGAAGTAC TTATGGACAG AATATATACT CCGGATGAGA TATTAAAGAA GATTGATGCG GTAAATCAAG AAAGTGTTGA ACGGGTTATA AAACAAATTT TTTGTTTGGA TAAAATCAGT TTTGCGATAG TGGGTAACAT AAAAAAGGAA ATAGATATTA GAAAAATAAT TAATGCTTAA
|
Protein sequence | MYKRIKLENG VRVVCEKIPY LRSVSIGIWV GTGSRNESQS NNGISHFIEH MLFKGTDNRS AREIADSIDS IGGQLNAFTG KECTCYYTKT LDSHADIALD VLSDMFFNSR FEEKDIEVEK KVILEEIGMY EDSPEELVHD ILSETVWEDN SLGLPILGTR ETLLNINKDK IKAYINERYL PQNTVIAVAG NFEEDRIIDV IKEKFGGWNA SGKDSKTIED AKFKVNSKIK VKDTEQIHIC MGFEGVAHGS DELYPLLAVN NVLGGGMSSR MFQKIREEKG LVYSIYSYPS SYKNAGLFTI YAGMNAEHLE KVVELIIKEI KILLKEGLSK DELEKSKEQL KGSYILGLES TSSRMNSMGK SEVLMDRIYT PDEILKKIDA VNQESVERVI KQIFCLDKIS FAIVGNIKKE IDIRKIINA
|
| |