Gene Information |
Locus tag | Cthe_0251 |
Symbol | |
ID | 4808599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 307053 |
End bp | 309608 |
Gene Length | 2556 bp |
Protein Length | 851 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105663 |
Product | transglutaminase-like protein |
Protein accession | YP_001036683 |
Protein GI | 125972773 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0634947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGAT TGAGTGTGAA TGATAGAAAA AATTTGATAA ATCTTGCTGT TTTCCTTTTG GTTGCTTTGT TATCGACGCT TCTTATTATG GCCGCTATAA GTATGTTGGA TTTGCGTGGG AACGGGTCGC CTGGAGGTGA AAATAATAAG AGGGCTTCCC AAAATGACGG AAGCGGCAAA AATCGCAATA AGCAGGACCG AAATAATAAC AATAAAAACA AAAAAGGAAG CGGCCAAAGG GAGGATGACA ACGGTTCCAA CTGGAAGGAA AGGGCTCAAA GAAGGCGTGG CGAGAGAAGA AGTTTTGACA GGGGAGATTA TGCTTATGGG CAAGGACCCG AAAGCATGAT TCCCGGAGAC GGATGGGAAT ATGATATTGC GCCTGATCAA ATGCCCAATC TTGACATATC CGGCAATGAA GGACTGCCCG ACATGTCTTT TGGAATGGGT GGCAATACGT TTGTTTTAAA AGAGGGCAAA AATTCCCACA TACCTGCTTT TGAAGTTTTG GGTGTTCCCA ATTACCCTTT TGTAAGAGTT ATGGCTATGG ACAACTATCG TCGGAGTCAC TGGAGTATGA TAAACGAAGC TCCGGAGCTG ATGTTTTTGT TTGGGGAGAA AGTGGACAGA AAATTTTCCG AAAATACTGT CAAGATAAAG CCTGTGGAGC CTTCAAAGGG GTATATTCCG GTATTGTCGG GCAACTTTGA AATGAAATAT GAGTTTAGCC TTTTGGAATA TAAACAATCC GGAGTCTTTT ATTCCACCGG AGTTATTGAA AATTTTTACG AAATGAAGTA CGAGGATCCT CCGACGGAAG CGGAATTGAT TAATGCAAAA ACCGATGATG ATTACTATTA TGATATTTAT GTGCCTGAAG TGGTCGAGAG AATAGTTGAT GAAGTGATTG AAAATTGCGA AACCGACTAT GAGGCAATAA AATTTGTTGA GAAATTCCTT TTGGAAAACT ACACTTATGA TAACACTGTG CTGAACAATT ACGGAAGCGG CGATGCGGTG GTTTCATTCC TTACAGGAAA AGACAGGGTG GGAAACCATC TGGATTTTGT ATCTGCCTAT GCCATTATTT TAAGGGCTGC GGGCATTCCA TGCAGACTTG CGCTGGGCTA TAAACTTTTG CCCGGTGTCA AATACCAGGT GGTGTATGCA GATCAGGTCT ATATTTATCC CGAGATAAAG TTTGAGGATT ATGGATGGGT TCCGATGGAC GTTTTCCCCT ATGATGTTTT TTACCGTCCG CCGAAAGAAA CAATAACCCA AATTACTTTT GCCGACGGTA CGACAAAAAG GGGAGAAACC GTTACGGTAA GAGGAACTGT TACGGATTCA TCAGGCAATC CCATAGACAA CATGACGGTT CTTGTCTATT TGAAACAATA CAAGAGCGAG CCGTGTATTT CTTATGCGAA AGCCTATGTC ACAAACGGCA ACTTTGAAGC CGTGTTCGAT ATAAAAGGGG ATATAAGTGC AGGAAAGTAT CATATTATTG CGGACGTTCT GGAAAATGAT GTTTACAGGA CATCCTCAAG CGACCCCGAA CTTAAAATTC TTGCGGATAC TTTTATTGAT CTGGAGGAAC GGAGTGATAT AATCGGCAAC AAACTGAATT TTGCAGGAAG GATTGTTGAC TTTTTTACCT ATGAAGGAAT AGAAGGCCTT GAGGTCCATG TATCCTTTGA AGGAATGGAT TTGGTTGAGA CGGTTGTGTC CGAAGAAGAC GGAAAACTTT ACAAAGAGAT TGAAATTGAA GTTCCTGAGG ATTATCCATA CTATAAAAAC 
TTTTTTTTTG CAGGAAGATA TTTGTTGTTT TACGGTATTG AATTTAAGGG AACTGAGATA TATACTCCCT ATTTCACAAG AAGGGGAGTG TATATGTGGA AGATATACTG GATAAACGTT ACTGTTGCGG TTGTTTTGCT TTTGGGCGTT GTACTTCTTT GTGTTGCAAT TGTGCTGAGG AAAAAAGGAG CATTCCGGAG GGACGGCGGA AAATTTCCCG TTTTGGCCGC GGAAGGTCCG GGAATGATTG TGGCAGCGGA TGCGGGAGCG GAATCGGTGG AAAGAAACCA TGGCAAAGTT TATATAGAGT TTCCCCAAAT TGGGGAGGGT TTACCCGATG TATGGGGAAT AAAAGAAAAT TTGGCCGTTG TTTTTCATGA CGACGAAGGC AACAGGGGAG AGATTGGGGC GGTATTCCAT AAAAAAGGGG AGTACAGAAT CAAAATTTCC GGGAAAAACG ACGAATACGG CGCAAGAAAC ATACGGATAG TTGACTACAG GGAAGAAATA ATTGCAATCG GAAAGAATTT TTTAAAAGAA ATGTCCGCAA AGATTTCAGG AATTACCGAT TTCATGACTC TAAGGGAAAT ACATGATATA ATTAAGCCGA ACATTGCTTC CGAAAGGCAT TGGGTTTTGG AAGATGCTTT CATGGTGTTT GAAAAGGCTG TGTACAGTGA TGAGGATATT GTAAGAAGTG ATTATGAAAA GTTTTATGTT TTTGCCAGAG AACTTGGAAA AAACAGCTCA ATTTAA
|
Protein sequence | MERLSVNDRK NLINLAVFLL VALLSTLLIM AAISMLDLRG NGSPGGENNK RASQNDGSGK NRNKQDRNNN NKNKKGSGQR EDDNGSNWKE RAQRRRGERR SFDRGDYAYG QGPESMIPGD GWEYDIAPDQ MPNLDISGNE GLPDMSFGMG GNTFVLKEGK NSHIPAFEVL GVPNYPFVRV MAMDNYRRSH WSMINEAPEL MFLFGEKVDR KFSENTVKIK PVEPSKGYIP VLSGNFEMKY EFSLLEYKQS GVFYSTGVIE NFYEMKYEDP PTEAELINAK TDDDYYYDIY VPEVVERIVD EVIENCETDY EAIKFVEKFL LENYTYDNTV LNNYGSGDAV VSFLTGKDRV GNHLDFVSAY AIILRAAGIP CRLALGYKLL PGVKYQVVYA DQVYIYPEIK FEDYGWVPMD VFPYDVFYRP PKETITQITF ADGTTKRGET VTVRGTVTDS SGNPIDNMTV LVYLKQYKSE PCISYAKAYV TNGNFEAVFD IKGDISAGKY HIIADVLEND VYRTSSSDPE LKILADTFID LEERSDIIGN KLNFAGRIVD FFTYEGIEGL EVHVSFEGMD LVETVVSEED GKLYKEIEIE VPEDYPYYKN FFFAGRYLLF YGIEFKGTEI YTPYFTRRGV YMWKIYWINV TVAVVLLLGV VLLCVAIVLR KKGAFRRDGG KFPVLAAEGP GMIVAADAGA ESVERNHGKV YIEFPQIGEG LPDVWGIKEN LAVVFHDDEG NRGEIGAVFH KKGEYRIKIS GKNDEYGARN IRIVDYREEI IAIGKNFLKE MSAKISGITD FMTLREIHDI IKPNIASERH WVLEDAFMVF EKAVYSDEDI VRSDYEKFYV FARELGKNSS I
|
| |
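The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the annotated CDS includes its stop codon; both assumptions agree with the values listed above. The function names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values copied from the Gene Information table above.
length_bp = gene_length(307053, 309608)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # expected protein length in aa
```

With the values from the table, the computed gene length matches the listed 2556 bp and the derived protein length matches the listed 851 aa. `gc_fraction` can be applied to the Gene sequence field to compare against the listed GC content; it strips whitespace first because the sequence is printed in space-separated blocks.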