Gene Information |
Locus tag | Cthe_1181 |
Symbol | |
ID | 4810133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1408476 |
End bp | 1410758 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106603 |
Product | transglutaminase-like protein |
Protein accession | YP_001037606 |
Protein GI | 125973696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
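The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names (gene_length, protein_length, clean_sequence) are hypothetical helpers introduced here for illustration and are not part of the database.

```python
# Values copied from the record fields above.
START_BP = 1408476
END_BP = 1410758
REPORTED_GENE_LENGTH = 2283      # bp
REPORTED_PROTEIN_LENGTH = 760    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon
    at the end that is not translated into a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to display the sequence
    in blocks of ten characters."""
    return "".join(grouped.split())


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Because the gene is reported on the minus strand, Start bp and End bp refer to positions on the replicon, while the Gene sequence field is shown in coding orientation (it begins with ATG and ends with TAA).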
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.160419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGCA CACATAGGCT CGAAAAACAA ATACTTCCGA TAGCAATATC CGTCTCTTTG GTAAACTGGA CATTAAGGTC AATGTTTGGT CCGTTTGACG TGGGGGTTTT GATTCTCACC GTTGTTTTTA CTTGTGTGAT ATTTGCAGCC TACAATTTTG CACAAAAACA CGAGAGAATA AAAGTGCTTT TGTTTTTCGG GTTTTCTTTT TTGTATCTGA TTGCATGTCA GGTTGCCGTA AGTGCAAGAT ATATGGATTG GGTTGTTTTT TTGGTGGCCG TTTACGGTTT TGCTTCGACG GTTTATTATT TTACGATAAT ACGATACCGG GTTGCGATAG TCTTTTTAAC CGGTCTTATT CCCTTTTTAA CACATTCTGC AAGGACGGCC AAAGGAATAA CGGTACATTT TCTGATATAT CTCTTGCTAT TCTTTCTTCT GTATTTTGAA CGTACAAGAA AGAAAAGAGC GGAAGCGCAG GGGTGTAATA TTTCAATTAA TAAATGGTAT TGTATATCTA TGTCAGTTTT CCTTGCAGTG ATATTTGTCA TTTCGCTGGT TGTTCCAAAG CCCAGCGTGA TTCCAAGGCT TGCTTATGTA AATGCAGTGA TAGAACAGGT TGTGCAACCT CTGGGAGGAG GTGCGGCACA GCAAAACCTG TTTCAGAGTA TAAACAGCAA TCTTTACAAT CCTCTAAGTC TTAAAAAACA GAGTCAACTG GATTCCATGA CCGCTCCGCT GAGTGACAGG ATACTTTTTG AGGTTGCCGC GGAAGAACCT CTGTATCTTA GAATTCAGAG CTGGGATAAA TATGAAAACA ATGTATGGAA AGTTGGAAAT AAAGAGCTGC AGGAATACAA GCCTGTTTCC GGTTTCTATA ATGACGGGAT TAAGTACAAT GTGTTTGTAA ATTTGGTAAA AAAAGCAAAG GAAGAAGGAA TTGTATTGCC GCTGCCGGCA GATGCTTCGG AAATCTGGAA TTATAATTCA ACTCCCCAGG CCAGAAAAAA AGCAACCATA ATAAATGTCA ACAATTTTTC AACCAGATTG ATTCCCGTGC CCATAGGAGT AATTGATGTT TATTCAGAAT ATGAAAATTC CGAAAATATA TATATGAATA AATCGGGAAG TTGTAATATC GGAAAGGATG ATACACCAAA AAACTGGCAA AGCTATGATG TGGAATACAT GACCCAGAGA ATACCTAAAA GTTCCTTTGA GCACAAGCTT ATAGAGGTTT TAGACAGAAC GACGGCGGAG TCTTTGCTTG ACATGAATAC ATATAAAAAG AATAACGGAG AGCCGCTGGA TATAAGCTAT GATAATTTAA ATGTTTTATA TTGGGCTGCC GAAGACTTGA AGAATGTTTA TGAAAACTAT ACCGAACTAC CGCAGAATAT ATCTGACAGA ATTTACAACT TAGCCCAAAG GATAACCGAA GGAAAGGAAA GTCCCTATGA AAAAGCGCTG GCTATAGAAC AGTATTTTCA CAATTCAGGT TATGTGTATG ATTTGGATCC TCCGCGTCTG CCAAGAGGCG CTGAGGCAGT GGATTATTTC CTTTTTGAAA GCAAAAAAGG TTTCTGTATC CACTATGCGT CCGCCATGGT GATACTTGCC AGGGCATGCG GCTTGCCGGC AAGATATTCT GAAGGATATG TGGCGGATGA GTTTGACAGC GGCACCGGGC GGTTTATTGT CAGGGATAAG GATGCCCATG CGTTTCCTGA AGTTTATATA CCGGGATATG GATGGATGGT CTTTGAACCC ACTGTGAGTG TCAGGGAACA GGATGCAATC 
AGCCAATTTT TCAGTAAAGT AAAGTCCGTA CTTACAAGTT TTAGAGAAAC AGTTGTGAAT TTTGTTGAAA TTATGCCGCC CTGGGTTAGA ATAATGTTCA TACCGTTTTT CCTTTTCTCA CTGATGTTCT GGATATGGTT TTTAAGGAAA ATGTATGTTC ACTCCTGGAA GAAAAGGATG CTGAAACTGG AGAGCGACAA AGCAGTTGAC AGAATTCTTT TGAAGATAAT AAAGCTGCTT AATGTTGTAA ATTTAAACAG GAATTTGCAT GAGACTCCTT TGCAGTATGG GCAAAGAATT TACAAGGAAA GCGGAATAGA CATTTTGGGT TTTGTTGAAG TGTTCAATAA GTCAAAATAT GCAAAAATAA AGCCGTCGGT TGAAGATGTA AAGCTGGGGA TATCTCTTTA TGGGGATACT GTGATTTACG TAAAAGGACG TCTGAAATGG TTTAACCTTT TGAAATATTT CTGGTTTGTA TAA
Protein sequence | MESTHRLEKQ ILPIAISVSL VNWTLRSMFG PFDVGVLILT VVFTCVIFAA YNFAQKHERI KVLLFFGFSF LYLIACQVAV SARYMDWVVF LVAVYGFAST VYYFTIIRYR VAIVFLTGLI PFLTHSARTA KGITVHFLIY LLLFFLLYFE RTRKKRAEAQ GCNISINKWY CISMSVFLAV IFVISLVVPK PSVIPRLAYV NAVIEQVVQP LGGGAAQQNL FQSINSNLYN PLSLKKQSQL DSMTAPLSDR ILFEVAAEEP LYLRIQSWDK YENNVWKVGN KELQEYKPVS GFYNDGIKYN VFVNLVKKAK EEGIVLPLPA DASEIWNYNS TPQARKKATI INVNNFSTRL IPVPIGVIDV YSEYENSENI YMNKSGSCNI GKDDTPKNWQ SYDVEYMTQR IPKSSFEHKL IEVLDRTTAE SLLDMNTYKK NNGEPLDISY DNLNVLYWAA EDLKNVYENY TELPQNISDR IYNLAQRITE GKESPYEKAL AIEQYFHNSG YVYDLDPPRL PRGAEAVDYF LFESKKGFCI HYASAMVILA RACGLPARYS EGYVADEFDS GTGRFIVRDK DAHAFPEVYI PGYGWMVFEP TVSVREQDAI SQFFSKVKSV LTSFRETVVN FVEIMPPWVR IMFIPFFLFS LMFWIWFLRK MYVHSWKKRM LKLESDKAVD RILLKIIKLL NVVNLNRNLH ETPLQYGQRI YKESGIDILG FVEVFNKSKY AKIKPSVEDV KLGISLYGDT VIYVKGRLKW FNLLKYFWFV
| |