Gene Information |
Locus tag | Cthe_1056 |
Symbol | |
ID | 4811354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1261445 |
End bp | 1264117 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106478 |
Product | transglutaminase-like protein |
Protein accession | YP_001037481 |
Protein GI | 125973571 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
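The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1261445
end_bp = 1264117

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2673, matching "Gene Length"
print(protein_length)   # 890, matching "Protein Length"
```

A gene length that is not a multiple of three, or a protein length that differs from the codon count minus one, would suggest a frameshift, a partial gene, or a different coordinate convention.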
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGCA AAAAAATTCT CGATTATCTG GCCGGTTTTG TTCTTGCGGC TTTAATGAGT TTCAGCCTGG TTTATCCTCT TACCACCACG CTGGGTTTCC CTTATTCGTC TTTCCGTATT TTAGGGCTTG TAATCTTTAT TCTGTTTATA TATTCGGTGC TGTTTGCAAA CAGAAACGTC TCAAGAATTT CTGTGCCTCT TGCAATAATT TCCATGGCAG CCGGCATTGT CTTTTTAGCG GTTAAAAAAG ATTTGGCATA TATCATCCAA CCTTTTCAGT GGTTTACTAA ATATATTTAT GACGAAGCCA TACTTCCGGA CAATTACTAT CCTCTTTTTA TTACCGTGAT TTTAGCGCTG TCGCTTACAC TGCTTGTATT TATATTTACC ATGAAAAAAT TCAATTTTAT AATCATTCTC TCAAGCGGAA CAGCCATCTT TGTATCCCAA TGGATATTGA ATTTTTTCGT GCCGTCCGCC TATATATCTT TTTACACCTT TGTCATTTCA ATACTGGCAT ACTATCTTGT GCATATATAC AGAAAAAAAA GCCTGGAAAA TTCAAATAAT GATTTTGCCT CCCCTTCGAA GTTCATACTG GGAATCGCTC CCATATGCGT CGTAATTGTC TTTCTGGCAA ATTCAATACC TGTAAGGTCA AAACCTATTG AGTGGAAATG GCTGGATAAC AAAATCAACA GCTTTTACAA TCAGCTCGGA TTTGGCACTC TCGGCAAAGG TTCCGCAGGT TTTGATTCCG ATTACTTTTC TTTTTATTCC ACTGCCGGAT TTGGAAATGA CAGCAATTTA GGCGGAAACA TAACACCCAA CGATATAAAG ATAATGGAAG TTACCACCGA TCGCAGCATC TACCTGAGGG GACGGGCATG TAATTTATAT ACGGGAAATA GTTGGTTGAA TTCACAACCG GACAACATCC CTTTGGACGG CACAAATAAA ATGAGCTTTG ATATTTTAGA AATGAAAACA GGCTTACCCC TCCTTGTCAA CAAACTGAAT CCGGAAAACA ACACATACAA GCTTTCAGAC ACCGATGGAA TCATCCCCAA CATTATTTCA AAACATAATG TCAAGGTAAA ATATGAAAAT GTGAGAACCA AATCCCTTTT TGTACCGCTT AAATCAGAAA ACTTCATTTT TCCGTCCAAA GTAGCTGATG CAGTATTAAT AAACCAGGAT GGAATACTTA CCTCTAATAA ATTCCTAAAG AAAGATTTCA CCTATTCCTT TGAAAGTTAT AGCCTAAATA CTGTCAGCGA AGATTTCAAA AACCTTCTGA GACAAAGTAT GCGCGGCCTG TACAGCGTTG AACTGAACAG ACTTACAGAT GAAATCTATG AACATTTGTA CAGCGAATAT TTAAAGGATC TGTTGGATGA AGCAGAAAGG ATATACAATT CCGATCTTTT TATATTAAAT AGATATTCAG TTCAATTTCC AATAAAAAAT ATAATAAGAA ATACTCCGGA TAATATAACT TTAAGCGACC TGCTATATGA ATACCTCGTT AATGTTTTCA TAGATGAATT TAATCTCGAC AGAGTTTTCG AGCGCGAAGA TTTGGAAAGA TATTTCCAAA TGATCATTAA CGAGCTTAAC AATTCGGAGG ACATAAAGAA GCTTGCGCAA CTTAGGTCTT TAAGCTCCAA TTCAGTCTAT ATTTACAACA CTTATCTAAA CATACCCTCC GAGCTTCCTC AGAGGGTAAA AGACCTTGCC GTTTCAATAA CAGCCAATGA AACCAATAAT TTCGACCGGG CAAAAGCAAT TGAAAAATAT 
CTTTCTGCAA ATTACGGCTA CACACTGACT CCGGGAGATA CCCCGCCTGA CAGGGATTTT GTGGACTACT TCCTTTTTGA ACAAAAGGAA GGCTATTGTG TGTATTTTGC CTCTGCCATG GTAATCCTTG CACGCAGTAT AGGGCTTCCG GCCCGTTATG TTGAAGGGTT TGTTCTTCCC GTGAGATCAA AAGACGGTGT CTATGAGGTT ACAAACAAAC AGGCCCACGC CTGGCCGGAG ATCTACTTTG AGGGATTCGG CTGGGTCTCT TTCGAACCGA CTCCCGTCTA TCAGCAAAAC TCGTTTTACA GTTCCGGCAG CTTTAGGCCC AACATGAGCG GAATGCTTCC CCAAACCAAC GGCACAGGGC TTCAAAACCA AAACAATGAT GAAGGCAACA AGCCCGACAT GGCTCCCCAA CCTGTCCAAA ACCAAAATCC GTTTATCAAT ATCCTTTTGA TTACGGCCGG GATACTTGCA GGTTTGGTTT CTTTTGTGTT GATTATCGTG GGTATCAATA AAATCAGAAA AAAACATTGG CTTAAATCCA TTTTAAACAT GTCGCCCAAA GAAGCGGTAA TAAAGTTATA CGAAACATAC CTTAATCACC TTTTATATCA ATATATGCCT GTGCGTCCCG CGGAAACCCC TCTTGAATAT GCAAAGAGGC TCGACGATTA TGGATATTTT GCTCCCAGGA AGTTTACTGA TGTTGCATCC ATATTCGTAA AAGCGAGATA CAGTCAAAAC GAGGTGACCG AGGCAGACAG GGCAAGTGCT TTGGAATTTT ACAAACCCAT AGTTTTAAAA ACACGAAGCT CCATGGGACG CCTAAAGTAT TTCTTCCTGG CGCATATACT TGGAAAGATA TAA
|
Protein sequence | MDGKKILDYL AGFVLAALMS FSLVYPLTTT LGFPYSSFRI LGLVIFILFI YSVLFANRNV SRISVPLAII SMAAGIVFLA VKKDLAYIIQ PFQWFTKYIY DEAILPDNYY PLFITVILAL SLTLLVFIFT MKKFNFIIIL SSGTAIFVSQ WILNFFVPSA YISFYTFVIS ILAYYLVHIY RKKSLENSNN DFASPSKFIL GIAPICVVIV FLANSIPVRS KPIEWKWLDN KINSFYNQLG FGTLGKGSAG FDSDYFSFYS TAGFGNDSNL GGNITPNDIK IMEVTTDRSI YLRGRACNLY TGNSWLNSQP DNIPLDGTNK MSFDILEMKT GLPLLVNKLN PENNTYKLSD TDGIIPNIIS KHNVKVKYEN VRTKSLFVPL KSENFIFPSK VADAVLINQD GILTSNKFLK KDFTYSFESY SLNTVSEDFK NLLRQSMRGL YSVELNRLTD EIYEHLYSEY LKDLLDEAER IYNSDLFILN RYSVQFPIKN IIRNTPDNIT LSDLLYEYLV NVFIDEFNLD RVFEREDLER YFQMIINELN NSEDIKKLAQ LRSLSSNSVY IYNTYLNIPS ELPQRVKDLA VSITANETNN FDRAKAIEKY LSANYGYTLT PGDTPPDRDF VDYFLFEQKE GYCVYFASAM VILARSIGLP ARYVEGFVLP VRSKDGVYEV TNKQAHAWPE IYFEGFGWVS FEPTPVYQQN SFYSSGSFRP NMSGMLPQTN GTGLQNQNND EGNKPDMAPQ PVQNQNPFIN ILLITAGILA GLVSFVLIIV GINKIRKKHW LKSILNMSPK EAVIKLYETY LNHLLYQYMP VRPAETPLEY AKRLDDYGYF APRKFTDVAS IFVKARYSQN EVTEADRASA LEFYKPIVLK TRSSMGRLKY FFLAHILGKI
|
| |
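The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned and translated is shown below. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). The helper names `clean_sequence` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino-acid assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record display."""
    return "".join(raw.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence from this record.
print(translate("ATGGACGGCA AAAAAATTCT CGATTATCTG"))  # MDGKKILDYL
```

The printed peptide matches the first block of the protein sequence in this record. Running the same function over the full gene sequence and comparing the result with the listed protein would be a reasonable consistency check.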