Gene Cthe_1056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1056 
Symbol 
ID4811354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1261445 
End bp1264117 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content39% 
IMG OID640106478 
Producttransglutaminase-like protein 
Protein accessionYP_001037481 
Protein GI125973571 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGCA AAAAAATTCT CGATTATCTG GCCGGTTTTG TTCTTGCGGC TTTAATGAGT 
TTCAGCCTGG TTTATCCTCT TACCACCACG CTGGGTTTCC CTTATTCGTC TTTCCGTATT
TTAGGGCTTG TAATCTTTAT TCTGTTTATA TATTCGGTGC TGTTTGCAAA CAGAAACGTC
TCAAGAATTT CTGTGCCTCT TGCAATAATT TCCATGGCAG CCGGCATTGT CTTTTTAGCG
GTTAAAAAAG ATTTGGCATA TATCATCCAA CCTTTTCAGT GGTTTACTAA ATATATTTAT
GACGAAGCCA TACTTCCGGA CAATTACTAT CCTCTTTTTA TTACCGTGAT TTTAGCGCTG
TCGCTTACAC TGCTTGTATT TATATTTACC ATGAAAAAAT TCAATTTTAT AATCATTCTC
TCAAGCGGAA CAGCCATCTT TGTATCCCAA TGGATATTGA ATTTTTTCGT GCCGTCCGCC
TATATATCTT TTTACACCTT TGTCATTTCA ATACTGGCAT ACTATCTTGT GCATATATAC
AGAAAAAAAA GCCTGGAAAA TTCAAATAAT GATTTTGCCT CCCCTTCGAA GTTCATACTG
GGAATCGCTC CCATATGCGT CGTAATTGTC TTTCTGGCAA ATTCAATACC TGTAAGGTCA
AAACCTATTG AGTGGAAATG GCTGGATAAC AAAATCAACA GCTTTTACAA TCAGCTCGGA
TTTGGCACTC TCGGCAAAGG TTCCGCAGGT TTTGATTCCG ATTACTTTTC TTTTTATTCC
ACTGCCGGAT TTGGAAATGA CAGCAATTTA GGCGGAAACA TAACACCCAA CGATATAAAG
ATAATGGAAG TTACCACCGA TCGCAGCATC TACCTGAGGG GACGGGCATG TAATTTATAT
ACGGGAAATA GTTGGTTGAA TTCACAACCG GACAACATCC CTTTGGACGG CACAAATAAA
ATGAGCTTTG ATATTTTAGA AATGAAAACA GGCTTACCCC TCCTTGTCAA CAAACTGAAT
CCGGAAAACA ACACATACAA GCTTTCAGAC ACCGATGGAA TCATCCCCAA CATTATTTCA
AAACATAATG TCAAGGTAAA ATATGAAAAT GTGAGAACCA AATCCCTTTT TGTACCGCTT
AAATCAGAAA ACTTCATTTT TCCGTCCAAA GTAGCTGATG CAGTATTAAT AAACCAGGAT
GGAATACTTA CCTCTAATAA ATTCCTAAAG AAAGATTTCA CCTATTCCTT TGAAAGTTAT
AGCCTAAATA CTGTCAGCGA AGATTTCAAA AACCTTCTGA GACAAAGTAT GCGCGGCCTG
TACAGCGTTG AACTGAACAG ACTTACAGAT GAAATCTATG AACATTTGTA CAGCGAATAT
TTAAAGGATC TGTTGGATGA AGCAGAAAGG ATATACAATT CCGATCTTTT TATATTAAAT
AGATATTCAG TTCAATTTCC AATAAAAAAT ATAATAAGAA ATACTCCGGA TAATATAACT
TTAAGCGACC TGCTATATGA ATACCTCGTT AATGTTTTCA TAGATGAATT TAATCTCGAC
AGAGTTTTCG AGCGCGAAGA TTTGGAAAGA TATTTCCAAA TGATCATTAA CGAGCTTAAC
AATTCGGAGG ACATAAAGAA GCTTGCGCAA CTTAGGTCTT TAAGCTCCAA TTCAGTCTAT
ATTTACAACA CTTATCTAAA CATACCCTCC GAGCTTCCTC AGAGGGTAAA AGACCTTGCC
GTTTCAATAA CAGCCAATGA AACCAATAAT TTCGACCGGG CAAAAGCAAT TGAAAAATAT
CTTTCTGCAA ATTACGGCTA CACACTGACT CCGGGAGATA CCCCGCCTGA CAGGGATTTT
GTGGACTACT TCCTTTTTGA ACAAAAGGAA GGCTATTGTG TGTATTTTGC CTCTGCCATG
GTAATCCTTG CACGCAGTAT AGGGCTTCCG GCCCGTTATG TTGAAGGGTT TGTTCTTCCC
GTGAGATCAA AAGACGGTGT CTATGAGGTT ACAAACAAAC AGGCCCACGC CTGGCCGGAG
ATCTACTTTG AGGGATTCGG CTGGGTCTCT TTCGAACCGA CTCCCGTCTA TCAGCAAAAC
TCGTTTTACA GTTCCGGCAG CTTTAGGCCC AACATGAGCG GAATGCTTCC CCAAACCAAC
GGCACAGGGC TTCAAAACCA AAACAATGAT GAAGGCAACA AGCCCGACAT GGCTCCCCAA
CCTGTCCAAA ACCAAAATCC GTTTATCAAT ATCCTTTTGA TTACGGCCGG GATACTTGCA
GGTTTGGTTT CTTTTGTGTT GATTATCGTG GGTATCAATA AAATCAGAAA AAAACATTGG
CTTAAATCCA TTTTAAACAT GTCGCCCAAA GAAGCGGTAA TAAAGTTATA CGAAACATAC
CTTAATCACC TTTTATATCA ATATATGCCT GTGCGTCCCG CGGAAACCCC TCTTGAATAT
GCAAAGAGGC TCGACGATTA TGGATATTTT GCTCCCAGGA AGTTTACTGA TGTTGCATCC
ATATTCGTAA AAGCGAGATA CAGTCAAAAC GAGGTGACCG AGGCAGACAG GGCAAGTGCT
TTGGAATTTT ACAAACCCAT AGTTTTAAAA ACACGAAGCT CCATGGGACG CCTAAAGTAT
TTCTTCCTGG CGCATATACT TGGAAAGATA TAA
 
Protein sequence
MDGKKILDYL AGFVLAALMS FSLVYPLTTT LGFPYSSFRI LGLVIFILFI YSVLFANRNV 
SRISVPLAII SMAAGIVFLA VKKDLAYIIQ PFQWFTKYIY DEAILPDNYY PLFITVILAL
SLTLLVFIFT MKKFNFIIIL SSGTAIFVSQ WILNFFVPSA YISFYTFVIS ILAYYLVHIY
RKKSLENSNN DFASPSKFIL GIAPICVVIV FLANSIPVRS KPIEWKWLDN KINSFYNQLG
FGTLGKGSAG FDSDYFSFYS TAGFGNDSNL GGNITPNDIK IMEVTTDRSI YLRGRACNLY
TGNSWLNSQP DNIPLDGTNK MSFDILEMKT GLPLLVNKLN PENNTYKLSD TDGIIPNIIS
KHNVKVKYEN VRTKSLFVPL KSENFIFPSK VADAVLINQD GILTSNKFLK KDFTYSFESY
SLNTVSEDFK NLLRQSMRGL YSVELNRLTD EIYEHLYSEY LKDLLDEAER IYNSDLFILN
RYSVQFPIKN IIRNTPDNIT LSDLLYEYLV NVFIDEFNLD RVFEREDLER YFQMIINELN
NSEDIKKLAQ LRSLSSNSVY IYNTYLNIPS ELPQRVKDLA VSITANETNN FDRAKAIEKY
LSANYGYTLT PGDTPPDRDF VDYFLFEQKE GYCVYFASAM VILARSIGLP ARYVEGFVLP
VRSKDGVYEV TNKQAHAWPE IYFEGFGWVS FEPTPVYQQN SFYSSGSFRP NMSGMLPQTN
GTGLQNQNND EGNKPDMAPQ PVQNQNPFIN ILLITAGILA GLVSFVLIIV GINKIRKKHW
LKSILNMSPK EAVIKLYETY LNHLLYQYMP VRPAETPLEY AKRLDDYGYF APRKFTDVAS
IFVKARYSQN EVTEADRASA LEFYKPIVLK TRSSMGRLKY FFLAHILGKI