Gene Cthe_1181 details

Gene Information

Locus tag: Cthe_1181
Symbol:
ID: 4810133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1408476
End bp: 1410758
Gene Length: 2283 bp
Protein Length: 760 aa
Translation table: 11
GC content: 39%
IMG OID: 640106603
Product: transglutaminase-like protein
Protein accession: YP_001037606
Protein GI: 125973696
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
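
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1408476, 1410758)
print(length)                     # 2283
print(protein_length_aa(length))  # 760
```

Both results agree with the Gene Length and Protein Length fields of this record.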


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.160419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAGCA CACATAGGCT CGAAAAACAA ATACTTCCGA TAGCAATATC CGTCTCTTTG 
GTAAACTGGA CATTAAGGTC AATGTTTGGT CCGTTTGACG TGGGGGTTTT GATTCTCACC
GTTGTTTTTA CTTGTGTGAT ATTTGCAGCC TACAATTTTG CACAAAAACA CGAGAGAATA
AAAGTGCTTT TGTTTTTCGG GTTTTCTTTT TTGTATCTGA TTGCATGTCA GGTTGCCGTA
AGTGCAAGAT ATATGGATTG GGTTGTTTTT TTGGTGGCCG TTTACGGTTT TGCTTCGACG
GTTTATTATT TTACGATAAT ACGATACCGG GTTGCGATAG TCTTTTTAAC CGGTCTTATT
CCCTTTTTAA CACATTCTGC AAGGACGGCC AAAGGAATAA CGGTACATTT TCTGATATAT
CTCTTGCTAT TCTTTCTTCT GTATTTTGAA CGTACAAGAA AGAAAAGAGC GGAAGCGCAG
GGGTGTAATA TTTCAATTAA TAAATGGTAT TGTATATCTA TGTCAGTTTT CCTTGCAGTG
ATATTTGTCA TTTCGCTGGT TGTTCCAAAG CCCAGCGTGA TTCCAAGGCT TGCTTATGTA
AATGCAGTGA TAGAACAGGT TGTGCAACCT CTGGGAGGAG GTGCGGCACA GCAAAACCTG
TTTCAGAGTA TAAACAGCAA TCTTTACAAT CCTCTAAGTC TTAAAAAACA GAGTCAACTG
GATTCCATGA CCGCTCCGCT GAGTGACAGG ATACTTTTTG AGGTTGCCGC GGAAGAACCT
CTGTATCTTA GAATTCAGAG CTGGGATAAA TATGAAAACA ATGTATGGAA AGTTGGAAAT
AAAGAGCTGC AGGAATACAA GCCTGTTTCC GGTTTCTATA ATGACGGGAT TAAGTACAAT
GTGTTTGTAA ATTTGGTAAA AAAAGCAAAG GAAGAAGGAA TTGTATTGCC GCTGCCGGCA
GATGCTTCGG AAATCTGGAA TTATAATTCA ACTCCCCAGG CCAGAAAAAA AGCAACCATA
ATAAATGTCA ACAATTTTTC AACCAGATTG ATTCCCGTGC CCATAGGAGT AATTGATGTT
TATTCAGAAT ATGAAAATTC CGAAAATATA TATATGAATA AATCGGGAAG TTGTAATATC
GGAAAGGATG ATACACCAAA AAACTGGCAA AGCTATGATG TGGAATACAT GACCCAGAGA
ATACCTAAAA GTTCCTTTGA GCACAAGCTT ATAGAGGTTT TAGACAGAAC GACGGCGGAG
TCTTTGCTTG ACATGAATAC ATATAAAAAG AATAACGGAG AGCCGCTGGA TATAAGCTAT
GATAATTTAA ATGTTTTATA TTGGGCTGCC GAAGACTTGA AGAATGTTTA TGAAAACTAT
ACCGAACTAC CGCAGAATAT ATCTGACAGA ATTTACAACT TAGCCCAAAG GATAACCGAA
GGAAAGGAAA GTCCCTATGA AAAAGCGCTG GCTATAGAAC AGTATTTTCA CAATTCAGGT
TATGTGTATG ATTTGGATCC TCCGCGTCTG CCAAGAGGCG CTGAGGCAGT GGATTATTTC
CTTTTTGAAA GCAAAAAAGG TTTCTGTATC CACTATGCGT CCGCCATGGT GATACTTGCC
AGGGCATGCG GCTTGCCGGC AAGATATTCT GAAGGATATG TGGCGGATGA GTTTGACAGC
GGCACCGGGC GGTTTATTGT CAGGGATAAG GATGCCCATG CGTTTCCTGA AGTTTATATA
CCGGGATATG GATGGATGGT CTTTGAACCC ACTGTGAGTG TCAGGGAACA GGATGCAATC
AGCCAATTTT TCAGTAAAGT AAAGTCCGTA CTTACAAGTT TTAGAGAAAC AGTTGTGAAT
TTTGTTGAAA TTATGCCGCC CTGGGTTAGA ATAATGTTCA TACCGTTTTT CCTTTTCTCA
CTGATGTTCT GGATATGGTT TTTAAGGAAA ATGTATGTTC ACTCCTGGAA GAAAAGGATG
CTGAAACTGG AGAGCGACAA AGCAGTTGAC AGAATTCTTT TGAAGATAAT AAAGCTGCTT
AATGTTGTAA ATTTAAACAG GAATTTGCAT GAGACTCCTT TGCAGTATGG GCAAAGAATT
TACAAGGAAA GCGGAATAGA CATTTTGGGT TTTGTTGAAG TGTTCAATAA GTCAAAATAT
GCAAAAATAA AGCCGTCGGT TGAAGATGTA AAGCTGGGGA TATCTCTTTA TGGGGATACT
GTGATTTACG TAAAAGGACG TCTGAAATGG TTTAACCTTT TGAAATATTT CTGGTTTGTA
TAA
 
Protein sequence
MESTHRLEKQ ILPIAISVSL VNWTLRSMFG PFDVGVLILT VVFTCVIFAA YNFAQKHERI 
KVLLFFGFSF LYLIACQVAV SARYMDWVVF LVAVYGFAST VYYFTIIRYR VAIVFLTGLI
PFLTHSARTA KGITVHFLIY LLLFFLLYFE RTRKKRAEAQ GCNISINKWY CISMSVFLAV
IFVISLVVPK PSVIPRLAYV NAVIEQVVQP LGGGAAQQNL FQSINSNLYN PLSLKKQSQL
DSMTAPLSDR ILFEVAAEEP LYLRIQSWDK YENNVWKVGN KELQEYKPVS GFYNDGIKYN
VFVNLVKKAK EEGIVLPLPA DASEIWNYNS TPQARKKATI INVNNFSTRL IPVPIGVIDV
YSEYENSENI YMNKSGSCNI GKDDTPKNWQ SYDVEYMTQR IPKSSFEHKL IEVLDRTTAE
SLLDMNTYKK NNGEPLDISY DNLNVLYWAA EDLKNVYENY TELPQNISDR IYNLAQRITE
GKESPYEKAL AIEQYFHNSG YVYDLDPPRL PRGAEAVDYF LFESKKGFCI HYASAMVILA
RACGLPARYS EGYVADEFDS GTGRFIVRDK DAHAFPEVYI PGYGWMVFEP TVSVREQDAI
SQFFSKVKSV LTSFRETVVN FVEIMPPWVR IMFIPFFLFS LMFWIWFLRK MYVHSWKKRM
LKLESDKAVD RILLKIIKLL NVVNLNRNLH ETPLQYGQRI YKESGIDILG FVEVFNKSKY
AKIKPSVEDV KLGISLYGDT VIYVKGRLKW FNLLKYFWFV
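
The sequences above are printed in space-separated blocks of ten, so they need to be joined before use. The sketch below is one hedged way to do that and to recompute the GC content and translation with the standard library only. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one: translation table 11 uses the same codon assignments and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, as printed.
first_line = "ATGGAGAGCA CACATAGGCT CGAAAAACAA ATACTTCCGA TAGCAATATC CGTCTCTTTG"
print(translate(clean(first_line)))  # MESTHRLEKQILPIAISVSL
```

Pasting the full gene sequence into `clean` and passing the result to `gc_percent` and `translate` gives values that can be compared with the GC content field and the protein sequence listed in this record.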