Gene Cthe_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0251 
Symbol 
ID4808599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp307053 
End bp309608 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content41% 
IMG OID640105663 
Producttransglutaminase-like protein 
Protein accessionYP_001036683 
Protein GI125972773 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0634947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGAT TGAGTGTGAA TGATAGAAAA AATTTGATAA ATCTTGCTGT TTTCCTTTTG 
GTTGCTTTGT TATCGACGCT TCTTATTATG GCCGCTATAA GTATGTTGGA TTTGCGTGGG
AACGGGTCGC CTGGAGGTGA AAATAATAAG AGGGCTTCCC AAAATGACGG AAGCGGCAAA
AATCGCAATA AGCAGGACCG AAATAATAAC AATAAAAACA AAAAAGGAAG CGGCCAAAGG
GAGGATGACA ACGGTTCCAA CTGGAAGGAA AGGGCTCAAA GAAGGCGTGG CGAGAGAAGA
AGTTTTGACA GGGGAGATTA TGCTTATGGG CAAGGACCCG AAAGCATGAT TCCCGGAGAC
GGATGGGAAT ATGATATTGC GCCTGATCAA ATGCCCAATC TTGACATATC CGGCAATGAA
GGACTGCCCG ACATGTCTTT TGGAATGGGT GGCAATACGT TTGTTTTAAA AGAGGGCAAA
AATTCCCACA TACCTGCTTT TGAAGTTTTG GGTGTTCCCA ATTACCCTTT TGTAAGAGTT
ATGGCTATGG ACAACTATCG TCGGAGTCAC TGGAGTATGA TAAACGAAGC TCCGGAGCTG
ATGTTTTTGT TTGGGGAGAA AGTGGACAGA AAATTTTCCG AAAATACTGT CAAGATAAAG
CCTGTGGAGC CTTCAAAGGG GTATATTCCG GTATTGTCGG GCAACTTTGA AATGAAATAT
GAGTTTAGCC TTTTGGAATA TAAACAATCC GGAGTCTTTT ATTCCACCGG AGTTATTGAA
AATTTTTACG AAATGAAGTA CGAGGATCCT CCGACGGAAG CGGAATTGAT TAATGCAAAA
ACCGATGATG ATTACTATTA TGATATTTAT GTGCCTGAAG TGGTCGAGAG AATAGTTGAT
GAAGTGATTG AAAATTGCGA AACCGACTAT GAGGCAATAA AATTTGTTGA GAAATTCCTT
TTGGAAAACT ACACTTATGA TAACACTGTG CTGAACAATT ACGGAAGCGG CGATGCGGTG
GTTTCATTCC TTACAGGAAA AGACAGGGTG GGAAACCATC TGGATTTTGT ATCTGCCTAT
GCCATTATTT TAAGGGCTGC GGGCATTCCA TGCAGACTTG CGCTGGGCTA TAAACTTTTG
CCCGGTGTCA AATACCAGGT GGTGTATGCA GATCAGGTCT ATATTTATCC CGAGATAAAG
TTTGAGGATT ATGGATGGGT TCCGATGGAC GTTTTCCCCT ATGATGTTTT TTACCGTCCG
CCGAAAGAAA CAATAACCCA AATTACTTTT GCCGACGGTA CGACAAAAAG GGGAGAAACC
GTTACGGTAA GAGGAACTGT TACGGATTCA TCAGGCAATC CCATAGACAA CATGACGGTT
CTTGTCTATT TGAAACAATA CAAGAGCGAG CCGTGTATTT CTTATGCGAA AGCCTATGTC
ACAAACGGCA ACTTTGAAGC CGTGTTCGAT ATAAAAGGGG ATATAAGTGC AGGAAAGTAT
CATATTATTG CGGACGTTCT GGAAAATGAT GTTTACAGGA CATCCTCAAG CGACCCCGAA
CTTAAAATTC TTGCGGATAC TTTTATTGAT CTGGAGGAAC GGAGTGATAT AATCGGCAAC
AAACTGAATT TTGCAGGAAG GATTGTTGAC TTTTTTACCT ATGAAGGAAT AGAAGGCCTT
GAGGTCCATG TATCCTTTGA AGGAATGGAT TTGGTTGAGA CGGTTGTGTC CGAAGAAGAC
GGAAAACTTT ACAAAGAGAT TGAAATTGAA GTTCCTGAGG ATTATCCATA CTATAAAAAC
TTTTTTTTTG CAGGAAGATA TTTGTTGTTT TACGGTATTG AATTTAAGGG AACTGAGATA
TATACTCCCT ATTTCACAAG AAGGGGAGTG TATATGTGGA AGATATACTG GATAAACGTT
ACTGTTGCGG TTGTTTTGCT TTTGGGCGTT GTACTTCTTT GTGTTGCAAT TGTGCTGAGG
AAAAAAGGAG CATTCCGGAG GGACGGCGGA AAATTTCCCG TTTTGGCCGC GGAAGGTCCG
GGAATGATTG TGGCAGCGGA TGCGGGAGCG GAATCGGTGG AAAGAAACCA TGGCAAAGTT
TATATAGAGT TTCCCCAAAT TGGGGAGGGT TTACCCGATG TATGGGGAAT AAAAGAAAAT
TTGGCCGTTG TTTTTCATGA CGACGAAGGC AACAGGGGAG AGATTGGGGC GGTATTCCAT
AAAAAAGGGG AGTACAGAAT CAAAATTTCC GGGAAAAACG ACGAATACGG CGCAAGAAAC
ATACGGATAG TTGACTACAG GGAAGAAATA ATTGCAATCG GAAAGAATTT TTTAAAAGAA
ATGTCCGCAA AGATTTCAGG AATTACCGAT TTCATGACTC TAAGGGAAAT ACATGATATA
ATTAAGCCGA ACATTGCTTC CGAAAGGCAT TGGGTTTTGG AAGATGCTTT CATGGTGTTT
GAAAAGGCTG TGTACAGTGA TGAGGATATT GTAAGAAGTG ATTATGAAAA GTTTTATGTT
TTTGCCAGAG AACTTGGAAA AAACAGCTCA ATTTAA
 
Protein sequence
MERLSVNDRK NLINLAVFLL VALLSTLLIM AAISMLDLRG NGSPGGENNK RASQNDGSGK 
NRNKQDRNNN NKNKKGSGQR EDDNGSNWKE RAQRRRGERR SFDRGDYAYG QGPESMIPGD
GWEYDIAPDQ MPNLDISGNE GLPDMSFGMG GNTFVLKEGK NSHIPAFEVL GVPNYPFVRV
MAMDNYRRSH WSMINEAPEL MFLFGEKVDR KFSENTVKIK PVEPSKGYIP VLSGNFEMKY
EFSLLEYKQS GVFYSTGVIE NFYEMKYEDP PTEAELINAK TDDDYYYDIY VPEVVERIVD
EVIENCETDY EAIKFVEKFL LENYTYDNTV LNNYGSGDAV VSFLTGKDRV GNHLDFVSAY
AIILRAAGIP CRLALGYKLL PGVKYQVVYA DQVYIYPEIK FEDYGWVPMD VFPYDVFYRP
PKETITQITF ADGTTKRGET VTVRGTVTDS SGNPIDNMTV LVYLKQYKSE PCISYAKAYV
TNGNFEAVFD IKGDISAGKY HIIADVLEND VYRTSSSDPE LKILADTFID LEERSDIIGN
KLNFAGRIVD FFTYEGIEGL EVHVSFEGMD LVETVVSEED GKLYKEIEIE VPEDYPYYKN
FFFAGRYLLF YGIEFKGTEI YTPYFTRRGV YMWKIYWINV TVAVVLLLGV VLLCVAIVLR
KKGAFRRDGG KFPVLAAEGP GMIVAADAGA ESVERNHGKV YIEFPQIGEG LPDVWGIKEN
LAVVFHDDEG NRGEIGAVFH KKGEYRIKIS GKNDEYGARN IRIVDYREEI IAIGKNFLKE
MSAKISGITD FMTLREIHDI IKPNIASERH WVLEDAFMVF EKAVYSDEDI VRSDYEKFYV
FARELGKNSS I