Gene Cthe_0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0071 
Symbol 
ID4808766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp102852 
End bp105668 
Gene Length2817 bp 
Protein Length938 aa 
Translation table11 
GC content46% 
IMG OID640105480 
Productcellulose 1,4-beta-cellobiosidase 
Protein accessionYP_001036505 
Protein GI125972595 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAA GATACAAGGT AAGGCGAAGA AGAAGGATTA TTACATGCTG CGGCATAATT 
GCCGCCGTTA TCGTTGTTTC AACTCTGATA ATCACAATCA AAAACAGCTT TAAACCTTCC
AGACAATCAT CCAATTCAAA TGTTACATAT TCCAAAGAAA ATGTTCAATC AGTCTATTCC
GACAGGTTTA TAGCTCTTTT TGAAGACATC CAGAAACAAG GTTATCTCAG CGAAGAAGGA
ATTCCGTACC ATTCCATAGA AACACTTTTG GTGGAAGCTC CGGACTATGG ACATCTCACT
ACCAGTGAAG CCATGAGCTA CATGGTCTGG CTCGGAGCAA CCTACGGGAA ACTTACCGGT
GACTGGACAT ATTTTAAAGA TGCATGGGAC AAGACAGAAC AATACATAAT ACCCGATCCC
GAAAGAGACC AACCCGGCGT TAATTCATAC ATTCCGACCC AGCCTGCCCA ATATGCGCCG
GAAGCCGACT CTCCCGAAAA ATATCCAACA CCAGGCGATA TCAATGCACC CACAGGCATA
GATCCCATTG CAGACGAACT TGCCTCAACT TACGGTACAA AAGCCATATA TCAAATGCAC
TGGCTCCTGG ACGTTGACAA CTGGTACGGG TACGGAAATC ACGGGGACGG TACAAGCCGC
TGTTCTTATA TAAACACTTA TCAGAGAGGC TCCGGTGAAT CCGTATGGGA AACAATACCC
CACCCCTCGT GGGAAGACTT TAGATGGGGA CAAGTCAACA ACGGAGGATT TCTAAAACTT
TTTGGAAACT TCGGCGAGCC TGTAAGGCAG TGGCGTTACA CCTCAGCCTC CGATGCCGAT
GCAAGACAGA TTCAGGCAAC TTACTGGGCT TACCTTTGGT CCAAAGAGCA AGGCAAAGAA
AAGGAACTTC AGCCCTATTT TGAAAAAGCG GCAAAAATGG GTGATTATCT AAGGTATACT
TTCTTTGACA AATATTTCAG ACCTATAGGG GTGCAAGACA GCGGAAGAGC CGGCACAGGA
TATGACAGCT GCCATTATCT TTTATCCTGG TATGCCTCAT GGGGAGGCGA TATCAACGGA
ACCTGGAGCT GGAGAATCGG AAGCTCCCAT TGCCATCAGG GCTACCAAAA TCCGATGGCC
GCCTACGCAT TGGCAAAGGA ATCCATTTTC ACTCCAAAAT CCAAAAACGC TAAAAAGGAT
TGGGAGCAAA GTCTTGACCG TCAGATTGAA CTGTTTTTGT ACCTTCAAAG TGCCGAAGGT
GCCATAGCCG GCGGTGTAAC CAACAGCTGG TCAGGTGCGT ACGGCAAATA TCCTGAAGGC
ACAAGCACAT TCTACGATAT GGCTTATGAT CCGCATCCTG TTTACAATGA TCCACCGAGC
AACAGATGGT TTGGATTTCA GGCATGGTCA ATGGAAAGGA TTATGGAATA CTATTATCTT
ACAGGAGACA GCCGGGTTAA GGAACTCTGC AAAAAATGGG TCTCATGGGC AATAGAAAAC
ACCCGTTTAA AAAGTGACGG AACCTACGAG ATTCCTTCCA CCCTGGAATG GTCCGGCCAG
CCCGACCCAT GGACAGGAAA ACCCAGTGAA AATAAAAACC TTCACTGTAC CGTCACAGAG
TGGACAGTGG ACGTGGGAGT AACCGCTTCA TATGCAAAAG CGCTTATTTA CTATGCCGCC
GCCACGGAAA AACATGAAAA GAAAATTGAT GACAAAGCCC GTGAAACAGC CAAACAGCTT
CTCGACAGAA TGTGGCACAA TTACAGGGAT AAAAAGGGTG TGGCCGCAAA GGAACCAAGG
GCTGATTACA AACGTTTCTT TGATGAAGTT TACATCCCTC ATGATTTTTC CGGCATCAAT
GCCCAGGGCG CCGAAATAAA GAACGGAATA ACCTTTATAG ATCTTCGTCC AAAATACAAA
GAAGACAAAG ATTATAAAAT GGTGGAAGAA GCCATAAAAA GCGGCAAAGA TCCGGTAATG
ACCTATCATC GTTACTGGGC ACAGGCCGAA GTGGCAATGG CAAACGCCAT GTATCACATT
TTCTTTGAGC AGAAAAAAGA CGGACTGGTA CCGGGTATAA ACTCTGATGA CAGCAATTCA
TCCTCTGAAA CTCAAAAATC CGAAACACCG GATTATACCC CGGACAGTTT GTCAACCTCC
GATGCAACAA TAACAGAAAG TCCGGCAAAC ACAAACAGCC CGGATGAAAA TCCATCACCG
CAGAACACCT CAGCACCAAC CAACGTACCG CTTAATCCCG CCAACACTCC ATATAATTCA
TCCAACGCAG CACCCAACTC ACCCAATACA CCAGCCAGAC CGTCCAACAC AACAGCCGGA
TCACCCGCAG GTAATAACAC CGTAGGCAGG TTAATACTGC AATATGCCAA CGGCAACGGT
TCAGATACTA CAAATACTAT AAATCCAAGG TTTAAACTCA TCAACAACTC CGGTTCACCT
GTTAAACTGT CAGATGTCAA GATAAGATAC TATTACACCA TTGATGGGGA AAAAGGTCAG
CAATTCTGGT GTGACTGGAG CTCTGCAGGA AACTCCAACG TAACCGGAAA ATTTGTAAAA
CTTTCCTCTC CCAAAAACAA TGCCGACTAT TACCTGGAAA TAGGCTTTAC CGAAGGAGCA
GGAAGCATAG AGCCCGGCAT GAGCGTGGAA GTGCAGGCAA GATTTTCAAA GGATGACTGG
TCCAATTACA GCCAGGCAAA CGATTACTCC TTCTCTGCAT CCGCCAATGA CTACGGAAAT
TCAAACCACA TAGCCCTCTA TATATCCGGA AGGCTTGTAT CGGGAAACGA ACCGTAA
 
Protein sequence
MKLRYKVRRR RRIITCCGII AAVIVVSTLI ITIKNSFKPS RQSSNSNVTY SKENVQSVYS 
DRFIALFEDI QKQGYLSEEG IPYHSIETLL VEAPDYGHLT TSEAMSYMVW LGATYGKLTG
DWTYFKDAWD KTEQYIIPDP ERDQPGVNSY IPTQPAQYAP EADSPEKYPT PGDINAPTGI
DPIADELAST YGTKAIYQMH WLLDVDNWYG YGNHGDGTSR CSYINTYQRG SGESVWETIP
HPSWEDFRWG QVNNGGFLKL FGNFGEPVRQ WRYTSASDAD ARQIQATYWA YLWSKEQGKE
KELQPYFEKA AKMGDYLRYT FFDKYFRPIG VQDSGRAGTG YDSCHYLLSW YASWGGDING
TWSWRIGSSH CHQGYQNPMA AYALAKESIF TPKSKNAKKD WEQSLDRQIE LFLYLQSAEG
AIAGGVTNSW SGAYGKYPEG TSTFYDMAYD PHPVYNDPPS NRWFGFQAWS MERIMEYYYL
TGDSRVKELC KKWVSWAIEN TRLKSDGTYE IPSTLEWSGQ PDPWTGKPSE NKNLHCTVTE
WTVDVGVTAS YAKALIYYAA ATEKHEKKID DKARETAKQL LDRMWHNYRD KKGVAAKEPR
ADYKRFFDEV YIPHDFSGIN AQGAEIKNGI TFIDLRPKYK EDKDYKMVEE AIKSGKDPVM
TYHRYWAQAE VAMANAMYHI FFEQKKDGLV PGINSDDSNS SSETQKSETP DYTPDSLSTS
DATITESPAN TNSPDENPSP QNTSAPTNVP LNPANTPYNS SNAAPNSPNT PARPSNTTAG
SPAGNNTVGR LILQYANGNG SDTTNTINPR FKLINNSGSP VKLSDVKIRY YYTIDGEKGQ
QFWCDWSSAG NSNVTGKFVK LSSPKNNADY YLEIGFTEGA GSIEPGMSVE VQARFSKDDW
SNYSQANDYS FSASANDYGN SNHIALYISG RLVSGNEP