Gene Cthe_0825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0825 
Symbol 
ID4810443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1004283 
End bp1006232 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content42% 
IMG OID640106242 
Productglycoside hydrolase family protein 
Protein accessionYP_001037253 
Protein GI125973343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGAA TGACCTTGAA AAGCAGCATG AAAAAACGTG TGTTATCTTT GCTCATTGCT 
GTAGTGTTTC TAAGCTTGAC CGGAGTATTT CCTTCGGGAT TGATTGAGAC CAAAGTGTCA
GCTGCAAAAA TAACGGAGAA TTATCAATTT GATTCACGAA TCCGTTTAAA CTCAATAGGT
TTTATACCGA ACCACAGCAA AAAGGCGACT ATAGCTGCAA ATTGTTCAAC CTTTTATGTT
GTTAAAGAAG ACGGAACAAT AGTGTATACC GGAACGGCAA CTTCAATGTT TGACAATGAT
ACAAAAGAAA CTGTTTATAT TGCTGATTTT TCATCTGTTA ATGAAGAAGG AACGTACTAT
CTTGCCGTGC CGGGAGTAGG AAAAAGCGTA AACTTTAAAA TTGCAATGAA TGTATATGAG
GATGCTTTTA AAACAGCAAT GCTGGGAATG TATTTGCTGC GCTGCGGCAC CAGTGTGTCG
GCCACATACA ACGGAATACA CTATTCCCAT GGACCGTGCC ATACTAATGA TGCATATCTT
GATTATATAA ACGGACAGCA TACTAAAAAA GACAGTACAA AAGGCTGGCA TGATGCGGGC
GACTACAACA AATATGTGGT AAACGCCGGC ATAACCGTTG GTTCAATGTT CCTGGCGTGG
GAGCATTTTA AAGACCAGTT GGAGCCTGTG GCATTGGAGA TTCCCGAAAA GAACAATTCA
ATACCGGATT TTCTTGATGA ATTAAAATAT GAGATAGACT GGATTCTTAC CATGCAATAC
CCTGACGGGA GCGGAAGGGT GGCTCATAAA GTTTCGACAA GGAACTTTGG CGGCTTTATC
ATGCCTGAGA ACGAACACGA CGAAAGATTT TTCGTGCCCT GGAGCAGTGC CGCAACGGCA
GACTTTGTTG CCATGACGGC CATGGCTGCA AGAATATTCA GGCCTTATGA TCCTCAATAT
GCTGAAAAAT GTATAAATGC GGCAAAAGTA AGCTATGAGT TTTTGAAGAA CAATCCTGCG
AATGTTTTTG CAAACCAGAG TGGATTCTCA ACAGGAGAAT ATGCCACTGT CAGTGATGCA
GATGACAGAT TGTGGGCGGC GGCTGAAATG TGGGAGACCC TGGGAGATGA AGAATACCTT
AGAGATTTTG AAAACAGGGC GGCGCAATTC TCGAAAAAAA TAGAAGCCGA TTTTGACTGG
GATAATGTTG CAAACTTAGG TATGTTTACA TATCTTTTGT CAGAAAGACC GGGCAAGAAT
CCTGCTTTGG TGCAGTCAAT AAAGGATAGT CTCCTTTCCA CTGCGGATTC AATTGTGAGG
ACCAGCCAAA ACCATGGCTA TGGCAGAACC CTTGGTACAA CATATTACTG GGGATGCAAC
GGCACGGTTG TAAGACAGAC TATGATACTT CAGGTTGCGA ACAAGATTTC ACCCAACAAT
GATTATGTAA ATGCTGCTCT CGATGCGATT TCACATGTAT TTGGAAGAAA CTATTACAAC
AGGTCTTATG TAACAGGCCT TGGTATAAAT CCTCCTATGA ATCCTCATGA CAGACGTTCA
GGGGCTGACG GAATATGGGA GCCGTGGCCC GGTTACCTTG TAGGAGGAGG ATGGCCCGGA
CCGAAGGATT GGGTGGATAT TCAGGACAGT TATCAGACCA ATGAAATTGC TATAAACTGG
AATGCGGCAT TGATTTATGC CCTTGCCGGA TTTGTCAACT ATAATTCTGC TCAAAATGAA
GTACTGTACG GAGATGTGAA TGATGACGGA AAAGTAAACT CCACTGACTT GACTTTGTTA
AAAAGATATG TTCTTAAAGC CGTCTCAACT CTGCCTTCTT CCAAAGCTGA AAAGAACGCA
GATGTAAATC GTGACGGAAG AGTTAATTCC AGTGATGTCA CAATACTTTC AAGATATTTG
ATAAGGGTAA TCGAGAAATT ACCAATATAA
 
Protein sequence
MSRMTLKSSM KKRVLSLLIA VVFLSLTGVF PSGLIETKVS AAKITENYQF DSRIRLNSIG 
FIPNHSKKAT IAANCSTFYV VKEDGTIVYT GTATSMFDND TKETVYIADF SSVNEEGTYY
LAVPGVGKSV NFKIAMNVYE DAFKTAMLGM YLLRCGTSVS ATYNGIHYSH GPCHTNDAYL
DYINGQHTKK DSTKGWHDAG DYNKYVVNAG ITVGSMFLAW EHFKDQLEPV ALEIPEKNNS
IPDFLDELKY EIDWILTMQY PDGSGRVAHK VSTRNFGGFI MPENEHDERF FVPWSSAATA
DFVAMTAMAA RIFRPYDPQY AEKCINAAKV SYEFLKNNPA NVFANQSGFS TGEYATVSDA
DDRLWAAAEM WETLGDEEYL RDFENRAAQF SKKIEADFDW DNVANLGMFT YLLSERPGKN
PALVQSIKDS LLSTADSIVR TSQNHGYGRT LGTTYYWGCN GTVVRQTMIL QVANKISPNN
DYVNAALDAI SHVFGRNYYN RSYVTGLGIN PPMNPHDRRS GADGIWEPWP GYLVGGGWPG
PKDWVDIQDS YQTNEIAINW NAALIYALAG FVNYNSAQNE VLYGDVNDDG KVNSTDLTLL
KRYVLKAVST LPSSKAEKNA DVNRDGRVNS SDVTILSRYL IRVIEKLPI