Gene Cthe_0660 details

Gene Information

Locus tag: Cthe_0660
Symbol:
ID: 4808190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 813278
End bp: 815599
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 43%
IMG OID: 640106075
Product: glycoside hydrolase family protein
Protein accession: YP_001037088
Protein GI: 125973178
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5498] Predicted glycosyl hydrolase
TIGRFAM ID:
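
The length fields above follow from the coordinates: with inclusive 1-based start and end positions, the gene length is end - start + 1, and for an unspliced CDS the protein length is the codon count minus the stop codon. A minimal sketch of that arithmetic follows. The function names are hypothetical, and the inclusive-coordinate convention and single terminal stop codon are assumptions, though both agree with the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(813278, 815599)
print(length, protein_length_aa(length))
```

For Cthe_0660 this gives 2322 bp and 773 aa, matching the Gene Length and Protein Length fields.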


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCACCGG GTGCTAAGGT ACCTCAGGCA GAGATTTACA AGACATCCAA TTTACAGGGG 
GCAGTTCCGA CCAACAGCTG GGAAAGCTCA ATTTTATGGA ATCAATATTC ACTTCCGATA
TATGCTCATC CTTTGACATT TAAATTTAAA GCCGAAGGTA TTGAAGTAGG AAAGCCTGCA
TTGGGAGGCT CGGGAATAGC TTATTTTGGC GCCCATAAAA ATGACTTTAC CGTTGGACAC
TCATCTGTCT ACACTTTTCC TGATGCAAGG GCGGATAAAA TATCCGATTT TGCCGTCGAT
GCGGTTATGG CTTCAGGTTC AGGCAGTATC AAGGCTACAT TGATGAAGGG AAGTCCTTAT
GCTTATTTTG TTTTTACAGG CGGAAATCCC AGAATTGATT TTTCCGGTAC TCCTACAGTG
TTTTACGGGG ATTCCGGCAG CCAATGCCTT GGCGTTACAA TAAACGGTGT AAATTACGGG
CTTTTTGCTC CGTCTGGCTC AAAATGGCAG GGAATTGGAA CAGGTACGAT AACTTGCATA
CTTCCGGCGG GAAAAAACTA TTTTTCAATT GCGGTTTTAC CTGACAACAC AGTTTCCACT
CTTACATATT ATAAAGATTA CGCCTACTGC TTTGTGACAG ATACAAAAGT GGAATGGAGC
TACAATGAGA CAGAAAGCAC TCTTACCACC ACTTTTACGG CAGAAGTTTC CGTAAAGGAA
GGGACAAACA AAGGCACAAT TCTTGCCCTT TACCCTCATC AATGGCGAAA CAATCCGCAT
ATTTTGCCTC TTCCATATAC TTATTCGACA CTGAGAGGCA TAATGAAAAC AATTCAAGGT
ACAAGCTTTA AAACTGTATA CCGCTACCAT GGAATTTTGC CCAATCTCCC TGACAAAGGA
ACCTACGACA GGGAAGCATT GAACAGATAT ATCAATGAAC TGGCTTTGCA GGCAGACGCT
CCTGTTGCCG TTGACACCTA TTGGTTTGGA AAGCATCTTG GCAAGCTTTC ATGCGCCCTT
CCCATTGCGG AGCAGCTTGG AAATATTTCT GCAAAAGACC GCTTTATAAG CTTTATGAAA
TCATCTTTGG AAGACTGGTT TACCGCAAAA GAAGGAGAAA CGGCAAAACT ATTCTATTAC
GACAGTAACT GGGGAACTTT GATAGGTTAC CCTTCAAGCT ACGGAAGTGA TGAAGAGTTA
AATGACCATC ATTTTCATTA CGGTTATTTT CTTCACGCCG CGGCCCAAAT AGCGTTAAGA
GACCCGCAAT GGGCATCCCG TGACAATTGG GGAGCAATGG TTGAGCTCTT AATCAAGGAT
ATTGCAAACT GGGACAGAAA TGACACAAGG TTTCCTTTCC TAAGAAATTT TGACCCCTAC
GAGGGCCATT CCTGGGCTTC GGGTCATGCC GGATTTGCCG ACGGCAACAA TCAAGAGTCA
TCATCGGAGG CCATCAACGC ATGGCAGGCA ATAATTTTAT GGGGAGAAGC AACAGGAAAC
AAAACGATAA GAGACCTTGG AATTTATCTT TATACCACTG AAGTTGAAGC TGTCTGCAAT
TACTGGTTTG ATTTGTACAA AGACATATTT TCACCTTCCT ATGGACATAA TTACGCTTCC
ATGGTGTGGG GAGGCAAATA CTGCCATGAA ATCTGGTGGA ACGGTACAAA TTCCGAAAAG
CATGGCATAA ACTTTTTGCC AATCACAGCC GCTTCATTGT ATCTTGGAAA AGACCCGAAT
TATATAAAGC AAAACTATGA GGAGATGTTA AGAGAGTGCG GAACGTCACA GCCTCCCAAT
TGGAAGGATA TACAGTATAT GTATTATGCC CTTTATGATC CTGCGGCGGC TAAAAATATG
TGGAACGAAA GCATTGTTCC GGAAGACGGA GAAAGCAAAG CCCATACTTA TCACTGGATT
TGCAACCTTG ACAGTTTGGG GCTTCCTGAT TTCAGTGTTA CTGCAGACAC ACCCCTCTAC
TCGGTATTTA ATAAAAACAA CATCAGAACC TATGTTGTTT ACAATGCTTC ATCGTCTGCA
AAAAAGGTTA CTTTTTCCGA CGGAAAAGTA ATGACGGTGG GGCCTCATTC CATGGCAGTT
TCAACCGGCA GTGAAAGTGA GGTTTTGGCC GGAGATTTAA ACGGTGACGG CAAAATAAAC
TCCACAGACA TAAGCCTTAT GAAGAGATAC CTTTTAAAGC AAATTGTAGA CCTGCCGGTG
GAAGATGATA TTAAAGCTGC AGACATAAAC AAAGACGGCA AAGTTAATTC AACCGACATG
TCGATTCTAA AAAGAGTGAT ATTGAGAAAT TATCCGCTTT AA
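
The GC content field can be recomputed from the gene sequence as the percentage of G and C bases. The sketch below assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first. `gc_percent` is a hypothetical helper name, and the record does not state how its own value was rounded.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, as a usage example.
print(round(gc_percent("TTGCCACCGG GTGCTAAGGT"), 1))
```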
 
Protein sequence
MPPGAKVPQA EIYKTSNLQG AVPTNSWESS ILWNQYSLPI YAHPLTFKFK AEGIEVGKPA 
LGGSGIAYFG AHKNDFTVGH SSVYTFPDAR ADKISDFAVD AVMASGSGSI KATLMKGSPY
AYFVFTGGNP RIDFSGTPTV FYGDSGSQCL GVTINGVNYG LFAPSGSKWQ GIGTGTITCI
LPAGKNYFSI AVLPDNTVST LTYYKDYAYC FVTDTKVEWS YNETESTLTT TFTAEVSVKE
GTNKGTILAL YPHQWRNNPH ILPLPYTYST LRGIMKTIQG TSFKTVYRYH GILPNLPDKG
TYDREALNRY INELALQADA PVAVDTYWFG KHLGKLSCAL PIAEQLGNIS AKDRFISFMK
SSLEDWFTAK EGETAKLFYY DSNWGTLIGY PSSYGSDEEL NDHHFHYGYF LHAAAQIALR
DPQWASRDNW GAMVELLIKD IANWDRNDTR FPFLRNFDPY EGHSWASGHA GFADGNNQES
SSEAINAWQA IILWGEATGN KTIRDLGIYL YTTEVEAVCN YWFDLYKDIF SPSYGHNYAS
MVWGGKYCHE IWWNGTNSEK HGINFLPITA ASLYLGKDPN YIKQNYEEML RECGTSQPPN
WKDIQYMYYA LYDPAAAKNM WNESIVPEDG ESKAHTYHWI CNLDSLGLPD FSVTADTPLY
SVFNKNNIRT YVVYNASSSA KKVTFSDGKV MTVGPHSMAV STGSESEVLA GDLNGDGKIN
STDISLMKRY LLKQIVDLPV EDDIKAADIN KDGKVNSTDM SILKRVILRN YPL
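
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The gene starts with TTG, which table 11 allows as an initiation codon, so the first residue is M rather than L. A minimal sketch of that translation follows. The function and constant names are hypothetical, and it assumes a complete, unspliced CDS given 5' to 3' on the coding strand.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS with table 11, ignoring whitespace in the input."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon is still read as methionine at position 1.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate_cds("TTGCCACCGG GTGCTAAG"))
```

Applied to the first 18 bases of the gene, this yields MPPGAK, the opening residues of the protein sequence above.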