Gene Information |
Locus tag | Cthe_0660 |
Symbol | |
ID | 4808190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 813278 |
End bp | 815599 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106075 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037088 |
Protein GI | 125973178 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5498] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
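The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(813278, 815599)
print(length_bp)                  # 2322
print(protein_length(length_bp))  # 773
```

Both values match the Gene Length (2322 bp) and Protein Length (773 aa) fields of this record.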
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCACCGG GTGCTAAGGT ACCTCAGGCA GAGATTTACA AGACATCCAA TTTACAGGGG GCAGTTCCGA CCAACAGCTG GGAAAGCTCA ATTTTATGGA ATCAATATTC ACTTCCGATA TATGCTCATC CTTTGACATT TAAATTTAAA GCCGAAGGTA TTGAAGTAGG AAAGCCTGCA TTGGGAGGCT CGGGAATAGC TTATTTTGGC GCCCATAAAA ATGACTTTAC CGTTGGACAC TCATCTGTCT ACACTTTTCC TGATGCAAGG GCGGATAAAA TATCCGATTT TGCCGTCGAT GCGGTTATGG CTTCAGGTTC AGGCAGTATC AAGGCTACAT TGATGAAGGG AAGTCCTTAT GCTTATTTTG TTTTTACAGG CGGAAATCCC AGAATTGATT TTTCCGGTAC TCCTACAGTG TTTTACGGGG ATTCCGGCAG CCAATGCCTT GGCGTTACAA TAAACGGTGT AAATTACGGG CTTTTTGCTC CGTCTGGCTC AAAATGGCAG GGAATTGGAA CAGGTACGAT AACTTGCATA CTTCCGGCGG GAAAAAACTA TTTTTCAATT GCGGTTTTAC CTGACAACAC AGTTTCCACT CTTACATATT ATAAAGATTA CGCCTACTGC TTTGTGACAG ATACAAAAGT GGAATGGAGC TACAATGAGA CAGAAAGCAC TCTTACCACC ACTTTTACGG CAGAAGTTTC CGTAAAGGAA GGGACAAACA AAGGCACAAT TCTTGCCCTT TACCCTCATC AATGGCGAAA CAATCCGCAT ATTTTGCCTC TTCCATATAC TTATTCGACA CTGAGAGGCA TAATGAAAAC AATTCAAGGT ACAAGCTTTA AAACTGTATA CCGCTACCAT GGAATTTTGC CCAATCTCCC TGACAAAGGA ACCTACGACA GGGAAGCATT GAACAGATAT ATCAATGAAC TGGCTTTGCA GGCAGACGCT CCTGTTGCCG TTGACACCTA TTGGTTTGGA AAGCATCTTG GCAAGCTTTC ATGCGCCCTT CCCATTGCGG AGCAGCTTGG AAATATTTCT GCAAAAGACC GCTTTATAAG CTTTATGAAA TCATCTTTGG AAGACTGGTT TACCGCAAAA GAAGGAGAAA CGGCAAAACT ATTCTATTAC GACAGTAACT GGGGAACTTT GATAGGTTAC CCTTCAAGCT ACGGAAGTGA TGAAGAGTTA AATGACCATC ATTTTCATTA CGGTTATTTT CTTCACGCCG CGGCCCAAAT AGCGTTAAGA GACCCGCAAT GGGCATCCCG TGACAATTGG GGAGCAATGG TTGAGCTCTT AATCAAGGAT ATTGCAAACT GGGACAGAAA TGACACAAGG TTTCCTTTCC TAAGAAATTT TGACCCCTAC GAGGGCCATT CCTGGGCTTC GGGTCATGCC GGATTTGCCG ACGGCAACAA TCAAGAGTCA TCATCGGAGG CCATCAACGC ATGGCAGGCA ATAATTTTAT GGGGAGAAGC AACAGGAAAC AAAACGATAA GAGACCTTGG AATTTATCTT TATACCACTG AAGTTGAAGC TGTCTGCAAT TACTGGTTTG ATTTGTACAA AGACATATTT TCACCTTCCT ATGGACATAA TTACGCTTCC ATGGTGTGGG GAGGCAAATA CTGCCATGAA ATCTGGTGGA ACGGTACAAA TTCCGAAAAG CATGGCATAA ACTTTTTGCC AATCACAGCC GCTTCATTGT ATCTTGGAAA AGACCCGAAT TATATAAAGC AAAACTATGA GGAGATGTTA AGAGAGTGCG GAACGTCACA GCCTCCCAAT 
TGGAAGGATA TACAGTATAT GTATTATGCC CTTTATGATC CTGCGGCGGC TAAAAATATG TGGAACGAAA GCATTGTTCC GGAAGACGGA GAAAGCAAAG CCCATACTTA TCACTGGATT TGCAACCTTG ACAGTTTGGG GCTTCCTGAT TTCAGTGTTA CTGCAGACAC ACCCCTCTAC TCGGTATTTA ATAAAAACAA CATCAGAACC TATGTTGTTT ACAATGCTTC ATCGTCTGCA AAAAAGGTTA CTTTTTCCGA CGGAAAAGTA ATGACGGTGG GGCCTCATTC CATGGCAGTT TCAACCGGCA GTGAAAGTGA GGTTTTGGCC GGAGATTTAA ACGGTGACGG CAAAATAAAC TCCACAGACA TAAGCCTTAT GAAGAGATAC CTTTTAAAGC AAATTGTAGA CCTGCCGGTG GAAGATGATA TTAAAGCTGC AGACATAAAC AAAGACGGCA AAGTTAATTC AACCGACATG TCGATTCTAA AAAGAGTGAT ATTGAGAAAT TATCCGCTTT AA
|
Protein sequence | MPPGAKVPQA EIYKTSNLQG AVPTNSWESS ILWNQYSLPI YAHPLTFKFK AEGIEVGKPA LGGSGIAYFG AHKNDFTVGH SSVYTFPDAR ADKISDFAVD AVMASGSGSI KATLMKGSPY AYFVFTGGNP RIDFSGTPTV FYGDSGSQCL GVTINGVNYG LFAPSGSKWQ GIGTGTITCI LPAGKNYFSI AVLPDNTVST LTYYKDYAYC FVTDTKVEWS YNETESTLTT TFTAEVSVKE GTNKGTILAL YPHQWRNNPH ILPLPYTYST LRGIMKTIQG TSFKTVYRYH GILPNLPDKG TYDREALNRY INELALQADA PVAVDTYWFG KHLGKLSCAL PIAEQLGNIS AKDRFISFMK SSLEDWFTAK EGETAKLFYY DSNWGTLIGY PSSYGSDEEL NDHHFHYGYF LHAAAQIALR DPQWASRDNW GAMVELLIKD IANWDRNDTR FPFLRNFDPY EGHSWASGHA GFADGNNQES SSEAINAWQA IILWGEATGN KTIRDLGIYL YTTEVEAVCN YWFDLYKDIF SPSYGHNYAS MVWGGKYCHE IWWNGTNSEK HGINFLPITA ASLYLGKDPN YIKQNYEEML RECGTSQPPN WKDIQYMYYA LYDPAAAKNM WNESIVPEDG ESKAHTYHWI CNLDSLGLPD FSVTADTPLY SVFNKNNIRT YVVYNASSSA KKVTFSDGKV MTVGPHSMAV STGSESEVLA GDLNGDGKIN STDISLMKRY LLKQIVDLPV EDDIKAADIN KDGKVNSTDM SILKRVILRN YPL
|
| |
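The record lists translation table 11, and the gene sequence begins with TTG while the protein begins with M. Under NCBI table 11, TTG is one of the permitted alternative initiation codons, and an initiation codon is read as methionine regardless of its usual meaning. The following is a minimal, dependency-free sketch of that translation step. The function name `translate_cds` is hypothetical, the codon-to-residue mapping is the standard one that table 11 shares, and only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace between base groups is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # A permitted initiation codon in first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the terminal stop codon, which encodes no residue.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


print(translate_cds("TTGCCACCGG GTGCTAAGGT ACCTCAGGCA"))  # MPPGAKVPQA
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence in the same way should reproduce the complete 773-residue protein, assuming the sequence is copied without gaps.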