Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1400 |
Symbol | |
ID | 4809061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1710973 |
End bp | 1712220 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106823 |
Product | glycosyl hydrolase 53 |
Protein accession | YP_001037824 |
Protein GI | 125973914 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.195161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATCGA AAATTACAAA ACTTACTGTT ATCTTTATCC TCATCCTGTC TGTCACATTT CTTTCAATGC CCAAAACTTA CACACAAGCG GCTCCAACTT TTGCAAAGGG GGCTGATGTA AGCTGGCTGC CGGAGATGGA GGCAAACGGC TACAAATTTT ATAATGACGA CGGGATTGAG CAGGATTGTC TTCAAATTTT AAAAGACCAT GGGATAGACT CAATCAGACT CAGGGTATGG GTCAATCCTC CAAACGGTTA CTGCAACAAA GAAGAGACAA TTAAGATGGC ATTAAGAGCT AAAAAAATGG GATTCAGAAT AATGATAAAC TTTCATTACA GTGACTCATG GGCCGACCCG GGGAAACAAA CAAAACCGGC AGCATGGGCC AAATATGATT TTAACGGCTT GATGAAAGCT GTATATGACT ATACATATGA TGTCATGAGC GCTTTGAAAG CCAACGGCAT AACTCCCGAG TGGGTTCAGG TTGGAAATGA AACCAATAAC GGCATGCTGT GGGAAGACGG CAAGGCAACG AACAGCATGA GGAATTTTGC ATGGCTCATC AACTGCGGTT ATGATGCCGT AAAAGCGGTA AGTCCTGAAA CTAAAGTTAT AGTTCATATT GCAAATGGAT ATGACAATGC ATTGTACAGA TGGATATTTG ATGGAATTAC CGCAAACGGT GCAAGATTTG ACGTAATAGG TATGTCGCTG TATCCTACGG CGTCTGACTG GCCACAATTG ACAAACCAGT GTCTAAACAA TATGAAAGAC ATGATATCAA GATACGGAAA AGAAATAATG ATATGTGAGA TTGGAATGGA TTACTGGGAG GCGCAGGCTT GCAAAGACTT TATTACCGAT ATAATTCAAA AGACAAAATC CCTGCCCGAT AATAAAGGCC TTGGAGTATT CTACTGGGAG CCTCAATGTT ATAACTGGCA GTATTATAAT AAAGGAGCCT TTGATTTATC CGGAAAACCC ACAATTGCGC TGGATGCGTT TTTGACCTCC GATACACCGG ATAATCTTAT TTATGGAGAT TTGAACGGTG ACGGACGCGT GAATTCCACG GACTACACTT TGCTGAAGAG ATATTTGCTT GGCGCTATAC AAACTTTCCC TTATGAAAGG GGAATTAAGG CTGCGGACCT GAATTTGGAC GGTCGTATCA ATTCGACTGA TTATACTGTG CTAAAAAGAT ATTTACTCAA TGCCATACCA TCACTTCCTG TAAAATAG
|
Protein sequence | MLSKITKLTV IFILILSVTF LSMPKTYTQA APTFAKGADV SWLPEMEANG YKFYNDDGIE QDCLQILKDH GIDSIRLRVW VNPPNGYCNK EETIKMALRA KKMGFRIMIN FHYSDSWADP GKQTKPAAWA KYDFNGLMKA VYDYTYDVMS ALKANGITPE WVQVGNETNN GMLWEDGKAT NSMRNFAWLI NCGYDAVKAV SPETKVIVHI ANGYDNALYR WIFDGITANG ARFDVIGMSL YPTASDWPQL TNQCLNNMKD MISRYGKEIM ICEIGMDYWE AQACKDFITD IIQKTKSLPD NKGLGVFYWE PQCYNWQYYN KGAFDLSGKP TIALDAFLTS DTPDNLIYGD LNGDGRVNST DYTLLKRYLL GAIQTFPYER GIKAADLNLD GRINSTDYTV LKRYLLNAIP SLPVK
|
| |