Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1046 |
Symbol | |
ID | 4811344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1248913 |
End bp | 1250253 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106468 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037471 |
Protein GI | 125973561 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA CCGCAGTGAA GTTGTTGCTG GTTTTTCCGG TTTTGCTCGC TTACATATTT TTTACCGGAT GTTCTAAAAA GCCTGCGAAG GCAGAGGAAA ATACACAAAT TCAGATAAGC AGCACACCTC AGCAAGATTT TGACCTGGGC GGTTACACCG TTAGAATTGC TCAATGGTGG GACGCCAGTC CAAACGATAG AAGCTCAATA GCCCGTCACA AGGCAGCGGA AGAAAAATAT AATTGCAAAA TTGAATATAT AACCATTACA TGGGATCAAA TTGTAAGCAA ATTTACCTCA TCAGTATTAT CAGGAGAACC TATAGCCGAT ATTGTTTTGT TTGAAATGAC CAGGGCGCTT CCCGTGCTTG CTGAATCAGA CCTTATAATT CCGGTGGACG ACTACTTTGA CTTTAATGAT CGGAAATGGC CCCCTATAAT TAGGCAAATA GGAAGATATA AAGGAAAACA ATACGGATTT ACGAACTATT GCTGGACTGT AACGGGGATT TTTTACAACA AAGTGTTGTT TGACAGATTG GGATTGCCTG ACCCGTATAT GCTTCAGGAG AACGGAGATT GGACTTGGGA AAAATTTGCT GAGATAGCTC AAATGGCAAC CAGAGATGAA GACGGCGACG GAGAGAATGA CCTGTGGGGA TTGGCAATAC AAGGTCATAA TCTTTATTCT CCTCTTATTT TATCAAACAA TGCCAATATC ATAAATTTTG ATGAAAACGG CAGAGCTATC TATGCTCTTG ATGACCCAAA TGCCATTGAA GCTCTTCAGT TTTTTGAGGA TTTGCACAAT AAATATAAAG TTGTGGCGCC TGTTGAAGAT CCAACTGACT GGTATGAGGC TCCTCGAAAA TTTTCAGAAG GCAATATTGC CATGTTTTTT GGACATGGCT GGGACGGACA GGAACTCAAA AATACGATGA AAGATGATTT TGGTTTTGTA TTTTTTCCAA AAGGACCTAA AGCTTCCGAT TACATAGTTC CTGTTCAGCA GGAGTGCAAA ATTTATGTAA TGCCCAAATA TGCAAAGCAC CCAAGAGAAG TGGCTAAAGT TTTTGAAGAA ATATCTCCCT TCTATAATGA CAATGTAGGA TTCGAAAGCT GGATTAATAC ATTTCTGGAC ACAGATGGAG AGAAGAATAC GGCGAGAATG ATGCTTGAAA AAGGGAAGGT ATCGTTGCAT CAAGCGTATC CTACCTTTGA TAATCTTCTT TTCAATAAAA TAGCAAGAGA AATTATAATA GACAACATTT CGGTTGAAGA TTTTGTAAAA AAATTCAAAG ATGAGGCCCA AAAGGCTATT GATTCTGAAT ATGAAAGATA A
|
Protein sequence | MKNTAVKLLL VFPVLLAYIF FTGCSKKPAK AEENTQIQIS STPQQDFDLG GYTVRIAQWW DASPNDRSSI ARHKAAEEKY NCKIEYITIT WDQIVSKFTS SVLSGEPIAD IVLFEMTRAL PVLAESDLII PVDDYFDFND RKWPPIIRQI GRYKGKQYGF TNYCWTVTGI FYNKVLFDRL GLPDPYMLQE NGDWTWEKFA EIAQMATRDE DGDGENDLWG LAIQGHNLYS PLILSNNANI INFDENGRAI YALDDPNAIE ALQFFEDLHN KYKVVAPVED PTDWYEAPRK FSEGNIAMFF GHGWDGQELK NTMKDDFGFV FFPKGPKASD YIVPVQQECK IYVMPKYAKH PREVAKVFEE ISPFYNDNVG FESWINTFLD TDGEKNTARM MLEKGKVSLH QAYPTFDNLL FNKIAREIII DNISVEDFVK KFKDEAQKAI DSEYER
|
| |