Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2961 |
Symbol | |
ID | 4810849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3478233 |
End bp | 3479858 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108383 |
Product | extracellular solute-binding protein |
Protein accession | YP_001039351 |
Protein GI | 125975441 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAA TCTTGGCACT TACAATTACA CTTGTAATCG TTGCAACAGT ATTAACAGGC TGCGGAAAAA CACAGAACAA AACATCAACA TCAAACCAAG TCATTCGTGC GACTATTGCA TCCGAGCCAA AAACCCTGGA TCCGTCCCGC AACAATGCAG TTGATGGCGG CAACTATATT TTATGTGCAT TTGAAGGACT GACCACGCTA GGCAAAGACG GCACCATTGT TGCAGGAACG GCTGAAAGAT GGGAAACAAG TGATGACGGG CTTGTTTGGA CATTCTACAT CCGCAAGGAT GCAAAGTGGT CTGACGGCAA GGATGTAACT GCAAACGATT TTGTTTATTC CTGGAGAAGG CTCGTTGATC CGGCAACAGC CGCCGATTAT GCGTATTACG CTTATTTCAT AAAAAACGGT GAAAAAATCA ACGCCGGTGA AGCCGACGTC AGTACGTTGG GTGTTCGGGC TGTCAACGAT AAAACCCTTG AGGTAACCCT TGAAAGCCCA TGTCCGTTCT TTACCGAAAT AGTTGCCTTC CCTGCTCTCG TTCCTCTCAG GGAAGATATT ATATCCGCCA ATAAAGACAA GTGGGCGCTT GAACCGTCTA CTTATATAGG CAACGGACCT TACAAACTGA CCAGCTGGGA CCATGATTCA AAAATTGTAT TTGAAAAGAA TGAAAACTAC TGGGACAAAA ATAATGTAAT TGCACCTAAA ATCGAATGGT ATCTGATGAA TGACCAGAAT GCCATCTTAA GCGCATTTAA AAACGGACAA GTTGCCTATG CAAAGAATAT TCCTTCGGAT GAGCTGGCTG CTGAAAAAGC AGCAGGTAAT CTGAAAATTT TTCCGTTAAT TGGAACATAT TATATAGACT TTGTAAACAA TAAACCTCCT TTTAACGATG TAAGGGTAAG AAAAGCGTTT TCTCTGGCAA TTGACCGCAA CTACCTGGTA GAAAACGTCA AAAAAGGTGG TGAAACTCCG GCAACAGCTT TTGTACCATA CGGTATAGCT GACGTTAATC CGGAACCCGA CTTCCGCACA GTAGGCGGCG ACTATATTTC AGTAAAACCT GAAGATTACG AAAAGAATGT GGCTGAAGCC AAAAGGCTTC TTGCTGAGGC AGGTTACCCG GACGGAAAAG GATTCCCGAA AATTACTTTT GGTCTGAACT CAGGTGCGGG TCATGAGCCT ATAGCTGAAG CTTTGCAGCA GATGTGGAAG GAAAATCTGG GTGTTGAAGT TGAAATCCTG GCTCAGGAAT GGAATGTATT CCAGCAGTCA CGAAAAGACG GCGTTTACAA TATAAACAGA AACGGTTGGA TCGGAGACTA TATGGATCCG TCAACTTTCA TGGACATATT TACAACCGGA AACGGTCAGA ATAATGCCAT GTACAGCAAT CCAAAGTATG ATGAACTTAT TTCCGCAGCA AGAAGAGAAA CTGACCCGGC TAAACGCATT CAAATGTACC ATGATGCGGA AAAGATACTT ATGGATGACG CCGCAATAGC TCCTTTGTAC TTCTATACTG ATCCTATAGT CATATCACCG AATCTTAAGG GAGTGTTACA TTCACAACTT GGTTTCGTAA TCTTCAAATG GGCATATTTT GAATAG
|
Protein sequence | MKRILALTIT LVIVATVLTG CGKTQNKTST SNQVIRATIA SEPKTLDPSR NNAVDGGNYI LCAFEGLTTL GKDGTIVAGT AERWETSDDG LVWTFYIRKD AKWSDGKDVT ANDFVYSWRR LVDPATAADY AYYAYFIKNG EKINAGEADV STLGVRAVND KTLEVTLESP CPFFTEIVAF PALVPLREDI ISANKDKWAL EPSTYIGNGP YKLTSWDHDS KIVFEKNENY WDKNNVIAPK IEWYLMNDQN AILSAFKNGQ VAYAKNIPSD ELAAEKAAGN LKIFPLIGTY YIDFVNNKPP FNDVRVRKAF SLAIDRNYLV ENVKKGGETP ATAFVPYGIA DVNPEPDFRT VGGDYISVKP EDYEKNVAEA KRLLAEAGYP DGKGFPKITF GLNSGAGHEP IAEALQQMWK ENLGVEVEIL AQEWNVFQQS RKDGVYNINR NGWIGDYMDP STFMDIFTTG NGQNNAMYSN PKYDELISAA RRETDPAKRI QMYHDAEKIL MDDAAIAPLY FYTDPIVISP NLKGVLHSQL GFVIFKWAYF E
|
| |