Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2128 |
Symbol | |
ID | 4811175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2526898 |
End bp | 2528280 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107534 |
Product | extracellular solute-binding protein |
Protein accession | YP_001038527 |
Protein GI | 125974617 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTATAACAAT GGTGTTAAGC TTAATTATTT TAAGCATTTT ATTTCTGGCA ACGTCTTGTT CTGGCAGCCG GGATGACACG CAAGATGTTA TAGGTGGTAA AATAGTGATG TATGCGGCAC CCGGCGACAA TGTTCAGTCA GAAATAAGAA ATATAGTAAG AAGCAAGTAT CCAAATGTGG AGTTTCAAGT GGTTTCGTTC AATAATGCTG ACGAATTCAA AAGCAGACTT TTAACTGAAT TGATGGCGGG AGAAGGTCCG GATGTTATTG TTTTAAGCCC ATCCACCAAA AAGGGTTCAA TTACAATAGA AACTATGAGA AAGTTGGTAG AATCAGGAGT TTTCTGTGAT CTGGAGCCAT ATATATCGAA GGATGAGAGT ATAAATTTGT CAGAGTATAA TGAGACTGTT TTAAACAGCG GTGTTATAAA CGGCAAAAGA TACTTTATTC CCATAGCCTA TGATGTACCT ATTTTTTGGA CGGCTAACTC CATTCTTGAG GAAAACAATA TAAAGGATGA AATAGCAAAC TGGACGTTGA AGGACATGGC TGATTTTGCA GTTCAGTTTA AAGAAAAGAA TTCTGATAAT TACCTCTTTG GCTATGGTGA CGGATTTATC AGAAATATTA TGTATGCGAA CTGGAGAGAA TTTGTTGATT ACGAGAATAA GCAGGCAAGC TTTGACAGTC AAGAGTTTGT TGAATTTTTG GAGGCAATTG GAGCTATTGA AAAAGCAGGC ATTTGTGATG AAAAACTTAT TAAAGAATAT ACGGGGATGG AGTTTGAAGC TCTAAAGCAT GGGAAAATTA CTTTGATAAG CAGTACTGAG TATCCCATAA ATCCTTGGGA ATTATGGTAT CGCAATTCGC ACATAAATTA CTATTTTAAT CCGGATAGCA TAAGGCTTTC AAAATTTCCT ACATTTGGGG ACTTGGGCAG AATAGTGGCG CATCCTACAG ATATAGTAGC GATAAACAAA AACAGCAAAA ATAAAGCAAC TGCATATGAG GTGCTGAAAG TTTTTTTGTC AAAAGAAATT CAAAGTTCCC AACAATTTCG CGATAGAATG GGAATACCGG TTAATGATGA GGCGATAAGA GAACTCATAG AGAAATATTC AGGAGAAGAA GGAAAGACCA CCCTTCCTGT GGGAATGACC ATTAACGAAA CTATGGATAC CGTACCGTTA CCGGAATCTG TAGTGGCGGA ATACAATTCA ATAATAAACG GAGTAACTGA ATGTGTACTG GTTGACGAGC AAATAATTGA TTTTATGATT GAAGGATTCA ATGAATACAA AAACGGCAAA ATGTCTGCTA AAGACGCAGC TCGGATGGTA CAGCAAAAAG TAAATTTGTT TTTAAATGAG TAA
|
Protein sequence | MKKFITMVLS LIILSILFLA TSCSGSRDDT QDVIGGKIVM YAAPGDNVQS EIRNIVRSKY PNVEFQVVSF NNADEFKSRL LTELMAGEGP DVIVLSPSTK KGSITIETMR KLVESGVFCD LEPYISKDES INLSEYNETV LNSGVINGKR YFIPIAYDVP IFWTANSILE ENNIKDEIAN WTLKDMADFA VQFKEKNSDN YLFGYGDGFI RNIMYANWRE FVDYENKQAS FDSQEFVEFL EAIGAIEKAG ICDEKLIKEY TGMEFEALKH GKITLISSTE YPINPWELWY RNSHINYYFN PDSIRLSKFP TFGDLGRIVA HPTDIVAINK NSKNKATAYE VLKVFLSKEI QSSQQFRDRM GIPVNDEAIR ELIEKYSGEE GKTTLPVGMT INETMDTVPL PESVVAEYNS IINGVTECVL VDEQIIDFMI EGFNEYKNGK MSAKDAARMV QQKVNLFLNE
|
| |