Gene Information |
Locus tag | Cthe_1020 |
Symbol | |
ID | 4811314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1220262 |
End bp | 1221641 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106438 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037445 |
Protein GI | 125973535 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
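The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the Gene Length includes the stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1220262
END_BP = 1221641


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1380, matches Gene Length
print(protein_length_aa(length))  # 459, matches Protein Length
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.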
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000124434 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAAGA AGGTAATCGC ATTAATGTTG GTTGCTGTTA TGGCTTTAAG TCTGGCAGCA TGTGGTGGTG GAGGAGGAAA TACTACGACT TCACCGCAAC CAAACGATTC CCAAAATTCG CCTGATTCAG GAACAAAGAA GGACCCAGTA AAATTGACCA TGTGGATCAT GCCTAACAGT GACACACCGG ACCAGGATCT TTTGAAAGTT GTTAAGCCAT TCACAGATGC TAATCCTCAT ATCACAGTTG AACCTACAGT TGTTGACTGG AGTGCAGCTT TGACAAAGAT CACAGCTGCT GCTACAAGTG GTGAAGCTCC TGACATTACA CAGGTTGGTT CCACTTGGAC AGCTGCTATC GGTGCAATGG AAGGTGCATT GGTTGAGCTT ACCGGAAAAA TCGATACAAG TGCTTTCGTT GAATCAACTC TGCAGTCAGC TTATATCAAA GGCACAGACA AGATGTTCGG TATGCCTTGG TTTACTGAAA CAAGAGCTCT CTTCTACAGA AAAGACGCTT GCGAAAAAGC AGGTGTAAAT CCTGAAACAG ATTTCGCAAC TTGGGACAAA TTCAAAGATG CTCTCAAGAA ACTCAACGGT ATTGAAGTTG ACGGCAAGAA ACTGGTTGCA CTGGGTATGC CGGGTAAGAA CGACTGGAAC GTTGTTCATA ACTTCTCATG GTGGATTTAC GGTGCCGGCG GAGACTTTGT AAACGAAGAA GGTACACAAG CTACTTTCTC AAGCGAAAAT GCTCTTAAAG GTATCAAATT CTATTCAGAA CTTGCTGTTG AAGGTTTGAT GGATGAGCCT TCACTTGAAA AGAATACAAG TGACATTGAG TCCGCATTTG GTGACGGTGC ATACGCTACT GCATTCATGG GTCCTTGGGT TATTTCATCT TACACAAAGA ATAAAGAAGA AAACGGTAAC GACCTTATCG ACAAAATTGG TGTTACTATG GTTCCTGAAG GACCTGCAGG AAGATATGCA TTCATGGGTG GAAGTAACCT TGTAATATTC AACTCATCAA AGAACAAGGA TGAAGCCGTT GAACTTCTCA AGTTCTTTGC TAGCAAAGAA GCTCAGGTTG AATACTCAAA GGTTAGCAAG ATGCTTCCGG TTGTTAAAGC GGCTTACGAA GATCCATACT TTGAAGATTC ATTGATGAAA GTATTCAAAG AACAGGTAGA CAAATATGGT AAACACTATG CATCAGTTCC TGGTTGGGCT TCTGCAGAAG TTATCTTCTC AGAAGGTCTC AGCAAGATCT GGGATAACGT TATGGAAGTT GATGGTGCAT ACAGCTACGA CAAGACTGTA CAAATCGTAA AAGATGTTGA AAGTCAAATC AACCAAATAT TGCAAGAAAC AAGCAAATAA
|
Protein sequence | MLKKVIALML VAVMALSLAA CGGGGGNTTT SPQPNDSQNS PDSGTKKDPV KLTMWIMPNS DTPDQDLLKV VKPFTDANPH ITVEPTVVDW SAALTKITAA ATSGEAPDIT QVGSTWTAAI GAMEGALVEL TGKIDTSAFV ESTLQSAYIK GTDKMFGMPW FTETRALFYR KDACEKAGVN PETDFATWDK FKDALKKLNG IEVDGKKLVA LGMPGKNDWN VVHNFSWWIY GAGGDFVNEE GTQATFSSEN ALKGIKFYSE LAVEGLMDEP SLEKNTSDIE SAFGDGAYAT AFMGPWVISS YTKNKEENGN DLIDKIGVTM VPEGPAGRYA FMGGSNLVIF NSSKNKDEAV ELLKFFASKE AQVEYSKVSK MLPVVKAAYE DPYFEDSLMK VFKEQVDKYG KHYASVPGWA SAEVIFSEGL SKIWDNVMEV DGAYSYDKTV QIVKDVESQI NQILQETSK
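The relationship between the two sequence rows can be checked with a small translation routine. This is a minimal sketch, not the database's own method: it assumes the Gene sequence row is already given on the coding strand (it starts with ATG even though the Strand field is "-"), and it relies on translation table 11 assigning sense codons the same way as the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# NCBI codon order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 20 bases of the Gene sequence row, spacing kept as in the record.
print(translate("ATGCTCAAGA AGGTAATCGC"))  # MLKKVI
```

Passing the full Gene sequence row to `translate` should reproduce the Protein sequence row, and `gc_percent` on the same input can be compared with the GC content field.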