Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1957 |
Symbol | |
ID | 4810740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2331305 |
End bp | 2332645 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107373 |
Product | extracellular solute-binding protein |
Protein accession | YP_001038368 |
Protein GI | 125974458 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000130159 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAGGT CAAAATATTC TCTCGTGTTG TCATTCCTTT GTTTCTTAGC AATTCTGACA GTGACGAGGT GTAGTCTGGG GGGAAAAGTG CAGGCGCCGA ACGAATATAC TCGCGTGCAG GCTGCTGAAA TAGAAGAGCC TGCCGAGGAT GTGGAAATAG AATTTTGGAC GTATAATGAC GGCTGGAAAG CTCCTATAAA CCATTTTCAA TTGATTCATC CCAAAATAAA AATAAAACTT GTCAAATTTG ATTTTAACGA TATGGGCAAT GTATATAAAA AAGCGTTGGC AGCGGGAGAA GGGCCTGACA TATTGTTTTT TGACAGCGCT TATTACAGTC AGTTTACCAC GGGAGAATAT CTTGAAGATT TGCTCAAAGA GCCTTATTAT GCAGGAAGGT ACGAAAAAGA TTTTCCGAAG GATATATGGG AAAGCAATAA ATCACTTGAC GGAAAGCGTC TTTTGGCAAT GACTTTCCTG ACATCACCCG TTGTGACTTT TTACCGGGCG GATGTAATGG AGGAAAACGG TTTTCCGTCG GAACCTGAAG AACTTGCCAA ATTTATTGAA AAAAGCGAAA ATCTTATGGC TATAGCGAAA AAGCTGAAAT CCAAGGGTCA GTACATTTTT CAGATGCCTG TGGATATCAT AAATCTTGCC GGTCTGTATT CGGGAATATT TGATGAAAAC CTGAGATTTG TAAGAAACTC AGACTTGTTT GTGCAAGCGC TGGATATGGC AAGGGAAATT AAAAGGCTGG ATTTGTCAAT TGGTGCAAAT ATTGTGGAGG AAGCGGCAAA GGAGGCTGTC CGGAACGGAG AACTGGTAAT GGTGCTGGGC ATCGGGTCGT GGGGGACCAG TACCATACAG AGCTATGCTC CCGACCAGGC AGGTAAGTGG AGGGTCACTG CGCCGCCCCT GGGGCTTAAG GTATGGTATT CGGATACCAA GCTTGCGATT AATGCCCAAA GCAAATACAA GAAATGGGCA TGGCTGTTTG TTGAATATGT GGCAACCCAG CAGGAGGGCG GGGAGAACAT TGACATGATT TCCGGCTATT GTCCTGCAAG AAGAAATCTT AAAGTTATGC TGAGAGAAAA CGAGTATTTC GGTAATCAGC ATATTCAGCC GCTTATTGAA GATTTGGCAG AGGAAATGGT TCAGTACAGG CAGACTCCTT TGGATGACAG GGCACTTCAA ATATTTAATG AGGAAATATT CAGAGCCATC GAAAACAATG TTGATTCCCA GAAAACGATA AAGGATATTG CAAATAAAGT GGAATATGAG CTGAAAAAGG ATCGGGAAGC TTTACTTAAG GGGAAAAAGC GGGTAAAATA G
|
Protein sequence | MRRSKYSLVL SFLCFLAILT VTRCSLGGKV QAPNEYTRVQ AAEIEEPAED VEIEFWTYND GWKAPINHFQ LIHPKIKIKL VKFDFNDMGN VYKKALAAGE GPDILFFDSA YYSQFTTGEY LEDLLKEPYY AGRYEKDFPK DIWESNKSLD GKRLLAMTFL TSPVVTFYRA DVMEENGFPS EPEELAKFIE KSENLMAIAK KLKSKGQYIF QMPVDIINLA GLYSGIFDEN LRFVRNSDLF VQALDMAREI KRLDLSIGAN IVEEAAKEAV RNGELVMVLG IGSWGTSTIQ SYAPDQAGKW RVTAPPLGLK VWYSDTKLAI NAQSKYKKWA WLFVEYVATQ QEGGENIDMI SGYCPARRNL KVMLRENEYF GNQHIQPLIE DLAEEMVQYR QTPLDDRALQ IFNEEIFRAI ENNVDSQKTI KDIANKVEYE LKKDREALLK GKKRVK
|
| |