Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0910 |
Symbol | |
ID | 4810531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1086340 |
End bp | 1088007 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106329 |
Product | extracellular solute-binding protein |
Protein accession | YP_001037337 |
Protein GI | 125973427 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.463169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAAAAA GAGCCTTGTT GGTTGTTGTG CTGGTGATTG CAAGTATTGT TTTTTCTGCT TGCAGCATTG ATTTAAGCAG ATTGTTGGAA GAAGGAAACA ATGACTATGA TTTTTTAAAT AATGGTGTGG ATGATGTACT GGATACAGGG CCTGTAAAAA ACGGTGTTTT AAATTTGTTT TCCACCGAGC CCGACACTCT CAATCCCATA TTGACTTCCA ATGCCTATGT AAAGGAGTAT TCCTGCTTTG TGTTTGAAAG CCTTGTGAGG TTGGACGGAA ATCAGAAGGC TGTGCCGCTT CTGGCCGAAA GCTGGGAAGG ATCCGATGAT GGATTGGTAT GGACATTTTA TTTAAAAGAA AATATTTATT GGCACGATGG TATACCTTTT TCAGCGGAAG ACGTGGAATT TACGGCAAGC GTTATAATGA ATGCGGGGGT AAACAGTCCG TACAAGACAT GTTTTGAAAA TGTGGAAAGC TTTCTTGCTC AAGACAGCAG GACATTTAAG GTGCTTTTAA AAAGCCCCAA TTCTTTCACT CCCGAACTTA TGACTTTTCC CGTAATTCCG AAGCATTATT TTCTTGGGGA AGATATTCTG ACCACTCCCA AAAACAACAG TCCGATAGGT ACCGGACCGT ATAAGTTCGC GGAATACAGG CAAGGTGAGT ATATCAGGCT TACATGTAAT GAAAATTGGT GGAACAAAGA TGATGGTATT GAAAACGGAA TTGACCTTCC CTATATCCAG GAGGTCAATA TAAAAATATA TGGCAAAAAT CAGTCTGTAA TGAATGCTTT TCAGTCACAG CAAGTGGATG TCATTACTTT AGACAGGACC AATTGGACGG GCTACAGTGG AAGGTCCGAT ATAATTTTAA AAAAATATGT AAGTAATGAA TTTGAGTTTG TTGCATTTAA TCTCTCAAAC AAAATACTGA AGGAAAGGGA AGTAAGGACG GCTATAGCAT ATGCTGTTGA CAGGGAGCAG ATTATAAGCA GTATTTTGCC GGGGGAGGCT GTGGCGTCGG ATTTGCCCGT TATTCCCGAT ACATGGCTCA ATGATACCAA TGTTGTATCT TATGAAAGAG ATGTGGAAAA GGCAAAGCAG ATACTTTCAG ATGCCGGATG GAAAGAAAGC AACGGTATAT TTTACAAAAG AATCAACGGT GTTAACACTC CGCTTTCACT GGAGCTTATG GTAAATGATG ACAACGAAGT TCGGCTGTCC GTGGCGGAAA TGATAAAAGA ACAGCTGAAA GAGGCGGGAA TAGAAATAGA AATTAAAAAG GTCAAATGGG AGGACGAACT TAACGGAGTA CAGAGCGGTA AATTTGACAT GGCGCTTATC GGATGCACTG TGGCATCCAT TCCGGATATA TCTTTTCTGT ATTCATCAGC ACAAATCGGG ACAGGGCTTA ATATTTCCGG TTACAGCAAT GAAGAGGTTG ACCGGTATCT TACCTTGATT TTGAAGGAAA AGGATCCGTC AATGAAGAAA GCTTATTTTA TTAACATGAA AGAAATAATA AATCGGGATG TGCCTTGTTT GGGATTGTAT TTTTATAATA ATATGGTTAT ATACAACAAA AGGTTAAGAG GAGAGTTCAA TCCCAGCATA TGGGGCAAAT ATTACGATTT TACCCGGTGG TATATACCTG TTGAGTAA
|
Protein sequence | MRKRALLVVV LVIASIVFSA CSIDLSRLLE EGNNDYDFLN NGVDDVLDTG PVKNGVLNLF STEPDTLNPI LTSNAYVKEY SCFVFESLVR LDGNQKAVPL LAESWEGSDD GLVWTFYLKE NIYWHDGIPF SAEDVEFTAS VIMNAGVNSP YKTCFENVES FLAQDSRTFK VLLKSPNSFT PELMTFPVIP KHYFLGEDIL TTPKNNSPIG TGPYKFAEYR QGEYIRLTCN ENWWNKDDGI ENGIDLPYIQ EVNIKIYGKN QSVMNAFQSQ QVDVITLDRT NWTGYSGRSD IILKKYVSNE FEFVAFNLSN KILKEREVRT AIAYAVDREQ IISSILPGEA VASDLPVIPD TWLNDTNVVS YERDVEKAKQ ILSDAGWKES NGIFYKRING VNTPLSLELM VNDDNEVRLS VAEMIKEQLK EAGIEIEIKK VKWEDELNGV QSGKFDMALI GCTVASIPDI SFLYSSAQIG TGLNISGYSN EEVDRYLTLI LKEKDPSMKK AYFINMKEII NRDVPCLGLY FYNNMVIYNK RLRGEFNPSI WGKYYDFTRW YIPVE
|
| |