Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2708 |
Symbol | |
ID | 4810702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3194634 |
End bp | 3196055 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108127 |
Product | hypothetical protein |
Protein accession | YP_001039100 |
Protein GI | 125975190 |
COG category | [N] Cell motility |
COG ID | [COG3225] ABC-type uncharacterized transport system involved in gliding motility, auxiliary component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000297424 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG GTTTTAAATA TAGTAAGAAT TTTAAATATG GCATGGTTTT TACTGTTATG GCGGCTCTTG TTGTCGGAAT AACGATATTG GTGAATGCTT TTGTTTCAGC TCTTAATATA AGATGGGACG TGTCTCAGAA CAAGATGTAT TCCATTGGTG AGCAGACTAA AGTGATACTT GAGGGTTTGA AACAGGAAGT CGACGTGGTA ATGCTGGCCG ACAGGGATGA GATAAAAACA TATGAAGCCG GTTTTATTCT TGTGGAATTT TTGGACAAGT ACGACAAGTT TGACAAAGTA AACGTAAAAT TTGTTGACCC GGACAAGAAT CCTGATATAG TAAAAGAACT GAACAAATCC GGTACATTAA ATCCCCAGCA GAATGAGATT ATTGTAAGAA GCGGAGACAA GGTAAAGAAA GTTACCATAT ATGATATTTT CCAGCAGGAC TATTACGGAC CGATGATGTT TATGGGCGAG CAGGCGATAA CCGGGGCAAT CAAGTATGTT ACCAGCGAGA CCATACCCAC GGTGTATTTT ATTGACGCCC ACAGCAAAAG AAAGCTGGAG TCGGATTACA CATATCTTCA AAAAGCTCTT GAGAACAACG GCTATGAAGT CAAGAAACTC GACCTTACAA GGCAGGAAAA AGTGCCCGAA GACACCACTG TATTGTTTTT TGCACCTCCC ACCCAGGATT TGTCGGTGGC AGAAAGGGAT AAAGTCCTTG ATTATCTGAA AGGCGGCGGA AACGCAATAT TCTTGTTTGA CCCGTCGAAT GATGATGAGA GGTTCGATAA TTTCGACAGA GTGTTAAATG AATACAGCAT GGCGTTAAAT TATGACAGAG TAAAAGAAAA CAACGACATG TACTATGTTG CAAACAGACC TTACCATATT ATACCTCAAG TTGGGTATAC CGATATTACA AGCCAGGAGG ATACAAGCAA ATTTACCGTT ATAATGCCTG ATTCCAGAAG TATTAAGAGA TTGGCCAATG ACAAGGAGCC TTTGACGGTT TTACCTCTTC TTACCACCAG CGAAGAGGCT GTGGGAGAAC CTTTTGGCGG AGGAGAGACG GAAGAAACCC GTGGTCCGCT TGATATTGGT CTTGTCGCGC AGTATTCCAG TGCTGTAACT ACCAAAATTG TTGTTATCGG AAACGGTTAT TTCCTGACGG ACGAAGTGTA TCAGTCATAT TTCCCGTATT CGTCATACAA TTTGTATCTT ATAGGTCTTG CGTCGGATTG GATGCTGGAT AAGTCAAATG ATGTGTTTAT TGTGGCAAAA ACTTCGATAA CCGACACCAT AAATCTCAGC GGTTTTAATG CGGCTTTGAT AATAGCCATT GCTGTTTTGG CATATCCGCT TATAATTACT TCAACAGGTA TTATTATATG GCTGAGGAGG AGGCATCTAT GA
|
Protein sequence | MKKGFKYSKN FKYGMVFTVM AALVVGITIL VNAFVSALNI RWDVSQNKMY SIGEQTKVIL EGLKQEVDVV MLADRDEIKT YEAGFILVEF LDKYDKFDKV NVKFVDPDKN PDIVKELNKS GTLNPQQNEI IVRSGDKVKK VTIYDIFQQD YYGPMMFMGE QAITGAIKYV TSETIPTVYF IDAHSKRKLE SDYTYLQKAL ENNGYEVKKL DLTRQEKVPE DTTVLFFAPP TQDLSVAERD KVLDYLKGGG NAIFLFDPSN DDERFDNFDR VLNEYSMALN YDRVKENNDM YYVANRPYHI IPQVGYTDIT SQEDTSKFTV IMPDSRSIKR LANDKEPLTV LPLLTTSEEA VGEPFGGGET EETRGPLDIG LVAQYSSAVT TKIVVIGNGY FLTDEVYQSY FPYSSYNLYL IGLASDWMLD KSNDVFIVAK TSITDTINLS GFNAALIIAI AVLAYPLIIT STGIIIWLRR RHL
|
| |