Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1398 |
Symbol | |
ID | 4809059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1707538 |
End bp | 1710066 |
Gene Length | 2529 bp |
Protein Length | 842 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640106821 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001037822 |
Protein GI | 125973912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.461235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAAAA AGTTTACAAG TAAAATTAAG GCTGCTGTTT TTGCGGCTGT AGTTGCTGCA ACGGCAATAT TTGGCCCCGC GATTTCCAGC CAGGCTGTAA CCAGCGTGCC TTACAAATGG GACAACGTGG TAATCGGCGG AGGCGGAGGA TTTATGCCGG GTATAGTTTT TAATGAAACG GAAAAGGATT TGATTTATGC ACGTGCCGAT ATCGGAGGAG CGTACCGGTG GGATCCTTCG ACCGAGACAT GGATTCCGTT GCTGGACCAT TTCCAAATGG ATGAGTACAG TTATTACGGA GTGGAAAGTA TTGCAACCGA CCCTGTGGAT CCGAACCGTG TTTACATAGC TGCAGGTATG TATACCAACG ATTGGCTTCC TAATATGGGA GCAATTCTTC GCTCAACGGA CAGGGGAGAA ACATGGGAAA AAACCATACT GCCTTTCAAG ATGGGCGGAA ACATGCCGGG AAGATCCATG GGAGAACGTC TTGCGATCGA CCCGAATGAC AACAGGATTC TTTATCTTGG AACACGATGC GGAAACGGAC TTTGGAGAAG TACCGACTAC GGTGTAACAT GGTCCAAGGT TGAAAGTTTC CCAAATCCCG GAACTTACAT TTATGACCCG AATTTTGATT ATACCAAAGA CATTATTGGA GTAGTCTGGG TTGTTTTTGA CAAGAGCAGC AGTACACCGG GCAACCCTAC CAAGACTATA TATGTTGGTG TGGCTGATAA AAACGAAAGT ATTTACCGCA GTACGGACGG GGGTGTCACC TGGAAAGCAG TTCCCGGACA ACCTAAGGGA CTACTTCCTC ACCACGGGGT TTTGGCATCC AACGGAATGT TGTATATAAC TTATGGTGAT ACCTGCGGTC CTTATGACGG CAACGGAAAA GGTCAGGTTT GGAAGTTCAA TACACGTACA GGGGAATGGA TAGATATCAC CCCGATACCT TATTCAAGCA GTGACAATCG TTTCTGCTTT GCAGGACTTG CAGTGGACAG GCAGAATCCT GACATTATAA TGGTAACTTC CATGAACGCG TGGTGGCCGG ATGAATATAT TTTCCGCAGT ACTGACGGCG GAGCTACATG GAAGAATATC TGGGAATGGG GAATGTATCC TGAACGTATA CTGCATTATG AAATAGATAT TTCCGCAGCA CCGTGGCTGG ATTGGGGAAC TGAGAAACAG CTGCCGGAAA TCAATCCGAA ACTGGGATGG ATGATAGGTG ACATAGAGAT TGACCCGTTT AATTCCGACC GCATGATGTA TGTTACCGGT GCAACTATCT ATGGTTGTGA CAATCTTACT GACTGGGACA GAGGCGGCAA AGTAAAAATC GAGGTAAAAG CTACCGGAAT AGAAGAATGT GCGGTATTAG ACCTGGTAAG CCCGCCGGAG GGTGCACCGC TTGTAAGTGC AGTTGGCGAC CTTGTCGGTT TTGTTCATGA TGACCTGAAA GTTGGTCCGA AAAAAATGCA CGTTCCTTCT TATTCTTCAG GTACGGGAAT TGATTATGCG GAGCTTGTTC CGAACTTTAT GGCATTGGTT GCAAAGGCTG ATTTGTATGA TGTAAAGAAG ATTTCTTTCT CTTATGACGG AGGAAGGAAT TGGTTCCAGC CACCTAATGA AGCACCAAAC TCGGTAGGCG GCGGTTCGGT TGCCGTTGCA GCCGATGCAA AATCAGTTAT TTGGACACCG GAAAATGCAA GTCCTGCAGT TACAACGGAC AACGGAAACT CATGGAAAGT TTGTACAAAT CTTGGTATGG GTGCGGTGGT GGCATCCGAC CGTGTGAACG GTAAAAAATT CTACGCATTC TATAACGGCA AATTCTATAT AAGCACGGAC GGTGGATTAA CCTTTACCGA TACAAAGGCA CCGCAGCTTC CCAAGTCGGT TAACAAGATA AAAGCCGTAC CGGGCAAGGA AGGACATGTA TGGCTTGCTG CAAGAGAAGG CGGATTGTGG AGGTCCACTG ACGGTGGATA TACGTTTGAG AAACTCTCCA ATGTTGACAC AGCTCATGTG GTAGGCTTCG GAAAGGCAGC ACCGGGACAG GATTACATGG CGATTTACAT TACCGGTAAA ATTGACAATG TTTTAGGATT CTTCCGTTCC GATGATGCCG GCAAGACATG GGTGCGTATC AACGACGACG AGCACGGATA TGGCGCTGTT GATACTGCAA TAACAGGTGA CCCGAGAGTA TACGGACGTG TATATATTGC CACCAACGGA AGAGGTATTG TTTACGGCGA ACCTGCTTCA GATGAGCCTG TACCCACTCC TCCGCAGGTT GACAAAGGCC TGGTGGGCGA CTTGAACGGT GACAATCGAA TAAATTCAAC AGACCTTACT CTTATGAAGA GATATATCCT TAAATCGATA GAAGATTTAC CTGTCGAAGA TGATTTATGG GCGGCGGACA TAAACGGCGA CGGCAAAATA AATTCCACAG ACTATACATA CCTAAAGAAG TATCTGCTTC AAGCCATTCC GGAGCTGCCG AAAAAATAG
|
Protein sequence | MVKKFTSKIK AAVFAAVVAA TAIFGPAISS QAVTSVPYKW DNVVIGGGGG FMPGIVFNET EKDLIYARAD IGGAYRWDPS TETWIPLLDH FQMDEYSYYG VESIATDPVD PNRVYIAAGM YTNDWLPNMG AILRSTDRGE TWEKTILPFK MGGNMPGRSM GERLAIDPND NRILYLGTRC GNGLWRSTDY GVTWSKVESF PNPGTYIYDP NFDYTKDIIG VVWVVFDKSS STPGNPTKTI YVGVADKNES IYRSTDGGVT WKAVPGQPKG LLPHHGVLAS NGMLYITYGD TCGPYDGNGK GQVWKFNTRT GEWIDITPIP YSSSDNRFCF AGLAVDRQNP DIIMVTSMNA WWPDEYIFRS TDGGATWKNI WEWGMYPERI LHYEIDISAA PWLDWGTEKQ LPEINPKLGW MIGDIEIDPF NSDRMMYVTG ATIYGCDNLT DWDRGGKVKI EVKATGIEEC AVLDLVSPPE GAPLVSAVGD LVGFVHDDLK VGPKKMHVPS YSSGTGIDYA ELVPNFMALV AKADLYDVKK ISFSYDGGRN WFQPPNEAPN SVGGGSVAVA ADAKSVIWTP ENASPAVTTD NGNSWKVCTN LGMGAVVASD RVNGKKFYAF YNGKFYISTD GGLTFTDTKA PQLPKSVNKI KAVPGKEGHV WLAAREGGLW RSTDGGYTFE KLSNVDTAHV VGFGKAAPGQ DYMAIYITGK IDNVLGFFRS DDAGKTWVRI NDDEHGYGAV DTAITGDPRV YGRVYIATNG RGIVYGEPAS DEPVPTPPQV DKGLVGDLNG DNRINSTDLT LMKRYILKSI EDLPVEDDLW AADINGDGKI NSTDYTYLKK YLLQAIPELP KK
|
| |