Gene Information |
Locus tag | Cthe_2879 |
Symbol | |
ID | 4809086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3404469 |
End bp | 3406004 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640108298 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001039270 |
Protein GI | 125975360 |
COG category | |
COG ID | |
TIGRFAM ID | |
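
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3404469
end_bp = 3406004

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the table states) and ends in one stop
# codon that is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1536 0 511
```

Under these assumptions the result matches the listed gene length (1536 bp) and protein length (511 aa).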
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.42565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCAT CGAAAGTTGG GCAGAAGGTT ATCTTGTTTT TAGTGTCTTT TTCATTGTTC ATCTCATGTA TCACAATTTC AGCAACCGCT GCCAATGGCG GAAAATTGGG AGACATAAAC AGCGACGGAT CCATCAATTC CACCGATGTG ACTTTATTAA AAAGACATCT TCTCAGAGAA AACATACTTA CAGGGACGGC ATACTCCAAT GCTGATACCG ACGGAGACGG AAAAATAACC TCCATCGACT TAAGCTATTT GAAAAGATAT GTATTGCGCC TTATATCTTC TTTTCCGGGC GAAACGTCAA ATAATCTTAA CATACCCTGG GATTGGGTCG GAATCATAGG AACGGGACAA AGTCTCTCCG TAGGAACTAC TCCAATTTTA TCTACAACCC AGCCTTACAA CAATCTTAAG CTGGATTTGG GCAACCTGAG GGTTCCTCCC TACGATGCCA ACAGCAGTGA ACTGAAACTG GTTCCCCTTA CCGAACCTAT TCGCAGCCTT GCAACAGGTT TCCCGTCTGC CTACCCAAGG AACCTATATG GTGAGACCCC CCATAGTGCA ATGGCGAACC AAATAACTGC AATGGTAAAA GCTGCAGGCA GAAACGACTA TATTTCAGTA CATACGGTAG TAGGTGAATC CGGTCAGGGT ATGTCTGTAA TCAAAAAAGG TGCCACCGAT ACTGGAAATA CCGGCCGTGC CTATGCGGCA TCAATATTTG AAGTCACAGC AATTAACCGC TTGGCAAAAG CCGCGGGAAA AACCTACGGG GTGGGAGGAA TCATCCTAAC CCATGGCGAA ACCGATTGCG GTAATCCCAA TTATGAAAAT GAGCTACGCC AATTGTGGTC GGATTATAAC AAGGACATAA AGGCTATAAC CGGACAAACC CAAAATATCC CGATGTTTGT TGTGCAGCAG CACTCCTATC CAAGTACGGG AACTTCCGCT TCCACCTTGG CTCAGTGGAA AGCAGGTGTT GATTATCCCG GTGATATAAT CTGCATAGGT CCAAATTATC AGCGAACTTA CGGAGGAGAT AACGTCCATT TGACTTCTGC CGGATATCAA CATTTGGGTG AAAAGTACGC TCAGGTATAT TATGAAAAGG TTATCCTCGG TAAGGACTGG AAACCTCTTC AGCCTATAAA AGCCACCAAA AACGGCAGAA CCATCATAGT AGACTTCCAT GTACCGGTAC CGCCCCTTGT TTGGGACAAT ACGCTTCCGG CACCAAACCA GAACACCTTG ACCGAATGGA GAAACGGCAA AGGGTTTGAG GTTACTGCCA ACGGTTCAAG AGTCACAATA AATTCGGTTG AAATCTCGGG AAATTCGGTA ATAATAACCT GTGCCAGTGA ACTGCCCGCT TGGGGTGTAA AAGTCGGCTA TGCCTTTACC GGCGGAAAAG CAAGACCTAA CGGAACCTAC CGCTGGGGTT TGCTGCGCGA CTCAGATCCC TTCATAGGAA GGTCCGGTGT GGCACAGCCC AATTTCTGTG TATCTTTTGA AATGCCCGTT AATTAA
|
Protein sequence | MKASKVGQKV ILFLVSFSLF ISCITISATA ANGGKLGDIN SDGSINSTDV TLLKRHLLRE NILTGTAYSN ADTDGDGKIT SIDLSYLKRY VLRLISSFPG ETSNNLNIPW DWVGIIGTGQ SLSVGTTPIL STTQPYNNLK LDLGNLRVPP YDANSSELKL VPLTEPIRSL ATGFPSAYPR NLYGETPHSA MANQITAMVK AAGRNDYISV HTVVGESGQG MSVIKKGATD TGNTGRAYAA SIFEVTAINR LAKAAGKTYG VGGIILTHGE TDCGNPNYEN ELRQLWSDYN KDIKAITGQT QNIPMFVVQQ HSYPSTGTSA STLAQWKAGV DYPGDIICIG PNYQRTYGGD NVHLTSAGYQ HLGEKYAQVY YEKVILGKDW KPLQPIKATK NGRTIIVDFH VPVPPLVWDN TLPAPNQNTL TEWRNGKGFE VTANGSRVTI NSVEISGNSV IITCASELPA WGVKVGYAFT GGKARPNGTY RWGLLRDSDP FIGRSGVAQP NFCVSFEMPV N
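
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a record could be cleaned and checked, using only the first 30 nucleotides of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean("ATGAAGGCAT CGAAAGTTGG GCAGAAGGTT")
print(translate(prefix))  # prints: MKASKVGQKV
```

The translated prefix agrees with the first block of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give the figure to compare against the reported GC content.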