Gene Information |
Locus tag | Cthe_2195 |
Symbol | |
ID | 4811060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2618279 |
End bp | 2621176 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107601 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038590 |
Protein GI | 125974680 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
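The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in Protein Length. Neither convention is stated in the record itself. The function names are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table for Cthe_2195.
gene_length = span_length(2618279, 2621176)
protein_length = expected_protein_length(gene_length)

print(gene_length)     # 2898, matching "Gene Length"
print(protein_length)  # 965, matching "Protein Length"
```

Under these assumptions the reported Gene Length (2898 bp) and Protein Length (965 aa) agree with the coordinates on NC_009012.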
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.862232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAATG GAATTATAGG AATTATGACC AAAAGACATA TGATAGTGAT AATGGCTTTA CTGTTTACGG TATCAGTTCT TTCGGCCGGA CTATTATTCA TAAATACGGT AAACGCAGCG GAACCGATAA CCTATTATGT ATCTCCCACC GGTAGTGACA GCAATACGGG TACAATAGAT GCACCCTTTA AGACGATTGC AAAAGCCCGG GACGTGGTGA GAACCGTCAA CGGCAATATG AAAAGTGATA TTTATGTATA TCTGAGGGGC GGCACTTATA ATATAACCGA AACAATCACG TTTGGCCCAC AGGATTCGGG AACAAACGGA TATAGGATTT ACTATATGGC GTATCCCGGA GAAACGCCTG TATTAAGCGG TGCAACAAAG GTTACAGGCT GGACGAGGCA TAACGGCAAT ATATACAAGG CAAAGTTAAA TCGTTCGACT AAACTGCGAA ACCTGTATGT AAATGACCAA AGAGCTTCGA TGACCAGCAA GAGAGTAACC GCCAGAGGGG GACACGGAAC TTACACCGTT ACTGCCGGGC AGGCTCCCTG GGCGTGGACC AGCGGAAGCA AAAGCGACGG TGTTCGGTAT GATATGTCGG AAGTACCGGA AATTACCCGC AATAAAGATG ACCTTGAGAT AGTAAACGGT ACTACATGGA ATGAAAATAT TGTGTGTACC CGCGATGTAA TTACAGCCAA CGGCTACAGG GTGCTTCTTT TGCAACAGCC TTACGGCGCC ATAGCGCAGA CCCCCGGTTG GGGTGCGGCT TTTACTACTT CCGGTACCCA TACAATTTAT AATGCCTTTG AATTTTTAAA TTCTCCGGGG CAATTCTATT TTGACAAAAC CGAACAAATG CTTTATTACT ATCTCCGTCC CGGAGAAAAT ATAGAGACGA TTGACGTTCA GGCCCCAATG GTTGAAAAAC TCATTGAGAT TGCCGGAACA TCAACTTCAA ACAGGGTAAA GAATATAACC TTCCAGGGCA TTACCTTTGC GTATACCGAT TACAACCTTG TCGAGGTCGG AGGTTCGCGG GGTAAATCGA CATGCCAGGC TGCCCAAGGC TTTATAGCTT TTTTCAACGA TAATTGGCAC TACACCAAAT ATGATCTTGT TGATACATTG CCGGGAATGA TCAACCTAAG AAACTGCGAT TCCATTGATT TTATTGAAAA TGTAATTAAG CATAGCGGAG CCGACGGAAT TTCCATGGTA AACGATGTTA TAAACTGCAA AATCATCGGC AACTATATTA CAGATATAAC ATCAAGCGGC ATAACGGTAG GCCATCCGCA GCATGTTTAC ATTGGAGACG GCGGGAGCCG TGCAAAATTT CCTTCCGGAG TAGAAGGTGT TTGCAAGAAC AATACCATTT CAAACAATGT GTTGTACGAC ATAAGTATGG TTCCGGGATT TGGCGGATGT GCCGGCATTA CAGCATACTT TGTGGAAGGT CTGGAAATAA CTCACAACCA TGTCCAGAAG ACGGCCTACA ACGGTATACA TTTGGGCTGG GGATGGTGCA ATTTTAAAGA CTCCACAACG TGCAAAAACA ACACAATAAG CTACAACAGG GTTGTTGATA CCTTGTCCAG GCTACATGAC AGCGGAGCAA TATATACCAT AGGCCAGATG CCGGGTACAA ATATCAACGA GAATTATGTA AAGGGTATTC CACCGGCAAC ATATGGCCCT ACTTATGGCT TGCATAATGA CGAAGGCACT GCATATATAA TTGAAAACGA CAACGTCCTG AATATCGACC CGGGAGTAAA ATATACCATC 
AACTGCGAAG ATTTCGGAGA AAAACACGAT CTGACAATCC TGAGGACCTA TGCAACGGTG AGCAAAATGG GAAAAAATCC TCCAAACAGC AGAATTGACC CTCCCGTTGC CGTCCCGGAT AATGTATGGC CTTTACGGCA GTATAATGTG TGCCTGAATT CGGGAATTCA GGATGAATAC AGAAAAATTA TGCCTGAGAG CTTACTTTCA ACGCCGGATT ATGTATTCCC GGCAAGCTGT GCTGCGGAAG CTGCGTCCAT TATAAATATA AGAAGCAGCG GAGATCCTTC AAACACGGTA TGGTTTGCAC CTCCCGGGAC AACAACCTTT GTTGAAGGAG CTACCATGAC CAAGGCGGCA GGAGACGCAA CTTCCATTAT TGCTCCATAC ACAGCCGGAA CATACAAGCT GTACATAGTT AATTCCCAGG GTGTAAAAAT CGGAGAGTCG GAATCAATAT TGAGAGTGAG CGGCTCTGTC AATCCTCCGC CTAAGGAACC GCGTTCGGCC TTTACCCGGA TTGAGGCCGA GAGCTACAAC GGACAATCGG GAATCCAGAC CGAAAACTGC AGCGAAGGCG GAATGGATGT AGGGTATATT GAGAACGGAG ATTATGTTGT TTATAAGAAT ATAGATTTTG GAAAAGGGGC AGCAAGTTTT AAAGCGAGAG TAGCCAGCGC TACAAGCGGA GGCAATATTG AACTTAGGAT TGACAGTATT GACGGACCTG TAGTGGGTAT CTGCCCGGTT GCAGGAAGCG GTGGCTGGCA GCAGTGGGTT GATGCCACAT GTGAGGTCAG CGGGCTTAAG GGAGTCCATG ATCTCTACTT AAAATTTACC GGTGGCAGCG GTTACCTGCT TAATATAAAT TGGTTTACCT TTGTTGAAGG AAACAATGAT GAGAATTTGG GTGATTTAAA CGACGATGGA AAAGTAAACT CGACAGACTT TCAGATATTG AAAAAGCATC TGCTTCGCAT AACTTTGCTT ACGGGAAAAA ATCTTTCAAA TGCGGATTTA AACAAAGACG GCAAAGTAGA TTCGAGCGAT TTGAGTTTGA TGAAAAGATA TCTGCTTCAA ATTATACCTA CTTTTTAA
|
Protein sequence | MVNGIIGIMT KRHMIVIMAL LFTVSVLSAG LLFINTVNAA EPITYYVSPT GSDSNTGTID APFKTIAKAR DVVRTVNGNM KSDIYVYLRG GTYNITETIT FGPQDSGTNG YRIYYMAYPG ETPVLSGATK VTGWTRHNGN IYKAKLNRST KLRNLYVNDQ RASMTSKRVT ARGGHGTYTV TAGQAPWAWT SGSKSDGVRY DMSEVPEITR NKDDLEIVNG TTWNENIVCT RDVITANGYR VLLLQQPYGA IAQTPGWGAA FTTSGTHTIY NAFEFLNSPG QFYFDKTEQM LYYYLRPGEN IETIDVQAPM VEKLIEIAGT STSNRVKNIT FQGITFAYTD YNLVEVGGSR GKSTCQAAQG FIAFFNDNWH YTKYDLVDTL PGMINLRNCD SIDFIENVIK HSGADGISMV NDVINCKIIG NYITDITSSG ITVGHPQHVY IGDGGSRAKF PSGVEGVCKN NTISNNVLYD ISMVPGFGGC AGITAYFVEG LEITHNHVQK TAYNGIHLGW GWCNFKDSTT CKNNTISYNR VVDTLSRLHD SGAIYTIGQM PGTNINENYV KGIPPATYGP TYGLHNDEGT AYIIENDNVL NIDPGVKYTI NCEDFGEKHD LTILRTYATV SKMGKNPPNS RIDPPVAVPD NVWPLRQYNV CLNSGIQDEY RKIMPESLLS TPDYVFPASC AAEAASIINI RSSGDPSNTV WFAPPGTTTF VEGATMTKAA GDATSIIAPY TAGTYKLYIV NSQGVKIGES ESILRVSGSV NPPPKEPRSA FTRIEAESYN GQSGIQTENC SEGGMDVGYI ENGDYVVYKN IDFGKGAASF KARVASATSG GNIELRIDSI DGPVVGICPV AGSGGWQQWV DATCEVSGLK GVHDLYLKFT GGSGYLLNIN WFTFVEGNND ENLGDLNDDG KVNSTDFQIL KKHLLRITLL TGKNLSNADL NKDGKVDSSD LSLMKRYLLQ IIPTF
|
| |
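The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical and not part of the source database.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace between blocks is ignored.

    Stop codons are rendered as '*', unrecognised codons as 'X'.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three 10-base blocks and the final 18 bases of the gene sequence above.
print(translate("ATGGTAAATG GAATTATAGG AATTATGACC"))  # MVNGIIGIMT
print(translate("ATTATACCTA CTTTTTAA"))               # IIPTF*
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence, with the terminal TAA read as a stop.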