Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1911 |
Symbol | |
ID | 4810769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2273311 |
End bp | 2277183 |
Gene Length | 3873 bp |
Protein Length | 1290 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107328 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038323 |
Protein GI | 125974413 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.374482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGGG GAAAACTACG GAAGATAATT GCTTTAGTGC TGGTTTTGGG TTTCATTGCC GGCAACTTTG CAGTAACCAC CTTTGCGGCG GAAACTGTGA AACCTTATTG TACCAGGACT TCAACGCCGG TCATGGCGGT AAATCTCGGA AGCAATGAAG ATGTCAGCCA TGGCGGTACC AATTTTAAAA AGCAGTCGGC TTACAGCAAC CTTAAAGTGG AGGTCAAGGG GCCTTCTGAT TCCGCATATC CCACCGGAAC AATCAATGCT CTCTCGGTTA TTCAGGCAGA GAATTGCGAT GAAAACCATG GACTGGAAAT CGAAGACTGC CCGGATGAGG GAGGCACAAA GAACCTGGCG TATATAGCCA ATGGAGACTA TACAGCTTAT TATAATGTAT ATTTTCCAAA AGGAACAAAA GGTTTTATAG CAAGGGTTTC AAGTGACACT GAAGGAGGAT ATATTGAACT TCGCCTTGAC TCAATTTCGG CGGAAGTGGT TGGACGATGC CGCGTAGAAA ACACCGGAGG ATGGGAAAAG TATGAAGAAG TTTACTGCGA ACTAAATAAA AGTGTTGAAG GCGTACATAC TTTGTATATG GGCTTTGCCG GAGAAAGGGA CGGTCTTTTC AACGTCAACT GGTTCAGGTT TACAAAGAGC CCGTATGAGC CGGTAATGAC AAAAAGCCGT GACGGAGCGG TTGACGGAGC CTATTTGTAC AAGTTCATTG ATTTTGGGAA GGAAAGTATT CCAACCAGGT TTAAAATGCA CCTTTCAGGA AGCATAAGCG GAATAATAAA TGTAAGGCTT GACAGTCCAT CAGGAGATGT TATAGCAACT GTTGACGGCA CAAACGGCGC CGGAGAAGTG GAAGGGCGCG TTGAAAAGCC TGTTACGGGA ATTCATGAGT TATATTTGAC CGATGATAAG AACGGTACTT TAATATCCAA AGTTGACTGG TTTGTATTTG AACCGGAAGA ATCAAAAGAA ATAACCGACA GAAATCTTCA GAATTTTGCC AAAACAAGTG TAAAAGGATA CAATGTAAAT CTTGAGGCCT CTCTTGATAA AAACAATGTC TACGAAGTTT ATATCCATCC TGTAGAGTTC CGCAAGAAAA ATCGTCAGAT ATTTGACCTT TATATCAATG GTACGCTGGT GGATACCATT GACACCGAGG AGTCGGGACT TGATTGGGAG AAAAAAGGAC CGTACATCAC AAAGGTTTTG GATGACGGAA AGCTGAAAAT TGAATGCAAA TCCAGACAAG GTATTGTATC TCTGGCCGGT TTGGAAATCA ACAAGATAAC GTATTCCAAA GCTTTTTCCG ATGTAAAAAT CAAAGACTGG TTTTATATTC CTGTAATGGA GCTCGCAAGC CGGGGAGTTA TATTCGGCAA GGGCAATGAC ATGTTTAAGC CGCAGGACCA TATAATCGGC GAGCATGTGG CATATATGAT GTTTAATGTC ATGAAGGTGT CCATTGCAGA AAATGACAAG GAATTCAATC CGGAAAAGTA CAGAAACCTG TCGGATGTGC CTCCGAGTTT TTGGGCTTAT CCCTATATGA GCGCTTATTA CAACTATTTC TTTAAAGAAA AAATGCTAAG ATATGATGTT AATACCCGTG TTCCTTACAG TGCAAAGGAG TACGAGGAGA AAAAGAAAGT AAGACGCGAA GAGTTTGCTA TGGCGATTAT AGGGGCAAGG CGTCTTGACT ATAATGAAGA CGGCAAGGTG TTTGTACTGG ATCCGTATCT TGAACCTTCA GCAATGTTGA ACAAGTATAA GGACAAAGAT GCCGACAAGA TTACGGATTC CTTCAGGTAT TTTGTAGAAT TGGCTTTGGA AAAGGGTCTG ATGAAAGGTG ACCAGTTTGG TTACTTAAAT CCTCAAAATC CTGTAACGAG AGGAGAGGCG GCGGCATTTA TATACAATGC CTTAAATCTT GACGAAAACA ATTTTGTAAA GCCGAAAAGG GGAGAGAAAA TTCCTGTACC GAGGATAACA GCCAGGAAGA GAAATATAAA TGTGGGAATT CTTATTTTGC CGGCACCGGC ATGGGATTCA ATAAACAATA TACCGGTGAA TGACCCAAAT CCTGACTTTA CTTTAATGGA GCTTTTGAAC AGGAACATAA ACAAGCCGAT GGATTGGGAG TTGGTGAATC CTCATCCGCC TGCCTTTGAC AAGAGTGAAT ACAAAGATAT AATGCACCTT AACTCATCCA AGATTCCGGG AATTGACAAC CAAAGCCACA GTGATTTCTG CGCATATTTC AACGACCTCA GGAGCGTGGC CAGGGCACAA ACCGATCTTG AAGCCGATAT AACTTATCTT GGCACGGTCG GATACAGTGA AAATATAAAT AAGTCGAAGT TCTTCAAATA TTGGGAGGTT CATCTGGACG ACCCGAATTT GACACCGGAA AAGATTGCAA AAGACTATGA CCTTCTGTTC CAGACATCCC ATGGTAAAAT AACATATTCA AAGGATGTCC AGGACAAAGT CAAGGCGTTC CTGAAAGCCG GCGGCCAGTT ATGGTGGGAA AACTGCAAAG GGCTTGAAAT TGAATCCGGA GACGGTTTTA CGGAAGAAGT TAAGTTTGTG TCGCTGCATC CGGGTCATAA CCGCAAGTAT CCTCAGATAC CTGTTTTAGA CGACGAAGGG AAAATGCATC CGTTGTTTGA CAATATTTTC AGAATCAATC CGGAGAAAAC ATCCCGTGTA TTTGCACCGG GTATATACAA CAAGAACAGC GAGATATCAA TGCTGGGCGA CGGTGAGGAA TGGCTCAACG ATGACAACAG GTATCTTGAT GAATTGCAGC CTGATGATAT TGTAATTCTC AACATAGAGA ATACCGACAC AGGCGAAATA CTTCCCAACA TGGCGGTAAG AAATATAGAG AATGAGGATG CGCCTGACGG AAGAATTGTA ATTAGCACAA ATGATATTGG ATGCGGTATA ACAAAATTTG TGGACCGCGG TGGCGGAAAA GCCGTTGAAG ACTACAAATT CTGCTACAAT CTTTTGGGTT GGATGTCCAA GATAGATGTA AGCTTCGATG AAACAACTGT CAACCAGTGG GACGGAGGCA GTGAGTTTTC CGTGGAAGCC ACATTCACAA ACAATGGAGC AAAGAAACAG ATTTATGACG TTACATATGA ATATGATCCT AAACTTTGGA ATCTTGTACC AACGAGCGAC TTTAAGAATT ACAAACAGAC TCATCCATGG ATTAAGGCTT TGGATGAAAA CGGATATCCG AAGAAAATTG AACTTGAGCC CAATCAGACG GAAGTAGTGA CATATAAATT CAACATCAAG AGAACAAACC TCCGCTGCTA TGACTTTACG ATAAAAGCAA GTGAATCGGG TGTGAAGTAT ACCCGCGACA TGGCTGAAAC GTTGTACAGA CTGAATAACG TAAGGGTTGA GGAGCCGATA TTCTCAGGAC GGAGGAATAA CGGAAGTGAA GCTTCTTTCG ATGTGACAAT CAACGCACCG GAGGAACCGG ACAGTGACCT TAGAACCGAG GATTATGAGC TTAATATAAA AATTAAAAAG GATGGAAGCT TTATTGATCC GGAAACCGTT ATAGACAATA TTGAGCTTCT GACGGACGGA AATACACCGC CGCTGGAAGG TTACAATTAC AAGTACTTGG TTGACAATAA GGGTGTTCTG TATCTGAAGG TTATTATAGA AGACACGCTG ATTACGAAGC CAACTGAAAA AATCAGGCTG AATATATCTT TAAAGAACCT TGACAGTGGC AGTTATGAGG TTGCCGGAAA GATAGAGGTA ATTGATCCGG TTTCCCGTAG AAGGCTTGCA TTCTCAGATG AGGCAATATA TAAAATAAAA TAG
|
Protein sequence | MRRGKLRKII ALVLVLGFIA GNFAVTTFAA ETVKPYCTRT STPVMAVNLG SNEDVSHGGT NFKKQSAYSN LKVEVKGPSD SAYPTGTINA LSVIQAENCD ENHGLEIEDC PDEGGTKNLA YIANGDYTAY YNVYFPKGTK GFIARVSSDT EGGYIELRLD SISAEVVGRC RVENTGGWEK YEEVYCELNK SVEGVHTLYM GFAGERDGLF NVNWFRFTKS PYEPVMTKSR DGAVDGAYLY KFIDFGKESI PTRFKMHLSG SISGIINVRL DSPSGDVIAT VDGTNGAGEV EGRVEKPVTG IHELYLTDDK NGTLISKVDW FVFEPEESKE ITDRNLQNFA KTSVKGYNVN LEASLDKNNV YEVYIHPVEF RKKNRQIFDL YINGTLVDTI DTEESGLDWE KKGPYITKVL DDGKLKIECK SRQGIVSLAG LEINKITYSK AFSDVKIKDW FYIPVMELAS RGVIFGKGND MFKPQDHIIG EHVAYMMFNV MKVSIAENDK EFNPEKYRNL SDVPPSFWAY PYMSAYYNYF FKEKMLRYDV NTRVPYSAKE YEEKKKVRRE EFAMAIIGAR RLDYNEDGKV FVLDPYLEPS AMLNKYKDKD ADKITDSFRY FVELALEKGL MKGDQFGYLN PQNPVTRGEA AAFIYNALNL DENNFVKPKR GEKIPVPRIT ARKRNINVGI LILPAPAWDS INNIPVNDPN PDFTLMELLN RNINKPMDWE LVNPHPPAFD KSEYKDIMHL NSSKIPGIDN QSHSDFCAYF NDLRSVARAQ TDLEADITYL GTVGYSENIN KSKFFKYWEV HLDDPNLTPE KIAKDYDLLF QTSHGKITYS KDVQDKVKAF LKAGGQLWWE NCKGLEIESG DGFTEEVKFV SLHPGHNRKY PQIPVLDDEG KMHPLFDNIF RINPEKTSRV FAPGIYNKNS EISMLGDGEE WLNDDNRYLD ELQPDDIVIL NIENTDTGEI LPNMAVRNIE NEDAPDGRIV ISTNDIGCGI TKFVDRGGGK AVEDYKFCYN LLGWMSKIDV SFDETTVNQW DGGSEFSVEA TFTNNGAKKQ IYDVTYEYDP KLWNLVPTSD FKNYKQTHPW IKALDENGYP KKIELEPNQT EVVTYKFNIK RTNLRCYDFT IKASESGVKY TRDMAETLYR LNNVRVEEPI FSGRRNNGSE ASFDVTINAP EEPDSDLRTE DYELNIKIKK DGSFIDPETV IDNIELLTDG NTPPLEGYNY KYLVDNKGVL YLKVIIEDTL ITKPTEKIRL NISLKNLDSG SYEVAGKIEV IDPVSRRRLA FSDEAIYKIK
|
| |