Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0234 |
Symbol | |
ID | 4808582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 282265 |
End bp | 284796 |
Gene Length | 2532 bp |
Protein Length | 843 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640105646 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001036666 |
Protein GI | 125972756 |
COG category | [G] Carbohydrate transport and metabolism [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG1082] Sugar phosphate isomerases/epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000591284 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTG CGTTTTCAAC ACTTGGTTGT CCTGACTTCA GTTGGACGGA CATTTATTCC ATGGCTAAGG ATTTTGGATT TAACGGTATC GAAATCCGTG GTCTTGGAAA GGAAATTTTC GCCGTGAAAG CACAGCCTTT TACCGAATCA GAGCTGCCTA AGACTTTAAA AAAGCTTTCG GAACTTCGTC TTGAAATTCC GTGCTTTTCT TCGGGATGCT GTTTGAAGTT TTCCGAGAAT GCCGAGAAAA ATTATGAGGA GATTGTAGAG TATATTACGC TTGCTTCCAA AACAGGAACT CCTTTTGTCC GTGTTCTTGG CGACCTTGAG CCGGAACCTC AGGGAGAAGT TGATGACAAT GTTGTTATTG AGGCACTGAA AAAACTTGCC CCCATTGCGG AAGAAAAAGG TGTAACGCTT CTTGTGGAAA CCAATGGTGT ATATTCCGAC ACAAAACGTC TGTGTGAGCT GCTTGACAAT GTGGCCAGTG ATGCAGTGGC GGCTTTGTGG GATGTACACC ACCCGTATAG ATTTGCCGGT GAGACTCCCG GAAAGACGGT GCAAAATCTT GGAGCATACA TTAAATATGT ACATATCAAG GACTCGGTTG TTGAAAACGG AAAAATTCAT TATCGCATGC TTGGTGAAGG TGATTTGCCA ATTGACGATA TCATGATGGC ACTTCGTTCA ATCAACTATG AAGGATACAT TTCTCTGGAA TGGGTTAAAC GGTGGGCTGC GGACCTCGAC GATGCCGGAG TTGTCTTCCC CAATTTTGCA AATTACATGA GCCGCTACAT TAAAAAAAGC GAAGTGAGAG GGCGCTTGTT TGACAATGCG AGAAAGACCG GAAAGTATAT TTGGGAGAAA GACACGCTTA TTGATTTGAC ATTCCCTCAG CTTTTGGACC GTGTTGTTGA AGAGTTTCCC GACCAGTATG CCTTCAAGTA TACCACAACC GATTATACCC GGACTTATGC CCAGTTCAGG GATGATGTCG ATACTTTTGC AAGATCCCTG ATAGCTCTGG GAGTAAAACC GGGAGACCAT GTTGCCATCT GGGCTACCAA CGTACCCCAA TGGTTTATTA CATTCTGGGC GACAACTAAG ATTGGAGCGG TGCTTGTCAC CGTAAACACC GCATATAAAA TTTATGAGGT TGAATATCTT CTCCGTCAGT CGGATACCCA CACACTGGTT ATGATTGACG GATTTAAGGA TTCGAATTAT GTTGAAATTA TTAAAGAACT TTGCCCTGAG CTTGAAACGG CGGAGCCCGG AAAACCTCTG CATATCAAGA GGCTTCCTTT CCTGCGCAAT ATCATTACTA TTGAGTCAAA ACAAAAAGGC TGCATTTCGT GGGATGAAGC AATTGCCCTG GCGGAAAAAG TGCCTATTGA GGAGGTTCAA CGCCGTGCTC TTGCGGTTAA CAGGCATGAT GTCTGCAATA TGCAGTATAC TTCAGGAACC ACCGGATTCC CAAAAGGTGT TATGCTTACC CATTACAATG TTATTAACAA CGGAAAATGC ATTGGAGACT GTATGGACCT TTCCACTGCC GACCGCATGC TGATCCAGGT TCCGATGTTC CACTGCTTTG GAATGGTGCT TTCAATGATA GCTTGTGTGA CTCATGGTTC CACAATGTGT CCGATACCGT ATTTTTCACC GAAGGTGGCT TTGGATTGTA TTAACCGTGA GAAGATAACC GTCTGCAACG GTGTTCCGAC GATGTTTATT GCAATGCTGG AACACGAAGA TTTCAAAAAG ACAGATTTCT CTCACATGAG AACGGGAATT ATGGCCGGAA GCCCGTGTCC TGTAAAGGTT ATGCAGGATG TGGTGGACAA GATGAACATG AAGGAGATAA CCATTGTATA CGGTCAGACT GAGGCTTCAC CGGGCTGTAC CCAGAGCCGT GTGGATGATC CTATTGAGGT GCGTGTGAAT ACTGTCGGAC GTCCGCTTCC CGGTATTGAA TGCAAGATTG TGGATCCTCA AACTGGTGAG GAATTGCCGG ATAATACCGA CGGAGAGTTT GTTGCCCGCG GATATAATAT TATGAAAGGT TACTACAAGA TGCCTGAAGC GACGGCGGCA GCAATTGACA AAGACGGCTG GCTCCATACC GGTGACATGG CAAGGCGTGA TGAAAACGGC AACTACAAGA TAACCGGCCG TATCAAGGAC ATGATAATAC GTGGCGGTGA AAATATTTAT CCGAAGGAAA TTGAAGACTT TATATACACT CATCCGAAAG TAAAGGATGT TCAGGTTATA GGTGTTCCCG ACAAGCAATA TGGTGAAGAG ATTATGGCAT GGGTAATCCT TAAGGACGGC GAAACAATGA CTGCCGAAGA GCTTCAGGAA TATGTTCGCT CCAATATGGC AAAACACAAG ACGCCTCGAT ACGTCAAATT TGTTACGGAA TTCCCCATGA ATGCGGCAGG AAAGGTATTA AAGTACAAAA TGCGTGAGAT GGCAGTTGAC ATGTTGTCCC TCCATGAAGC CAATTCAATC GTTACGGCTT AA
|
Protein sequence | MKIAFSTLGC PDFSWTDIYS MAKDFGFNGI EIRGLGKEIF AVKAQPFTES ELPKTLKKLS ELRLEIPCFS SGCCLKFSEN AEKNYEEIVE YITLASKTGT PFVRVLGDLE PEPQGEVDDN VVIEALKKLA PIAEEKGVTL LVETNGVYSD TKRLCELLDN VASDAVAALW DVHHPYRFAG ETPGKTVQNL GAYIKYVHIK DSVVENGKIH YRMLGEGDLP IDDIMMALRS INYEGYISLE WVKRWAADLD DAGVVFPNFA NYMSRYIKKS EVRGRLFDNA RKTGKYIWEK DTLIDLTFPQ LLDRVVEEFP DQYAFKYTTT DYTRTYAQFR DDVDTFARSL IALGVKPGDH VAIWATNVPQ WFITFWATTK IGAVLVTVNT AYKIYEVEYL LRQSDTHTLV MIDGFKDSNY VEIIKELCPE LETAEPGKPL HIKRLPFLRN IITIESKQKG CISWDEAIAL AEKVPIEEVQ RRALAVNRHD VCNMQYTSGT TGFPKGVMLT HYNVINNGKC IGDCMDLSTA DRMLIQVPMF HCFGMVLSMI ACVTHGSTMC PIPYFSPKVA LDCINREKIT VCNGVPTMFI AMLEHEDFKK TDFSHMRTGI MAGSPCPVKV MQDVVDKMNM KEITIVYGQT EASPGCTQSR VDDPIEVRVN TVGRPLPGIE CKIVDPQTGE ELPDNTDGEF VARGYNIMKG YYKMPEATAA AIDKDGWLHT GDMARRDENG NYKITGRIKD MIIRGGENIY PKEIEDFIYT HPKVKDVQVI GVPDKQYGEE IMAWVILKDG ETMTAEELQE YVRSNMAKHK TPRYVKFVTE FPMNAAGKVL KYKMREMAVD MLSLHEANSI VTA
|
| |