Gene Cthe_0234 details

Gene Information

Locus tag: Cthe_0234
Symbol:
ID: 4808582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 282265
End bp: 284796
Gene Length: 2532 bp
Protein Length: 843 aa
Translation table: 11
GC content: 45%
IMG OID: 640105646
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001036666
Protein GI: 125972756
COG category: [G] Carbohydrate transport and metabolism
              [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
        [COG1082] Sugar phosphate isomerases/epimerases
TIGRFAM ID:
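
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates (which matches the listed values) and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(282265, 284796)
print(length, protein_length(length))  # 2532 843
```

Both results agree with the Gene Length and Protein Length fields of this record.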


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000591284
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTG CGTTTTCAAC ACTTGGTTGT CCTGACTTCA GTTGGACGGA CATTTATTCC 
ATGGCTAAGG ATTTTGGATT TAACGGTATC GAAATCCGTG GTCTTGGAAA GGAAATTTTC
GCCGTGAAAG CACAGCCTTT TACCGAATCA GAGCTGCCTA AGACTTTAAA AAAGCTTTCG
GAACTTCGTC TTGAAATTCC GTGCTTTTCT TCGGGATGCT GTTTGAAGTT TTCCGAGAAT
GCCGAGAAAA ATTATGAGGA GATTGTAGAG TATATTACGC TTGCTTCCAA AACAGGAACT
CCTTTTGTCC GTGTTCTTGG CGACCTTGAG CCGGAACCTC AGGGAGAAGT TGATGACAAT
GTTGTTATTG AGGCACTGAA AAAACTTGCC CCCATTGCGG AAGAAAAAGG TGTAACGCTT
CTTGTGGAAA CCAATGGTGT ATATTCCGAC ACAAAACGTC TGTGTGAGCT GCTTGACAAT
GTGGCCAGTG ATGCAGTGGC GGCTTTGTGG GATGTACACC ACCCGTATAG ATTTGCCGGT
GAGACTCCCG GAAAGACGGT GCAAAATCTT GGAGCATACA TTAAATATGT ACATATCAAG
GACTCGGTTG TTGAAAACGG AAAAATTCAT TATCGCATGC TTGGTGAAGG TGATTTGCCA
ATTGACGATA TCATGATGGC ACTTCGTTCA ATCAACTATG AAGGATACAT TTCTCTGGAA
TGGGTTAAAC GGTGGGCTGC GGACCTCGAC GATGCCGGAG TTGTCTTCCC CAATTTTGCA
AATTACATGA GCCGCTACAT TAAAAAAAGC GAAGTGAGAG GGCGCTTGTT TGACAATGCG
AGAAAGACCG GAAAGTATAT TTGGGAGAAA GACACGCTTA TTGATTTGAC ATTCCCTCAG
CTTTTGGACC GTGTTGTTGA AGAGTTTCCC GACCAGTATG CCTTCAAGTA TACCACAACC
GATTATACCC GGACTTATGC CCAGTTCAGG GATGATGTCG ATACTTTTGC AAGATCCCTG
ATAGCTCTGG GAGTAAAACC GGGAGACCAT GTTGCCATCT GGGCTACCAA CGTACCCCAA
TGGTTTATTA CATTCTGGGC GACAACTAAG ATTGGAGCGG TGCTTGTCAC CGTAAACACC
GCATATAAAA TTTATGAGGT TGAATATCTT CTCCGTCAGT CGGATACCCA CACACTGGTT
ATGATTGACG GATTTAAGGA TTCGAATTAT GTTGAAATTA TTAAAGAACT TTGCCCTGAG
CTTGAAACGG CGGAGCCCGG AAAACCTCTG CATATCAAGA GGCTTCCTTT CCTGCGCAAT
ATCATTACTA TTGAGTCAAA ACAAAAAGGC TGCATTTCGT GGGATGAAGC AATTGCCCTG
GCGGAAAAAG TGCCTATTGA GGAGGTTCAA CGCCGTGCTC TTGCGGTTAA CAGGCATGAT
GTCTGCAATA TGCAGTATAC TTCAGGAACC ACCGGATTCC CAAAAGGTGT TATGCTTACC
CATTACAATG TTATTAACAA CGGAAAATGC ATTGGAGACT GTATGGACCT TTCCACTGCC
GACCGCATGC TGATCCAGGT TCCGATGTTC CACTGCTTTG GAATGGTGCT TTCAATGATA
GCTTGTGTGA CTCATGGTTC CACAATGTGT CCGATACCGT ATTTTTCACC GAAGGTGGCT
TTGGATTGTA TTAACCGTGA GAAGATAACC GTCTGCAACG GTGTTCCGAC GATGTTTATT
GCAATGCTGG AACACGAAGA TTTCAAAAAG ACAGATTTCT CTCACATGAG AACGGGAATT
ATGGCCGGAA GCCCGTGTCC TGTAAAGGTT ATGCAGGATG TGGTGGACAA GATGAACATG
AAGGAGATAA CCATTGTATA CGGTCAGACT GAGGCTTCAC CGGGCTGTAC CCAGAGCCGT
GTGGATGATC CTATTGAGGT GCGTGTGAAT ACTGTCGGAC GTCCGCTTCC CGGTATTGAA
TGCAAGATTG TGGATCCTCA AACTGGTGAG GAATTGCCGG ATAATACCGA CGGAGAGTTT
GTTGCCCGCG GATATAATAT TATGAAAGGT TACTACAAGA TGCCTGAAGC GACGGCGGCA
GCAATTGACA AAGACGGCTG GCTCCATACC GGTGACATGG CAAGGCGTGA TGAAAACGGC
AACTACAAGA TAACCGGCCG TATCAAGGAC ATGATAATAC GTGGCGGTGA AAATATTTAT
CCGAAGGAAA TTGAAGACTT TATATACACT CATCCGAAAG TAAAGGATGT TCAGGTTATA
GGTGTTCCCG ACAAGCAATA TGGTGAAGAG ATTATGGCAT GGGTAATCCT TAAGGACGGC
GAAACAATGA CTGCCGAAGA GCTTCAGGAA TATGTTCGCT CCAATATGGC AAAACACAAG
ACGCCTCGAT ACGTCAAATT TGTTACGGAA TTCCCCATGA ATGCGGCAGG AAAGGTATTA
AAGTACAAAA TGCGTGAGAT GGCAGTTGAC ATGTTGTCCC TCCATGAAGC CAATTCAATC
GTTACGGCTT AA
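
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any downstream use. The following is a minimal Python sketch, under the assumption that the sequence contains only A, C, G, and T, that strips the layout whitespace and computes a GC fraction. The helper names are hypothetical, and the example runs on the first two blocks only, so its result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, as an illustration only.
excerpt = clean_sequence("ATGAAAATTG CGTTTTCAAC")
print(len(excerpt), gc_fraction(excerpt))  # 20 0.3
```

Pasting the full blocked sequence into `clean_sequence` should yield a string whose length equals the Gene Length field.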
 
Protein sequence
MKIAFSTLGC PDFSWTDIYS MAKDFGFNGI EIRGLGKEIF AVKAQPFTES ELPKTLKKLS 
ELRLEIPCFS SGCCLKFSEN AEKNYEEIVE YITLASKTGT PFVRVLGDLE PEPQGEVDDN
VVIEALKKLA PIAEEKGVTL LVETNGVYSD TKRLCELLDN VASDAVAALW DVHHPYRFAG
ETPGKTVQNL GAYIKYVHIK DSVVENGKIH YRMLGEGDLP IDDIMMALRS INYEGYISLE
WVKRWAADLD DAGVVFPNFA NYMSRYIKKS EVRGRLFDNA RKTGKYIWEK DTLIDLTFPQ
LLDRVVEEFP DQYAFKYTTT DYTRTYAQFR DDVDTFARSL IALGVKPGDH VAIWATNVPQ
WFITFWATTK IGAVLVTVNT AYKIYEVEYL LRQSDTHTLV MIDGFKDSNY VEIIKELCPE
LETAEPGKPL HIKRLPFLRN IITIESKQKG CISWDEAIAL AEKVPIEEVQ RRALAVNRHD
VCNMQYTSGT TGFPKGVMLT HYNVINNGKC IGDCMDLSTA DRMLIQVPMF HCFGMVLSMI
ACVTHGSTMC PIPYFSPKVA LDCINREKIT VCNGVPTMFI AMLEHEDFKK TDFSHMRTGI
MAGSPCPVKV MQDVVDKMNM KEITIVYGQT EASPGCTQSR VDDPIEVRVN TVGRPLPGIE
CKIVDPQTGE ELPDNTDGEF VARGYNIMKG YYKMPEATAA AIDKDGWLHT GDMARRDENG
NYKITGRIKD MIIRGGENIY PKEIEDFIYT HPKVKDVQVI GVPDKQYGEE IMAWVILKDG
ETMTAEELQE YVRSNMAKHK TPRYVKFVTE FPMNAAGKVL KYKMREMAVD MLSLHEANSI
VTA
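
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted, which this sketch does not model). The `translate` function and `CODON_TABLE` name are hypothetical, and the example covers only the first 60 bases of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAAATTG CGTTTTCAAC ACTTGGTTGT CCTGACTTCA GTTGGACGGA CATTTATTCC"
print(translate(first_line))  # MKIAFSTLGCPDFSWTDIYS
```

The output matches the first 20 residues of the protein sequence listed above.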