Gene Cthe_1907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1907 
Symbol 
ID4810765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2265043 
End bp2266920 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content38% 
IMG OID640107324 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001038319 
Protein GI125974409 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAATT GTGCTGTAAA TAACCAAAAT GAGAAAGGGC AAAGTGGAAG AATATGCAAG 
TATCCATTAA TGCCGGCTCA TCGTTTATTT GAAAACCAGG CTGAAAAATT TCCTGATTGT
ATAGCGGCTT ACTATGAAAA TGAAAAGATA TCTTATTCCG AACTAAACAG CAAATCTAAT
CAAGTAGCCA GGTATTTACA AAAATTAGGA GTTAGTTATG AGGTACCTGT GGGTATACTT
ATGGAACGGT CTATTGACGT TATTATCGCA ATTCTGGGAG TACTGAAAGC AGGTGGGGCA
TATATACCAC TTGAACCGGC ATATCCAAAG GAACGCTTAA ATTATATGAT AAATGATTCT
AAGATGCCTG TGTTGATTAC CAAATCTTCT TTCCTCGATA TAGTACCCGA TAGTAATGTA
ACTGTGGTTA ATATGGATTT GGATTGGGAA AGAATATCTA AAGAGAGTAA AGAAAATCCA
GACTGCAACA TTAATTACGA TAATTTGGTT TACATAATTT ATACTTCCGG TTCAACCGGC
ACTCCTAAGG GAGTTGAGAT AAGCCATGGT GCATTGGTTA ATCTTATACA CTCCATGCTT
AAAGAACCTG GAATGACATG TGAAGATCGT TTGCTTTCTG TTTCGGCACT ATCTTTTGAT
ATGTCTGTTT TCGACATCTT TGTACCGCTT TCAGCAGGTG CTTCCATTAT TATGGTCGGA
GACTGTATTG CAAAGAATGG GACAAAACTT ATCCAGGCAT TAGAAGAAAA CTCTATTACT
GTCATGCAAG CAACACCGTC TACATGGCGC ATGCTTTTAG AATCTGGCTG GAAAGGAAAT
AAACAATTAA AAATACTTTG CGGTGGAGAA GCATTGCCCA GAGAGTTGGT CAACCAACTT
AATGAAAAGG GTGCCGTTGT TTGGAATATG TATGGTTTAA CTGAACTGAC AGTATATTCC
GTGATTTCAA AAGTTACTTC AGGCGATGGG CCGGTACCAA TAGGTTATCC GATTGATAAC
ACTCAAGCAT ATATTTTGGA TGAAGATCTT AAGCCTGTAC CTTTTGGAGA GGTCGGAGAA
CTTTATATAG GTGGTGATGG AGTAGGCAGA GGATATTTTG GCAAGCCTGA ATTAACGAGC
GAAAAATATA TCCAGAACCC ATTTAGTGAC AACCAATCAG ATCGCATTTG CAAAACGGGA
GATCTTGCAC GTTTTTTACC TGATGGTTCA ATTGAGTATC TTGGACGAGC GGACTTTCAG
ATAAAACTCA GGGGTTTTAG AATTGAATTG GGAGAAATTG AATCTGCCAT TGAAAAGCAT
CCATGGGTGC AGCAGGCCGT TGTTGTTAAG GACAATGGCG AGGGAGATCA ACACATAGTA
GCTTATTTCA GAACAAAATC AGAGCAGGTC CCGTCCAGTG AGGATATGCG TTCCTTCTTA
AAAAATACTT TGCCTGATTA TATGATTCCA AGCTTCTTTG TTCAAATCGA TGAGTTTCCA
CTTACACCCA ACGGGAAGGT AGACAGAAAA TCATTACAGA ATTTTGATTA CAAATTGGAT
GTTCAAAGAG ATGGATATGT AGCGCCTTCT ACTTCACTTG AAAAGGAAGT TGCAAAAATA
TGGTCGGATT TATTGAAGAT TGATGATATT GGAATATATG ATAATTTTAT GGAACTTGGG
GGACATTCGC TTTTGGCCAA CCGCTTAACT TTGCGTATTA ACGATACTTT CGGAATTAAG
CTGTCTTTGA TGGAAGTTCT GACATCAGGA TTAACAGTGG CAGATATGGT AAAACTTATT
GAGAACAAAT TTCTTGAGGA AACTGACAAC AAGGATTTGG AAGCTATCCT GGAAGAAGTA
GAACGAACGG TTGGCTGA
 
Protein sequence
MNNCAVNNQN EKGQSGRICK YPLMPAHRLF ENQAEKFPDC IAAYYENEKI SYSELNSKSN 
QVARYLQKLG VSYEVPVGIL MERSIDVIIA ILGVLKAGGA YIPLEPAYPK ERLNYMINDS
KMPVLITKSS FLDIVPDSNV TVVNMDLDWE RISKESKENP DCNINYDNLV YIIYTSGSTG
TPKGVEISHG ALVNLIHSML KEPGMTCEDR LLSVSALSFD MSVFDIFVPL SAGASIIMVG
DCIAKNGTKL IQALEENSIT VMQATPSTWR MLLESGWKGN KQLKILCGGE ALPRELVNQL
NEKGAVVWNM YGLTELTVYS VISKVTSGDG PVPIGYPIDN TQAYILDEDL KPVPFGEVGE
LYIGGDGVGR GYFGKPELTS EKYIQNPFSD NQSDRICKTG DLARFLPDGS IEYLGRADFQ
IKLRGFRIEL GEIESAIEKH PWVQQAVVVK DNGEGDQHIV AYFRTKSEQV PSSEDMRSFL
KNTLPDYMIP SFFVQIDEFP LTPNGKVDRK SLQNFDYKLD VQRDGYVAPS TSLEKEVAKI
WSDLLKIDDI GIYDNFMELG GHSLLANRLT LRINDTFGIK LSLMEVLTSG LTVADMVKLI
ENKFLEETDN KDLEAILEEV ERTVG