Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1904 |
Symbol | |
ID | 4810762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2256831 |
End bp | 2262332 |
Gene Length | 5502 bp |
Protein Length | 1833 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107321 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001038316 |
Protein GI | 125974406 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.844763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTAT TAAAAAAGCA AAATAACAAA AGAGAAATTC AGAATGAAGA AGACAAAATA ACTTTGTGTC ATAGAGAAAA AGGATGTATC AATTATTTTC CGCTGTCTTA TTTTCAACAA GGGTTGTGGT TTATAAACCA GTTAGACCCG AACAATTCTT CATATAACAT TCCGTTAGCC TACAGACTTG TTGGGACACT TAATAAAGAA GCATTGAAAA AATCTCTCGA AATAATAATT AACAGACATG AAGTGTTACG TACAACTTTT CAGGAAATAA ACGGAGAACC GTTTCAAGTA ATTAGCCCGT ATTCTAAGGT TGAGTTGAAT ATTATAGATA TTTCGCACAT TACCGGGGAA GACTGTGAAA AACTTGCACT TGAAAGTGCC CGGAATGAAG CAAACCGGTT GTTTGACCTG ACTAAAGGAC CGCTCATCCG GTTTCTGTTG ATTTGCAAAT CTCAAACTGA ACATATTTTT GTAGTTACTG TTCACCATAT TATTTTCGAT GGATGGTCTA CCGGTATTTT CTGCAATGAG CTGTCGGAGA TATATAATGC ACTCATTTCG GGTAGAGAAT ATAATTTACC CCAACTCGAA GTGCAGTATG CAGATTATGT CGTATGGCAG CATAAAAAAC TTAATAATGA AGTTATCGAA AAGCAATTAA CTTATTGGAG ACAAAAGTTG ACCGGGAATG TCCAAGTTAT TGAGCTTCCT ACCGACAGGC CAAGACCCAG TATAAAAACT GTGCGGGGTG GTGCCTTACC TTTTGAATTG TCGCCTGCTC TTAGTAAAGA AATTAAGATT TTGACTGTCA GGAAAAGATG CACTTTGTTT ATGACATTAC TGGCAGCTTT TAAAACATTG CTTTACAGAT ACACCTGCCA GGATAATATA ACTGTAGGTA CGCCGATAGG CAATAGGAGT CAATTGGATT GCGAGAAGTT GATGGGGTTG TTTATTAACA CATTGGTTTT ATGTACAAAA ACAGGTGATG ACCCATCGTT TTCGGATTTA TTGGAACGTG TTCGAAATGT TACATTGGAA GCTTACGAAA ATCAGGATAT TCCCTTCCAG AAATTAGTGG AGGAACTTAA ACCTGAGAGG GATTTGAGTC GTAACGTATT TTATCAAGTA ATGTTTAACT TCAGTGACAT GTCAAAAGTG TGTATGCGGC TGGAAGGACT GGAAGTATCC CCTTTTGAGC TCGGCGGAAG TACAGCCAAT GTTGATTTAC AGTTGTATGT TTGGCAGGAA GGTGAAGTTA TAAAGGGATA TTTTGAGTAT AACAAGGACT TATTTGATGA AAGTACAATT AAACGCCTTA TTGAACAGTA TAAAGTTTTA TTACAGGGAG TGGTAAATGA TCCTGAACGG CACCTAAGTG AACTCCCGAT ATTGCCTTTG GAAGAGAAAA ATAAAGTACT TTACGAATGG AATGATAATG ATGTAGCATA CCCTCACATC AACGGACTCC ATAAGTTTTT CGAACGACAG GTAGAGAAAA CACCAGATTC TCCGGCTGTT TTTTTTGAGA ATGAATATTG CACCTATCAA GAACTTAATG AGAGAGCAAA TCAGCTTGCG CATTATTTAA TTAACATAGG TGCTAAAAAG AACACAGCTA TTGGCCTTTT CCTTGATAGA TCCATTGATA TGATTGTTGG CATGTTTGGA ATTATGAAGT CAGGAGCAGC TTATGTTCCG CTTGATATAA AATATCCTTC GGATAGAATT GCAGCTATTT TAAAAGAGGC AGGGATAAAA ATTTTAATAA CTCAAGATGA TTTACTGTCA GACGTTCCTC AAATGGAAGG GCTTAATGTA ATCTGTATTG ACAGGGAACA GAAAAAAATC TGCAGCTTCA GTAAAGAGAA CCCTTCTGTT GAAGTGTCCA ATAATGATTT GTTATATATC CTTTTTACAT CAGGAACTAC GGGAAAACCT AAGGGCGTAC TGGTAGAACA CAGGTGTTAT ATAAATTATA TACAGGGGAT TATAAGAAAA CTTGAAATAG ACTCACCGCT AAATTTTGCT ATAGTTTCAA GTTTTGCGGC TGATTTGGGT ACTACTAATA TTTTTATACC GCTTTTTACA GGAGGACAGT TGCATATTTT GTCTTATGAG AGAGCTACGG ATCCTGAAAA ATTTTTAGAT TATTTCAGAA AACATAAGAT CGATGCAATG AAACTGGTTC CAAGTCATTT TGAAGCTTTG AAAACAGTAC AAAACTTTGA AGATATTATA CCCGGTAAAA GACTTGTTTT TGCAGGCGAA GCTTGTTCAT GGGAACTTAT AGAAGAAGTT CGCAGATTAA ATCCGAGTTG TATGATACAG AATCACTATG GGCCAACAGA AACTACAGTT TCAGCATTGG CTTACCTGGT TCCTGATGAA CTGCCGCAAC ATGCCGGAAG TGTTGTGCCT ATAGGACGTC CTTTGCCCAA TGTAAAAGCA TATGTTCTTG ACAAGCATAG ACAACCGGTT CCTATAGGAG TTGTGGGTGA ACTCTATATC GGCGGAGCAG GAGTGGCGCG GGGATACATA AATGAACCGG AAATGACGAA GCAAAAGTTT ATACCGAATC CGTTCCATCC AGGCCCGTCA AGTTATATGT ACCGCACCGG TGATTTGGTA AGGTATTTAC CGGATGGGAA TATTGAATTT TTAGGAAGAA TTGACCGGCA AATCAAAATA AGAGGGTACA GAATAGATCC TGAGGAAATC GAACATGCAA TAAAAGAACA TTCCGTTGTT CGGGATGCTG TAGTTACTGT TAGGGGAAAT TCAGAAAAAA GTAATAAACT GGTAGCTTAT CTGGTTCTTG ATAAAAAAGC TGAAGGAAAC TTGGATATAT CCGAAATTCG GCGTTATTTA AAGAAAAAAT TGCCTGAATA TATGAGGCCA TCATCTTTTA CAGTATTGGA TTCCATACCG CTAAATACCA ATGGAAAAGT GGATTACAAG TCATTGCCTG AACCTAGTGA AGATATTATT GAAGATGATA ATTATGTAGC TCCGAGAAAT GAATTGGAAG AAAAGATTGC CTCTATATGG AAGGAAACAT TGGAAATATC CAGAGTAGGG ATTGATGACA ACTTTTTTGA CCTTGGAGGA GAGTCTTTTA AGGCTATGAG CGTAGTCAGA AAAATTTCTC CTTCACTTAG TGTAATCGAT TTGTTTAAAT ACCCTACTAT CAGAGAACTT AGCGATTACA TCTCAAACAA GCAAAAAGAA GAAAAGAGAG AGATTTTGCA TGAACTTACG AAGCATGTTT CAAAAGAGAA AAAACAAATG AATTTGATTT GTATTCCTTA TGGGGGAGGA AGTGCTGTTG CTTATCAGCC TTTAGCAAAT GAAATTCCTG AAAACTGGTC ACTGTATGCT GTACAAATAC CGGGACGTGA TTTCAGTCGC CCACATGAAA AACCGGAGAG CCTTGAAAAG GTTGCTGAAA TGTGCATTTC TGAAATCAAA GAAAAAGTGA CAGGGCCTAT TGTTCTGTAT GGACAATGTG TTGGGGGAGC TTTGGCTATC AAACTGGCAT ATATGATGGA AGAGCAGGGA ATGGAGCTGG TTGGTGTAAT TGAGGCTGGA AACTTTCCTT CGCCACGTCT TCCGGGAAAA TGGTTTGAAT TGTGGTCAAA AATTTTTCCT AGGGACCGTT GGATATCTAA TCGCTTGTAC AAGGAAATAT TAAAAAGTAT AGGAGCTCCG ATAGGCGGTT CCAACAATGA AGCAGAACAG GATTTTATTA TAAGAAGTCT TCGCCATGAC AGCAGAGAGG CGGAAGATTA TTACACTAAG ATGTTTTCAA CTGAAAATTT AAAGAAACTT AAAGCTCCTA TTACATGTGT TGTGGGAGAA AGAGACCGTA CTACAGAATT TTATCAGGAA AGATATAAAG AATGGGAACA TTTTAGTAAT TGTGTGAATT TAAGGGTTAT TGAAAATGCA GGCCACTTTT TCCAGAAACA TCAGGCCGAC ATATTAGCGC AGATTATAGT TGATCAGGTG GAAAAATGGA AAAATATTAG AAGTTCAGAA TTTGTGGAAG AGGCATTGGA AGAAACGGTA GATAAAAAGA AGGTAAAAAC TTCTATATTT GACAAAGCAA ATGTAAAACC CAGTATGAAA TTATTTTTAT TTATTGCCTT AGGGCAAATT GTGTCAATGT TTGGGACAAG CCTTACAGGA TTTGCATTAG GTTATTGGAT TTATAAAGAA ACAGGGTCTG TATCCTATTA TACTTTAATT TCAGTTTGTA CTTTACTGCC AAATATTCTT ATCTCTCCTA TTGCCGGTGC AGTCGCAGAC CGGTGGGATC GACGGAAAAT TATGATTATA TCCGATACTT TTGCTGCGAT GGGTACTTTG GCGATTGCTT TATTATTGTG GAGCGGTCGG CTTGAAATCT GGCACATTTA TATTTCCACA ACTATTAGTT CAATTGCAGG CGCGTTTCAG AGACCGGCAT TTTTGGCGGC AATTGCTCAA ATTACGCCCA AACAGTATCT TGGACAGGCA AACGGAATAG CACAAATGGG TTCTGCGTCA GGTAGCATGT TGGCGCCTAT AATAGGCGGA ATGCTTGCCT CTTCAATTAA TCTTTATGGA ATTCTGTTAA TTGATTTTAT ATCTTTCTTG TTTTCGGTGG TGCCTTTATT ATTGGTGGCT TTCCCGAACT ATATGTTTAA AAAGCGGGAA GAACCTTTCA TAGAAGAAAT AAAAGGTGGA TGGAATTATA TTATAAAACG AAAATGTTTG ATTATAATGA TAGGTTTCTT TATTGTTACA AATTTCTTTA TGAGTTTGTC AACTGTTTTG GTTACTCCTG TAGTATTAGC CTTTGCTTCA GTAGAAACGA TGGGTATTGT TACTTCTGCG AATGGCTTTG GGCTTATAGT AGGATCAATT ATTATGAGCC TGTGGGGCGG AACAAAAAGA CGTGCAGACG GAATGATTGG ATATGTTATA CTATCCGGTA TATGCCTGAT TTTGATTGGA ATAAGACCGT CGGTAGTTTT AGCAACAATA GGTCTTTTCG GATTTGGTCT ATCTATAGCA TTTATCGATA CTCATTGGCA GATTCTTATA CAGTCTAAAG TGGGGCTTGA ATTACAAGCC AGAGTTTTTT CAATTAATGA AATGTTAGCC TTTATTATGC GTCCTCTTGC GTTTTTCCTT GCGGGGCCTT TGTCAGATAA AGTATTTGAA CCGTTCATGG CTGGAGAAGG GAATCTTGCA ACGAAAATCA GCATGATAAT CGGAAGCGGT GAAGGAAGAG GCATGGGATT AATTCTGGTT TTGTCCGGTA TTATATTGAC TATATGGGGA ATTATGGGAT TTAATTATCG TCCTTTACGC TTTATGGAAG ATGTATTGCC GGATGCTATA CCTGATCCTG TTATTTTGAA AGATAAAAAC AAAATTCAGG AGTTGGCTGA TATGCAGTTA TTAAAAACCA TTCAAAATGA CAGGAAAAGA GCAAAGATTT GA
|
Protein sequence | MQLLKKQNNK REIQNEEDKI TLCHREKGCI NYFPLSYFQQ GLWFINQLDP NNSSYNIPLA YRLVGTLNKE ALKKSLEIII NRHEVLRTTF QEINGEPFQV ISPYSKVELN IIDISHITGE DCEKLALESA RNEANRLFDL TKGPLIRFLL ICKSQTEHIF VVTVHHIIFD GWSTGIFCNE LSEIYNALIS GREYNLPQLE VQYADYVVWQ HKKLNNEVIE KQLTYWRQKL TGNVQVIELP TDRPRPSIKT VRGGALPFEL SPALSKEIKI LTVRKRCTLF MTLLAAFKTL LYRYTCQDNI TVGTPIGNRS QLDCEKLMGL FINTLVLCTK TGDDPSFSDL LERVRNVTLE AYENQDIPFQ KLVEELKPER DLSRNVFYQV MFNFSDMSKV CMRLEGLEVS PFELGGSTAN VDLQLYVWQE GEVIKGYFEY NKDLFDESTI KRLIEQYKVL LQGVVNDPER HLSELPILPL EEKNKVLYEW NDNDVAYPHI NGLHKFFERQ VEKTPDSPAV FFENEYCTYQ ELNERANQLA HYLINIGAKK NTAIGLFLDR SIDMIVGMFG IMKSGAAYVP LDIKYPSDRI AAILKEAGIK ILITQDDLLS DVPQMEGLNV ICIDREQKKI CSFSKENPSV EVSNNDLLYI LFTSGTTGKP KGVLVEHRCY INYIQGIIRK LEIDSPLNFA IVSSFAADLG TTNIFIPLFT GGQLHILSYE RATDPEKFLD YFRKHKIDAM KLVPSHFEAL KTVQNFEDII PGKRLVFAGE ACSWELIEEV RRLNPSCMIQ NHYGPTETTV SALAYLVPDE LPQHAGSVVP IGRPLPNVKA YVLDKHRQPV PIGVVGELYI GGAGVARGYI NEPEMTKQKF IPNPFHPGPS SYMYRTGDLV RYLPDGNIEF LGRIDRQIKI RGYRIDPEEI EHAIKEHSVV RDAVVTVRGN SEKSNKLVAY LVLDKKAEGN LDISEIRRYL KKKLPEYMRP SSFTVLDSIP LNTNGKVDYK SLPEPSEDII EDDNYVAPRN ELEEKIASIW KETLEISRVG IDDNFFDLGG ESFKAMSVVR KISPSLSVID LFKYPTIREL SDYISNKQKE EKREILHELT KHVSKEKKQM NLICIPYGGG SAVAYQPLAN EIPENWSLYA VQIPGRDFSR PHEKPESLEK VAEMCISEIK EKVTGPIVLY GQCVGGALAI KLAYMMEEQG MELVGVIEAG NFPSPRLPGK WFELWSKIFP RDRWISNRLY KEILKSIGAP IGGSNNEAEQ DFIIRSLRHD SREAEDYYTK MFSTENLKKL KAPITCVVGE RDRTTEFYQE RYKEWEHFSN CVNLRVIENA GHFFQKHQAD ILAQIIVDQV EKWKNIRSSE FVEEALEETV DKKKVKTSIF DKANVKPSMK LFLFIALGQI VSMFGTSLTG FALGYWIYKE TGSVSYYTLI SVCTLLPNIL ISPIAGAVAD RWDRRKIMII SDTFAAMGTL AIALLLWSGR LEIWHIYIST TISSIAGAFQ RPAFLAAIAQ ITPKQYLGQA NGIAQMGSAS GSMLAPIIGG MLASSINLYG ILLIDFISFL FSVVPLLLVA FPNYMFKKRE EPFIEEIKGG WNYIIKRKCL IIMIGFFIVT NFFMSLSTVL VTPVVLAFAS VETMGIVTSA NGFGLIVGSI IMSLWGGTKR RADGMIGYVI LSGICLILIG IRPSVVLATI GLFGFGLSIA FIDTHWQILI QSKVGLELQA RVFSINEMLA FIMRPLAFFL AGPLSDKVFE PFMAGEGNLA TKISMIIGSG EGRGMGLILV LSGIILTIWG IMGFNYRPLR FMEDVLPDAI PDPVILKDKN KIQELADMQL LKTIQNDRKR AKI
|
| |