Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1664 |
Symbol | |
ID | 7976378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1743939 |
End bp | 1746935 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 644798540 |
Product | amino acid adenylation domain protein |
Protein accession | YP_002949712 |
Protein GI | 239827088 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR00517] acyl carrier protein [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTATA AAGAATTAGA TGAACTATCA TCAAAATTGG CAAACTATTT ACATGAAAAT AATTATAGTA AAAATGCCTT TATTCCAATT TATATGCCTC CATGTCCTGA AATGATTATA AGTATTTTGG GAGTGTTAAA GGTAGGTGCT GCTTATCTAC CTATATCTAC AGAATATCCA GTTAATCGCA TTAATATGCT ACTAGAAGAT TCAAATAGCC AAATAATTCT AAAAAATACG AGTAATTTGT TAAATTTAAA TGTTAAAGAA ATTGATATTA GAAATATTAT TACTTCAGAT TACTCTGACT CATTCAATGA AATAGATGGC GAATTAGCTT ATTTAATGTA CACTTCTGGA AGTACAGGAA AGCCAAAGGG TGTTAGAGTT ACGCATTCAA ATTTAGAATA TATACTAAAT AACATGCAGA AATACTACCC GGTTTCTAGA GATGACAAGT ATATTTTATC AACACCATTC ACCTTTGATG TTTCAGTTGT TGAAATTTTT GGTTGGATTT ACGGTGGGGG GGCTTTAGTA ATACCAACAC AAGAGAACTC ACGTAATTTT AGAAAACTGG CTCATTTAAT AGAAATTCAT AAAGTAACAC ATATGGCACT CTCGCCTGCA ATTCTAAATT TGATGCTAGA TAAGTTAAAT GAGGATGATA TTGATAAATT AGATAGAAAC CTTAAATATT TAATGGTAGC AGGTGAAGAG TTTAAAGTTT CCCTTGCTCA TAAAGCTATT AAGTATTTAA AAAATGTTTG TATTGAAAAT TTATATGGCC CAACTGAATG TACAGTATAT GCAACGCGTT ATAGAATTGA TCGTAACTTT AATCGTCCAA GCGTTCCGAT TGGAAAGGAA CTCGATGGGG TGCAAATTAA AATTTTAGAC TCTAACGGAA TTGAGGTGCC GATAGGTACT CAGGGGGAGA TGTATATTTC CGGTGAAGGA GTTGCTAAAG GATATTTGAA CTTACCTAGT GTCAATAATG AAAAATTCTT ATTTATAGAT GGCAAAAGAT ATTATAAAAC AGGAGATTAT GCCAAAAGAT TAAAAGACGG AAACATAGAG TTTATAGGAA GGAAAGACTA TCAGGTTCAA ATAAACGGGA TTCGAGTAGA GTTAGGGGAG ATTGAAGATA TTATTTTGAA AGAAATAAAA GAAATAAACA TGGTTAAGGT GTTATATAAA AACAATAAAC TTTATTGCTT TTATCAAGGG CAAAAAGCAA TCGTACCTGA TGATATAAAG AAGACTTTGA AGAACTTTTT GCCTTCATAT ATGATTCCGA ACTTTTATAA ACAGATAGAT GAATTTCCAC TAACGATTAA CAGAAAAATA GATACTAAGG CGCTTATGTC TTATTATGAT GATACTGATG TTCTACAGAA TGTTATTCAA GATGCTGTTA CTGATACGCA AAAAAAAATC TTGAGTATAT TTAAAGAAAC TTTAAATGTA GATAGTGTTT CTTTATATGA TAGCTTCTTT GATCTAGGTG GAGATTCATT AGATGTCATA TCTGTGATTA TAGAGTTAGA GAATTATTAT AACATCAATT TAGATGAATC TGTGTTATAT AACCACCAGA ATGCTAGTGA ACTAGCAAGT TATATAGAAA ATATGTTAGA GCAGGAAAAT GAAGTAACTA AAAAGGTTCA AAATACAATT GAAGATATTA ACATCGATCA TATAAAATCG CAAGTCTCTA GCTCATACTA TAAGAATAAT AAATTGTTAG GTACAATAGA GAAAGTTTAT CCAGTATATT ATCACCAAAA GAATTACATC AAAGACAATT TTAATAGTGT CATCGACATC AAAATTGACG TGAAAAAAGA TTTTGAAATG GAAAAGGTTA TTCAAGCATG TAAGGATATA ATTCTATCTA ATGAACTATT AAGGTCTGTA ATTTCTGTGG AGAGTGAACA GATTGTATTT AAACAGGTTA AATTAGATAT AGATAGTTAT GAAATACCAT TATTTGACCT GTCTGAATAT TCTTATGATA GTGCAATCTC ACTAGTTGAC GAAATTACTA AGACTATGGC AGAAGTCGTC TTAAAAAATC CTTTGGATGA GCTGCTATAC ACTGTTACGA TCTTTAAATT GCGTCAAAAG TACATTGTAG TGTTTGTGTT ATCCCACAAC ATTGCAGATT TATCTAACAA ACACATTTTG ATTAAACAAT TTATGAATTT ATTAAACGGT CATAAATTAG AAAATAGACC GGAATATAAA GATTTCATAG AGTTTATGGA TAGCAAAAAC AAATTAGAGT ATATTAGTAA TTGTGATTAC ACAAAAAAAC TTCTAAAAGT CAATAACAAT AGAGTTAAAG TACAAAGTAG TGATGATTTA TTAGTATTAA AGTTTAATTT TGACAATAAA CTCAGAACAA CATTCGATAT AATAGATAAA ATTAATTATA TAAGTACTCA AATCCTATCT AGGGTTATTG GACAAAAAGA GTTCATCTAT CAAACGATAG TTAACATAAG AAAATATAAA GATTTAGACT TCAGTAATTG TATCGGTGAT TATCATACTT CGATGGTTCT TTTGGGAAAA CCAGAGGAAA CATTTGAGGA ATTTAAAAAT AGAATGGAAG AAGTTTATTC CATGTATAGA GATGGATTCA ATCCAATTTA TCTGTTCGCA AAAGGGTTTC CAAACATGAG CGAAACTCAC AAAGATTTAT ACCATCTCTA CGGTGTCAAT CCTATTGCTA AGACAAACTA CTTAGGAACA ATTAAAAATG AGCAATTAAA TGTCATGTTA GATTCATTAG AAGAAACGAG AAAAAACTTA AGTACTTTAA GAGATAATCC GTTTTTCATA ACTTCTTTTT CAACAAAAGA CCATATCTAT ATTGCGTTTT TAAACAAGCC AGTTAACTTA GATGAAAAAA TATATAAAGA TCTTAACGTA TCTGAAGAAC GCATCTTTAG TAGTACTAAT ACTTTAGATA AAACACCAAT AAAATAG
|
Protein sequence | MTYKELDELS SKLANYLHEN NYSKNAFIPI YMPPCPEMII SILGVLKVGA AYLPISTEYP VNRINMLLED SNSQIILKNT SNLLNLNVKE IDIRNIITSD YSDSFNEIDG ELAYLMYTSG STGKPKGVRV THSNLEYILN NMQKYYPVSR DDKYILSTPF TFDVSVVEIF GWIYGGGALV IPTQENSRNF RKLAHLIEIH KVTHMALSPA ILNLMLDKLN EDDIDKLDRN LKYLMVAGEE FKVSLAHKAI KYLKNVCIEN LYGPTECTVY ATRYRIDRNF NRPSVPIGKE LDGVQIKILD SNGIEVPIGT QGEMYISGEG VAKGYLNLPS VNNEKFLFID GKRYYKTGDY AKRLKDGNIE FIGRKDYQVQ INGIRVELGE IEDIILKEIK EINMVKVLYK NNKLYCFYQG QKAIVPDDIK KTLKNFLPSY MIPNFYKQID EFPLTINRKI DTKALMSYYD DTDVLQNVIQ DAVTDTQKKI LSIFKETLNV DSVSLYDSFF DLGGDSLDVI SVIIELENYY NINLDESVLY NHQNASELAS YIENMLEQEN EVTKKVQNTI EDINIDHIKS QVSSSYYKNN KLLGTIEKVY PVYYHQKNYI KDNFNSVIDI KIDVKKDFEM EKVIQACKDI ILSNELLRSV ISVESEQIVF KQVKLDIDSY EIPLFDLSEY SYDSAISLVD EITKTMAEVV LKNPLDELLY TVTIFKLRQK YIVVFVLSHN IADLSNKHIL IKQFMNLLNG HKLENRPEYK DFIEFMDSKN KLEYISNCDY TKKLLKVNNN RVKVQSSDDL LVLKFNFDNK LRTTFDIIDK INYISTQILS RVIGQKEFIY QTIVNIRKYK DLDFSNCIGD YHTSMVLLGK PEETFEEFKN RMEEVYSMYR DGFNPIYLFA KGFPNMSETH KDLYHLYGVN PIAKTNYLGT IKNEQLNVML DSLEETRKNL STLRDNPFFI TSFSTKDHIY IAFLNKPVNL DEKIYKDLNV SEERIFSSTN TLDKTPIK
|
| |