Gene GWCH70_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1664 
Symbol 
ID7976378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1743939 
End bp1746935 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content29% 
IMG OID644798540 
Productamino acid adenylation domain protein 
Protein accessionYP_002949712 
Protein GI239827088 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR00517] acyl carrier protein
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTATA AAGAATTAGA TGAACTATCA TCAAAATTGG CAAACTATTT ACATGAAAAT 
AATTATAGTA AAAATGCCTT TATTCCAATT TATATGCCTC CATGTCCTGA AATGATTATA
AGTATTTTGG GAGTGTTAAA GGTAGGTGCT GCTTATCTAC CTATATCTAC AGAATATCCA
GTTAATCGCA TTAATATGCT ACTAGAAGAT TCAAATAGCC AAATAATTCT AAAAAATACG
AGTAATTTGT TAAATTTAAA TGTTAAAGAA ATTGATATTA GAAATATTAT TACTTCAGAT
TACTCTGACT CATTCAATGA AATAGATGGC GAATTAGCTT ATTTAATGTA CACTTCTGGA
AGTACAGGAA AGCCAAAGGG TGTTAGAGTT ACGCATTCAA ATTTAGAATA TATACTAAAT
AACATGCAGA AATACTACCC GGTTTCTAGA GATGACAAGT ATATTTTATC AACACCATTC
ACCTTTGATG TTTCAGTTGT TGAAATTTTT GGTTGGATTT ACGGTGGGGG GGCTTTAGTA
ATACCAACAC AAGAGAACTC ACGTAATTTT AGAAAACTGG CTCATTTAAT AGAAATTCAT
AAAGTAACAC ATATGGCACT CTCGCCTGCA ATTCTAAATT TGATGCTAGA TAAGTTAAAT
GAGGATGATA TTGATAAATT AGATAGAAAC CTTAAATATT TAATGGTAGC AGGTGAAGAG
TTTAAAGTTT CCCTTGCTCA TAAAGCTATT AAGTATTTAA AAAATGTTTG TATTGAAAAT
TTATATGGCC CAACTGAATG TACAGTATAT GCAACGCGTT ATAGAATTGA TCGTAACTTT
AATCGTCCAA GCGTTCCGAT TGGAAAGGAA CTCGATGGGG TGCAAATTAA AATTTTAGAC
TCTAACGGAA TTGAGGTGCC GATAGGTACT CAGGGGGAGA TGTATATTTC CGGTGAAGGA
GTTGCTAAAG GATATTTGAA CTTACCTAGT GTCAATAATG AAAAATTCTT ATTTATAGAT
GGCAAAAGAT ATTATAAAAC AGGAGATTAT GCCAAAAGAT TAAAAGACGG AAACATAGAG
TTTATAGGAA GGAAAGACTA TCAGGTTCAA ATAAACGGGA TTCGAGTAGA GTTAGGGGAG
ATTGAAGATA TTATTTTGAA AGAAATAAAA GAAATAAACA TGGTTAAGGT GTTATATAAA
AACAATAAAC TTTATTGCTT TTATCAAGGG CAAAAAGCAA TCGTACCTGA TGATATAAAG
AAGACTTTGA AGAACTTTTT GCCTTCATAT ATGATTCCGA ACTTTTATAA ACAGATAGAT
GAATTTCCAC TAACGATTAA CAGAAAAATA GATACTAAGG CGCTTATGTC TTATTATGAT
GATACTGATG TTCTACAGAA TGTTATTCAA GATGCTGTTA CTGATACGCA AAAAAAAATC
TTGAGTATAT TTAAAGAAAC TTTAAATGTA GATAGTGTTT CTTTATATGA TAGCTTCTTT
GATCTAGGTG GAGATTCATT AGATGTCATA TCTGTGATTA TAGAGTTAGA GAATTATTAT
AACATCAATT TAGATGAATC TGTGTTATAT AACCACCAGA ATGCTAGTGA ACTAGCAAGT
TATATAGAAA ATATGTTAGA GCAGGAAAAT GAAGTAACTA AAAAGGTTCA AAATACAATT
GAAGATATTA ACATCGATCA TATAAAATCG CAAGTCTCTA GCTCATACTA TAAGAATAAT
AAATTGTTAG GTACAATAGA GAAAGTTTAT CCAGTATATT ATCACCAAAA GAATTACATC
AAAGACAATT TTAATAGTGT CATCGACATC AAAATTGACG TGAAAAAAGA TTTTGAAATG
GAAAAGGTTA TTCAAGCATG TAAGGATATA ATTCTATCTA ATGAACTATT AAGGTCTGTA
ATTTCTGTGG AGAGTGAACA GATTGTATTT AAACAGGTTA AATTAGATAT AGATAGTTAT
GAAATACCAT TATTTGACCT GTCTGAATAT TCTTATGATA GTGCAATCTC ACTAGTTGAC
GAAATTACTA AGACTATGGC AGAAGTCGTC TTAAAAAATC CTTTGGATGA GCTGCTATAC
ACTGTTACGA TCTTTAAATT GCGTCAAAAG TACATTGTAG TGTTTGTGTT ATCCCACAAC
ATTGCAGATT TATCTAACAA ACACATTTTG ATTAAACAAT TTATGAATTT ATTAAACGGT
CATAAATTAG AAAATAGACC GGAATATAAA GATTTCATAG AGTTTATGGA TAGCAAAAAC
AAATTAGAGT ATATTAGTAA TTGTGATTAC ACAAAAAAAC TTCTAAAAGT CAATAACAAT
AGAGTTAAAG TACAAAGTAG TGATGATTTA TTAGTATTAA AGTTTAATTT TGACAATAAA
CTCAGAACAA CATTCGATAT AATAGATAAA ATTAATTATA TAAGTACTCA AATCCTATCT
AGGGTTATTG GACAAAAAGA GTTCATCTAT CAAACGATAG TTAACATAAG AAAATATAAA
GATTTAGACT TCAGTAATTG TATCGGTGAT TATCATACTT CGATGGTTCT TTTGGGAAAA
CCAGAGGAAA CATTTGAGGA ATTTAAAAAT AGAATGGAAG AAGTTTATTC CATGTATAGA
GATGGATTCA ATCCAATTTA TCTGTTCGCA AAAGGGTTTC CAAACATGAG CGAAACTCAC
AAAGATTTAT ACCATCTCTA CGGTGTCAAT CCTATTGCTA AGACAAACTA CTTAGGAACA
ATTAAAAATG AGCAATTAAA TGTCATGTTA GATTCATTAG AAGAAACGAG AAAAAACTTA
AGTACTTTAA GAGATAATCC GTTTTTCATA ACTTCTTTTT CAACAAAAGA CCATATCTAT
ATTGCGTTTT TAAACAAGCC AGTTAACTTA GATGAAAAAA TATATAAAGA TCTTAACGTA
TCTGAAGAAC GCATCTTTAG TAGTACTAAT ACTTTAGATA AAACACCAAT AAAATAG
 
Protein sequence
MTYKELDELS SKLANYLHEN NYSKNAFIPI YMPPCPEMII SILGVLKVGA AYLPISTEYP 
VNRINMLLED SNSQIILKNT SNLLNLNVKE IDIRNIITSD YSDSFNEIDG ELAYLMYTSG
STGKPKGVRV THSNLEYILN NMQKYYPVSR DDKYILSTPF TFDVSVVEIF GWIYGGGALV
IPTQENSRNF RKLAHLIEIH KVTHMALSPA ILNLMLDKLN EDDIDKLDRN LKYLMVAGEE
FKVSLAHKAI KYLKNVCIEN LYGPTECTVY ATRYRIDRNF NRPSVPIGKE LDGVQIKILD
SNGIEVPIGT QGEMYISGEG VAKGYLNLPS VNNEKFLFID GKRYYKTGDY AKRLKDGNIE
FIGRKDYQVQ INGIRVELGE IEDIILKEIK EINMVKVLYK NNKLYCFYQG QKAIVPDDIK
KTLKNFLPSY MIPNFYKQID EFPLTINRKI DTKALMSYYD DTDVLQNVIQ DAVTDTQKKI
LSIFKETLNV DSVSLYDSFF DLGGDSLDVI SVIIELENYY NINLDESVLY NHQNASELAS
YIENMLEQEN EVTKKVQNTI EDINIDHIKS QVSSSYYKNN KLLGTIEKVY PVYYHQKNYI
KDNFNSVIDI KIDVKKDFEM EKVIQACKDI ILSNELLRSV ISVESEQIVF KQVKLDIDSY
EIPLFDLSEY SYDSAISLVD EITKTMAEVV LKNPLDELLY TVTIFKLRQK YIVVFVLSHN
IADLSNKHIL IKQFMNLLNG HKLENRPEYK DFIEFMDSKN KLEYISNCDY TKKLLKVNNN
RVKVQSSDDL LVLKFNFDNK LRTTFDIIDK INYISTQILS RVIGQKEFIY QTIVNIRKYK
DLDFSNCIGD YHTSMVLLGK PEETFEEFKN RMEEVYSMYR DGFNPIYLFA KGFPNMSETH
KDLYHLYGVN PIAKTNYLGT IKNEQLNVML DSLEETRKNL STLRDNPFFI TSFSTKDHIY
IAFLNKPVNL DEKIYKDLNV SEERIFSSTN TLDKTPIK