Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0866 |
Symbol | |
ID | 5732767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 979865 |
End bp | 985591 |
Gene Length | 5727 bp |
Protein Length | 1908 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641277998 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001543642 |
Protein GI | 159897395 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACG CTATTGCCAT CGTCGGGGTG GCATGTCGCT ACCCCGATGC CGATTCACCC AAGGCTCTTT GGGAAATGGC CCTTGCCAAA CGCCGCGCTT TTCGGCGCAT GCCCGACCAA CGCCTGAACA ATCTCGATTA TATTTCGTCC GATCCGGCAG CTCTCGACAG CACCTACGTT GAATATGCCG CCGTTTTGCG TGATTATGAA TTTGATCGGG TGCGTTTTCG CGTGCCTGGC CATAGCTTTC GCTCCGCCGA TATGGCTCAT TGGTTAGCGC TCGATGTTGC CGACCAAGCC TTGCGTGATG CAGGTTTTGC CGATGGTCAA GGCTTGCCAC GTGAAAGCAC CGGAGTGTAC CTTGGCAATA CATTAACTGG CGAGTTTTCA CGTGCCAATA CCTTGCGCTT GCGCTGGCCG TTTGTACGGC GGGTGGTCGC GGCAGGCTTG GCCGCAGAAG GCTGGTCGAG CGAGCAACGT GGCCAATTTT TGCAGCAACT CGAACAAAAC TACAAAGCAC CATTCCCGAT TGTCGATGCT GAAACCTTGG CGGGCGGTTT ATCCAACACG ATTGCAGGCC GAATTTGCAA CCATTTTGAT CTGATGGGCG GCGGCTATAG CGTTGATGGC GCATGTTCAT CATCATTATT AGCCATAACC ACCGCCTGCA CTGCGCTCGC CAGTGGCGAA GTTGATGTAG CCTTGGCTGG CGGGGTCGAT CTCAGCCTTG ATCCATTTGA GTTGGTTGGG TTTGCCCGCG CCGGAGCCTT GGCAACCGAT TTAATGCGCA TTTACGATCA ACGCTCAGCT GGATTTTGGC CAGGCGAGGG CTGTGGCTTT GTCACCTTGA TGCGAGCCGA AGATGCCTAC GCCGAGCAAC GCCCGATCTA CGCCGTGATT CGCGGCTGGG GCATTTCGTC GGATGGCAGC GGCGGGATCA CTCGGCCCGA AGTTGCTGGC CAAGTGCTAA TGCTCAAACG TGCTTATCGG CGCACTGGCT TCAATATTGA CCAAGTGGGC TATTTTGAGG GCCATGGCAC GGGCACGGCA GTTGGCGATG CAACTGAATT GCAAGCGATC GCCAAAGCCC GCCTTGACGT GCAAAATCCT AATTTACCAA TCGCCGCCGT TGGTTCGATC AAAGCCAATA TTGGTCATAC CAAAGCAGCA GCGGGCATTG CAGGTTTCAT CAAAGCCACG CTGGCAGTGC ATAGCGCAAT TTTGCCGCCA ACCACTGGCT GCGAACAACC ACACAGTGCC TTGGAAACCT TGCGTGTACT TGCTACGCCC GAAGCATGGC CTGAACAAAC CCCGCGTCGT GCAGGTGTGA GCAGCATGGG CTTTGGCGGC ATCAACGCCC ATATCGTACT CGAACAAGCT GAGCAAGCCA AACCAGCGTA CAATTGGCAG CCGTTTATTG GTCAAAACCA CAGCCAAGAT CTCGAATTAA TTGTGGTTGC CAAAGCTGAT CGTCAAGCCT TGCTGACTGA ATTGCAGCAA TTGCGCCAAC GCGCCGAGCA ACTTTCCTAC GCCGAACTCG GCGATTTGGC GAGCCACTAT GCCCAAACCA ACTCCACAGG TTTGGCCCGC GCCGCCGTGA TTGCCAACAA TCCACGCCAA CTGGCAGCCA AACTTGATCT GTTGATCGCT CAGCTTGAAG CGGGAGTCAA TCAACAGCTT GATTTCAAAC AACAGATTTT TATTGGAATT GGCAAAAAAC AACCAACGCT GGGCTTGCTT TTTCCAGGCC AAGGTGCGCC CCAAGCCAAC CCCAAAAGCG CCGTATTTCA GCGCTTTGCC GAGTTGAATC GATTTTTGAG CCAAGCCCAA CTCAGCCAAG CCGAGCAAAT TAACACCGCC AACGCCCAAC CCAATATTGT GCGGGCAAGC TTGGCTGGTT TGCATCTGCT CAAGCAGTTT AAGCTCCACG CCAGCGCTGC TGTTGGCCAT AGTTTGGGCG AATTAAGCGC CTTGCACTGG GCCGGAGCCT ACGACCAAGC CAGCTTAATC GAGTTGGCGC AAGCACGAGG CCACGCTATG GCTAATTATG GTCAAGCTAA TGGCGGCATG GCCAGCATCG GCGCAGATCC AAGCACGATC AAAAGCCTGA TCAACGGCGA TCAGGCGGTA ATTGCTGGCT ATAACGGGCC ACAACAAACC GTAATTGCAG GCAGCCGCGA GGCCATGAAC ACCCTCGTTG AACGGGCGCA GCAGCAAGGT TTAGCCGCCA CCAACTTGGC AGTTTCCCAT GCCTTTCATT CACCCATGAT GCAGCCAGCG ATTCCAGTAC TCCAAGCGCA TGTCGCAAAC CTCAGCGCTC AGCCATTGCA AAGTACCGTC TATTCCACCA TCACTGGCAC AAAACTCAGT GCTCAGGTTG AGCTTGGCAG CTTGCTCAGC CAACAATTGA CTGATCCGGT GCGCTTTGTT GAGGCGATTG ATGGGCTAAG CGAGTGCGAT TTGCTGATCG AAGTTGGCCC AGGCACGATT TTGAGCCGCC TGGCTAGCGA ATGCATCGGC GTGCCCGCAG TTTCCTTGGA GGTTGAAAGC AATTCGCTAG CTGGCTTATT CAAGAGCCTT GCGGCAGCCT TTGTGCTGGG CAGCCCGTTG GATTTGAGCT ATCTAGCCCA AACCCGTTTC AGCCGCCCAT TTGATCTGCG CCATGAACCA AGCTTTTTGA CCAACCCTTG CGAAACGGCT CCAGCCCAAC TTGATGATCA TCAGCCAAGT TTGAGCCTCA CCATCAGCCC TAGCGCCCAG CTTGCCAGCC CAACCACCAG CAATGCGACT GATCCCTTGC AAATTGTACG CGAGTTGGTG GCTGCTCGCA GCGAGTTACC ACTAGCCTCG ATCAACGACA GCGACCATGT GCTTGGCGAT TTGCACCTTA ATTCGTTGAC TGTTGGCCAG ATTGTTAGCG AGGCAGCCCG CCTTTTGGGC TTACAACCGC CGTTGGCTCC AACCAGCTAT GCCAATGCCA CAGTTGGCCA ATTGGCCCAA GCCATCAGCG AATTGCAAAA CAGTGCCAAT ACACCCAGCG TTACCCCAGG CTACCCAGGC ATTGCGCCGT GGGTTGAAAG TTTCGTGATC ACGTTAGTTG AGCAACCAGT GCCCGCGCCC AAAATTAGCC GCTTGGCTAG CCAATGGCAA TTGTTCCACG AGCCAAATTA TGCTTTAGCG CATGCGCTGA GCAGTGCCTT TCAACAGCAA GTTGGCCAAG GCGTGGTAGT TTGCGTTGGC GAAACCGTTG ATGCGGCGAC GATTGAGCGC TTGCTTGAGG CCGCCCGATT TGCCTTGAGC CAAAGCAACC CCCAGCATTT TGTATTGGTG CAGCATGGCG AGGGCTTGGC GGGTTTTGCT CGCACCTTGG CCTTGGAAAA TCCACAATTA GCGGTCGCAG CGGTGCATGT GCCAATCGAT GCCCCCCAGG CTTGCGATTG GATCGTGGCC GAAGCGCTGG CCAGCACTGG CTATCTCGAA GCTCACTATG AGCACAACGG GCGGCGCACT AGCCCAATCT TGCAATTACT GCACCTTGGC GAGGATAGCG AATTACCGCT CGGCCCCGAT GACGTGATTT TGGCGACAGG TGGTGGCAAG GGCATCACCG CCGAAAGTGT TTATGCAATC GCCAAAGCTA GCGGAGCCAA GTTGGCTTTG TTGGGGCGTT CGCAGCCCAG CAACGACCAA GAATTAGCCC AAAACTTGGA GCGCATGCAA GCCGCAGGCA TCACGGTTGG TTATTGGGCG GTCGATGTGG GCGATGCAGC CAGCGTACAG CAGGCAATGA ATACCATTCA AGCTCAATTA GGCATCGTGA CGATGGTCCT GCATGGCGCT GCCCGCAACG TGCCCAGTTT GATTCGTAAC CTTGATCGCG CTAGTTTCGA GGCAACGTTA ACGCCCAAAG TCCAAGGTCT AAACAACGTT TTAGCGGCGC TGGATCAGCA ACAATTGCGC TTTGTGGTCG GCTTCGGCTC GATCATCGGG CGCATGGGCT TGGCTGGCGA TGCTGATTAT GCCGTGGCCA ACGAGCAAAT GCGCCGAATC ATTGAGCAAG GCCAACACGA TTATCCCAAT TGTCGTTGGC TCAGCATCGA ATGGTCGATT TGGTCGGATG TTGGCATGGG CGTGCGGCTT GGCGGAGTCG ATCAATTGCT CCAAGCTGGC ATCAGCCCAA TTCCACCCGA TACTGGGATC AATTTGTTGT TGCGTTTGCT GGCTAACCCA ATCGCTAGCA GCCATGTGGT CGTCACTGGA CGCTATGGCG AATTGCCAAC CTTGCAAACG ATTCAGCCCG AGCTGCCATT CCTGCGCTTC CTTGAGCGCC AATGTTTGTA TTATCCGCAG ATTGAATTGA TTGTTGAAGC CCAACTCTCA TCAGCCAACG ATCCATATGT GGTTGATCAT AGCTATCACG GCGAACAACT CTTCCCAACG GTGATTGGTC TTGAAGCCAT GGCCCAAGTT GCCATGGCCT TAACTGGCTC ACAGCAGATT CCAACATTTG AGCAGGTTGC GCTCCAACGG CCAATTGTTG TGCCTGCTAA CGAGCTGCTG ACAATTCGGA TTTGTAGTTT ACAAGTTGCC AAGGGCGTGG TCAAATTGGC GATTCGCAGC CAAGAAACCT TGTTCCAAGT CGATCACTTT AGCGCGATTG CGCGATTCGA TCAGCCTGCC AACTTCGGTG CTGCACCTAA CCAAATTGAC TGGCCAGTAC TCACGCTTGA CCCTGTTGCT GATATTTATG AGCCATTGCT GTTCCATCAA GGCCGCTTCC AACGCTTGCA AAACTATCGC TATCTGACGG CGCGGCACTG CATTGCCAAC TTGGCGACGC GCAACGAGCC ATGGTTTGGG CGCTATCTGC CCCAACGCAG CGTTTTAGGC GATGCTGGCA TGCGCGATGC CTTGATTCAT GCCTTGCAAG TGTGTGTGCC TTATGCCCAA GTCTTGCCAG TCGCGGTCGA ACGGATTAGC TGCCAAAGCC CCAGCCAGCC TAGCGATTGG ACGATTTATG CCCAAGAACG CGCTTGGGAT GGCAACATGT TCACCTACGA TGTGATTGCT GTCGATCAAC AGGGCAATGT CGTAGAAGAA TGGCAGGGCT TGCGTTTGCA GCTCGTCGAG GGCAGCGGCT ACAAAGACGC ATGGCCCGCC AGCTTGCTTG GCGCATATCT GGAGCGTCAA GTGCGCCAAG TCTTGCCGCA TACCAGTTTG ACCATTGCCG TCGAAACAGA TCCCGCACTC GAACGCCAAC AACGCAGCGA TTTGGCACTG CAACGGGCCA TCGGCCAACG TCAGCCAATT CAACGGCGCA GCGATGGCAA ACCTGAGGTA GCAGATTATG TGGTTTCGGC CAGCCATTAT GGCCAATTGA CGCTGGCGGT TGCCGCCAAA GAACCGATTA GCTGTGATCT TGAGCCGATT AGCCCACGCA GCGTTGAGCA ATGGCTTGAT CTGCTTGGCG CTGAGCGCAT GCAACTGGCC AAGCTCATTC AACAGCAAAC TGGCTGGACG CTTGACCAAG CGGCAACCCA GATTTGGACA GCCTTGGAAT GTTTGAGCAA AGTTGGCGCA GCTTTTGATA GCCCATTGCG TTTGGAACCA CAAGCCGCAA ATAATTGGTT GGTATTGCAG ACTGGTCAGT ATCGAATTGT ATCGCAGCAA CTCAATGTGC GCGATACCGA GCTGCCAGTC GTGGTTAGTC TGTTGGTAGG AGCCTAA
|
Protein sequence | MSHAIAIVGV ACRYPDADSP KALWEMALAK RRAFRRMPDQ RLNNLDYISS DPAALDSTYV EYAAVLRDYE FDRVRFRVPG HSFRSADMAH WLALDVADQA LRDAGFADGQ GLPRESTGVY LGNTLTGEFS RANTLRLRWP FVRRVVAAGL AAEGWSSEQR GQFLQQLEQN YKAPFPIVDA ETLAGGLSNT IAGRICNHFD LMGGGYSVDG ACSSSLLAIT TACTALASGE VDVALAGGVD LSLDPFELVG FARAGALATD LMRIYDQRSA GFWPGEGCGF VTLMRAEDAY AEQRPIYAVI RGWGISSDGS GGITRPEVAG QVLMLKRAYR RTGFNIDQVG YFEGHGTGTA VGDATELQAI AKARLDVQNP NLPIAAVGSI KANIGHTKAA AGIAGFIKAT LAVHSAILPP TTGCEQPHSA LETLRVLATP EAWPEQTPRR AGVSSMGFGG INAHIVLEQA EQAKPAYNWQ PFIGQNHSQD LELIVVAKAD RQALLTELQQ LRQRAEQLSY AELGDLASHY AQTNSTGLAR AAVIANNPRQ LAAKLDLLIA QLEAGVNQQL DFKQQIFIGI GKKQPTLGLL FPGQGAPQAN PKSAVFQRFA ELNRFLSQAQ LSQAEQINTA NAQPNIVRAS LAGLHLLKQF KLHASAAVGH SLGELSALHW AGAYDQASLI ELAQARGHAM ANYGQANGGM ASIGADPSTI KSLINGDQAV IAGYNGPQQT VIAGSREAMN TLVERAQQQG LAATNLAVSH AFHSPMMQPA IPVLQAHVAN LSAQPLQSTV YSTITGTKLS AQVELGSLLS QQLTDPVRFV EAIDGLSECD LLIEVGPGTI LSRLASECIG VPAVSLEVES NSLAGLFKSL AAAFVLGSPL DLSYLAQTRF SRPFDLRHEP SFLTNPCETA PAQLDDHQPS LSLTISPSAQ LASPTTSNAT DPLQIVRELV AARSELPLAS INDSDHVLGD LHLNSLTVGQ IVSEAARLLG LQPPLAPTSY ANATVGQLAQ AISELQNSAN TPSVTPGYPG IAPWVESFVI TLVEQPVPAP KISRLASQWQ LFHEPNYALA HALSSAFQQQ VGQGVVVCVG ETVDAATIER LLEAARFALS QSNPQHFVLV QHGEGLAGFA RTLALENPQL AVAAVHVPID APQACDWIVA EALASTGYLE AHYEHNGRRT SPILQLLHLG EDSELPLGPD DVILATGGGK GITAESVYAI AKASGAKLAL LGRSQPSNDQ ELAQNLERMQ AAGITVGYWA VDVGDAASVQ QAMNTIQAQL GIVTMVLHGA ARNVPSLIRN LDRASFEATL TPKVQGLNNV LAALDQQQLR FVVGFGSIIG RMGLAGDADY AVANEQMRRI IEQGQHDYPN CRWLSIEWSI WSDVGMGVRL GGVDQLLQAG ISPIPPDTGI NLLLRLLANP IASSHVVVTG RYGELPTLQT IQPELPFLRF LERQCLYYPQ IELIVEAQLS SANDPYVVDH SYHGEQLFPT VIGLEAMAQV AMALTGSQQI PTFEQVALQR PIVVPANELL TIRICSLQVA KGVVKLAIRS QETLFQVDHF SAIARFDQPA NFGAAPNQID WPVLTLDPVA DIYEPLLFHQ GRFQRLQNYR YLTARHCIAN LATRNEPWFG RYLPQRSVLG DAGMRDALIH ALQVCVPYAQ VLPVAVERIS CQSPSQPSDW TIYAQERAWD GNMFTYDVIA VDQQGNVVEE WQGLRLQLVE GSGYKDAWPA SLLGAYLERQ VRQVLPHTSL TIAVETDPAL ERQQRSDLAL QRAIGQRQPI QRRSDGKPEV ADYVVSASHY GQLTLAVAAK EPISCDLEPI SPRSVEQWLD LLGAERMQLA KLIQQQTGWT LDQAATQIWT ALECLSKVGA AFDSPLRLEP QAANNWLVLQ TGQYRIVSQQ LNVRDTELPV VVSLLVGA
|
| |