Gene Haur_0866 details

Gene Information

Locus tag: Haur_0866
Symbol:
ID: 5732767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 979865
End bp: 985591
Gene Length: 5727 bp
Protein Length: 1908 aa
Translation table: 11
GC content: 55%
IMG OID: 641277998
Product: Beta-ketoacyl synthase
Protein accession: YP_001543642
Protein GI: 159897395
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
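
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(979865, 985591)
print(length, protein_length(length))  # prints "5727 1908"
```

Both values agree with the Gene Length and Protein Length fields reported for Haur_0866.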


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCACG CTATTGCCAT CGTCGGGGTG GCATGTCGCT ACCCCGATGC CGATTCACCC 
AAGGCTCTTT GGGAAATGGC CCTTGCCAAA CGCCGCGCTT TTCGGCGCAT GCCCGACCAA
CGCCTGAACA ATCTCGATTA TATTTCGTCC GATCCGGCAG CTCTCGACAG CACCTACGTT
GAATATGCCG CCGTTTTGCG TGATTATGAA TTTGATCGGG TGCGTTTTCG CGTGCCTGGC
CATAGCTTTC GCTCCGCCGA TATGGCTCAT TGGTTAGCGC TCGATGTTGC CGACCAAGCC
TTGCGTGATG CAGGTTTTGC CGATGGTCAA GGCTTGCCAC GTGAAAGCAC CGGAGTGTAC
CTTGGCAATA CATTAACTGG CGAGTTTTCA CGTGCCAATA CCTTGCGCTT GCGCTGGCCG
TTTGTACGGC GGGTGGTCGC GGCAGGCTTG GCCGCAGAAG GCTGGTCGAG CGAGCAACGT
GGCCAATTTT TGCAGCAACT CGAACAAAAC TACAAAGCAC CATTCCCGAT TGTCGATGCT
GAAACCTTGG CGGGCGGTTT ATCCAACACG ATTGCAGGCC GAATTTGCAA CCATTTTGAT
CTGATGGGCG GCGGCTATAG CGTTGATGGC GCATGTTCAT CATCATTATT AGCCATAACC
ACCGCCTGCA CTGCGCTCGC CAGTGGCGAA GTTGATGTAG CCTTGGCTGG CGGGGTCGAT
CTCAGCCTTG ATCCATTTGA GTTGGTTGGG TTTGCCCGCG CCGGAGCCTT GGCAACCGAT
TTAATGCGCA TTTACGATCA ACGCTCAGCT GGATTTTGGC CAGGCGAGGG CTGTGGCTTT
GTCACCTTGA TGCGAGCCGA AGATGCCTAC GCCGAGCAAC GCCCGATCTA CGCCGTGATT
CGCGGCTGGG GCATTTCGTC GGATGGCAGC GGCGGGATCA CTCGGCCCGA AGTTGCTGGC
CAAGTGCTAA TGCTCAAACG TGCTTATCGG CGCACTGGCT TCAATATTGA CCAAGTGGGC
TATTTTGAGG GCCATGGCAC GGGCACGGCA GTTGGCGATG CAACTGAATT GCAAGCGATC
GCCAAAGCCC GCCTTGACGT GCAAAATCCT AATTTACCAA TCGCCGCCGT TGGTTCGATC
AAAGCCAATA TTGGTCATAC CAAAGCAGCA GCGGGCATTG CAGGTTTCAT CAAAGCCACG
CTGGCAGTGC ATAGCGCAAT TTTGCCGCCA ACCACTGGCT GCGAACAACC ACACAGTGCC
TTGGAAACCT TGCGTGTACT TGCTACGCCC GAAGCATGGC CTGAACAAAC CCCGCGTCGT
GCAGGTGTGA GCAGCATGGG CTTTGGCGGC ATCAACGCCC ATATCGTACT CGAACAAGCT
GAGCAAGCCA AACCAGCGTA CAATTGGCAG CCGTTTATTG GTCAAAACCA CAGCCAAGAT
CTCGAATTAA TTGTGGTTGC CAAAGCTGAT CGTCAAGCCT TGCTGACTGA ATTGCAGCAA
TTGCGCCAAC GCGCCGAGCA ACTTTCCTAC GCCGAACTCG GCGATTTGGC GAGCCACTAT
GCCCAAACCA ACTCCACAGG TTTGGCCCGC GCCGCCGTGA TTGCCAACAA TCCACGCCAA
CTGGCAGCCA AACTTGATCT GTTGATCGCT CAGCTTGAAG CGGGAGTCAA TCAACAGCTT
GATTTCAAAC AACAGATTTT TATTGGAATT GGCAAAAAAC AACCAACGCT GGGCTTGCTT
TTTCCAGGCC AAGGTGCGCC CCAAGCCAAC CCCAAAAGCG CCGTATTTCA GCGCTTTGCC
GAGTTGAATC GATTTTTGAG CCAAGCCCAA CTCAGCCAAG CCGAGCAAAT TAACACCGCC
AACGCCCAAC CCAATATTGT GCGGGCAAGC TTGGCTGGTT TGCATCTGCT CAAGCAGTTT
AAGCTCCACG CCAGCGCTGC TGTTGGCCAT AGTTTGGGCG AATTAAGCGC CTTGCACTGG
GCCGGAGCCT ACGACCAAGC CAGCTTAATC GAGTTGGCGC AAGCACGAGG CCACGCTATG
GCTAATTATG GTCAAGCTAA TGGCGGCATG GCCAGCATCG GCGCAGATCC AAGCACGATC
AAAAGCCTGA TCAACGGCGA TCAGGCGGTA ATTGCTGGCT ATAACGGGCC ACAACAAACC
GTAATTGCAG GCAGCCGCGA GGCCATGAAC ACCCTCGTTG AACGGGCGCA GCAGCAAGGT
TTAGCCGCCA CCAACTTGGC AGTTTCCCAT GCCTTTCATT CACCCATGAT GCAGCCAGCG
ATTCCAGTAC TCCAAGCGCA TGTCGCAAAC CTCAGCGCTC AGCCATTGCA AAGTACCGTC
TATTCCACCA TCACTGGCAC AAAACTCAGT GCTCAGGTTG AGCTTGGCAG CTTGCTCAGC
CAACAATTGA CTGATCCGGT GCGCTTTGTT GAGGCGATTG ATGGGCTAAG CGAGTGCGAT
TTGCTGATCG AAGTTGGCCC AGGCACGATT TTGAGCCGCC TGGCTAGCGA ATGCATCGGC
GTGCCCGCAG TTTCCTTGGA GGTTGAAAGC AATTCGCTAG CTGGCTTATT CAAGAGCCTT
GCGGCAGCCT TTGTGCTGGG CAGCCCGTTG GATTTGAGCT ATCTAGCCCA AACCCGTTTC
AGCCGCCCAT TTGATCTGCG CCATGAACCA AGCTTTTTGA CCAACCCTTG CGAAACGGCT
CCAGCCCAAC TTGATGATCA TCAGCCAAGT TTGAGCCTCA CCATCAGCCC TAGCGCCCAG
CTTGCCAGCC CAACCACCAG CAATGCGACT GATCCCTTGC AAATTGTACG CGAGTTGGTG
GCTGCTCGCA GCGAGTTACC ACTAGCCTCG ATCAACGACA GCGACCATGT GCTTGGCGAT
TTGCACCTTA ATTCGTTGAC TGTTGGCCAG ATTGTTAGCG AGGCAGCCCG CCTTTTGGGC
TTACAACCGC CGTTGGCTCC AACCAGCTAT GCCAATGCCA CAGTTGGCCA ATTGGCCCAA
GCCATCAGCG AATTGCAAAA CAGTGCCAAT ACACCCAGCG TTACCCCAGG CTACCCAGGC
ATTGCGCCGT GGGTTGAAAG TTTCGTGATC ACGTTAGTTG AGCAACCAGT GCCCGCGCCC
AAAATTAGCC GCTTGGCTAG CCAATGGCAA TTGTTCCACG AGCCAAATTA TGCTTTAGCG
CATGCGCTGA GCAGTGCCTT TCAACAGCAA GTTGGCCAAG GCGTGGTAGT TTGCGTTGGC
GAAACCGTTG ATGCGGCGAC GATTGAGCGC TTGCTTGAGG CCGCCCGATT TGCCTTGAGC
CAAAGCAACC CCCAGCATTT TGTATTGGTG CAGCATGGCG AGGGCTTGGC GGGTTTTGCT
CGCACCTTGG CCTTGGAAAA TCCACAATTA GCGGTCGCAG CGGTGCATGT GCCAATCGAT
GCCCCCCAGG CTTGCGATTG GATCGTGGCC GAAGCGCTGG CCAGCACTGG CTATCTCGAA
GCTCACTATG AGCACAACGG GCGGCGCACT AGCCCAATCT TGCAATTACT GCACCTTGGC
GAGGATAGCG AATTACCGCT CGGCCCCGAT GACGTGATTT TGGCGACAGG TGGTGGCAAG
GGCATCACCG CCGAAAGTGT TTATGCAATC GCCAAAGCTA GCGGAGCCAA GTTGGCTTTG
TTGGGGCGTT CGCAGCCCAG CAACGACCAA GAATTAGCCC AAAACTTGGA GCGCATGCAA
GCCGCAGGCA TCACGGTTGG TTATTGGGCG GTCGATGTGG GCGATGCAGC CAGCGTACAG
CAGGCAATGA ATACCATTCA AGCTCAATTA GGCATCGTGA CGATGGTCCT GCATGGCGCT
GCCCGCAACG TGCCCAGTTT GATTCGTAAC CTTGATCGCG CTAGTTTCGA GGCAACGTTA
ACGCCCAAAG TCCAAGGTCT AAACAACGTT TTAGCGGCGC TGGATCAGCA ACAATTGCGC
TTTGTGGTCG GCTTCGGCTC GATCATCGGG CGCATGGGCT TGGCTGGCGA TGCTGATTAT
GCCGTGGCCA ACGAGCAAAT GCGCCGAATC ATTGAGCAAG GCCAACACGA TTATCCCAAT
TGTCGTTGGC TCAGCATCGA ATGGTCGATT TGGTCGGATG TTGGCATGGG CGTGCGGCTT
GGCGGAGTCG ATCAATTGCT CCAAGCTGGC ATCAGCCCAA TTCCACCCGA TACTGGGATC
AATTTGTTGT TGCGTTTGCT GGCTAACCCA ATCGCTAGCA GCCATGTGGT CGTCACTGGA
CGCTATGGCG AATTGCCAAC CTTGCAAACG ATTCAGCCCG AGCTGCCATT CCTGCGCTTC
CTTGAGCGCC AATGTTTGTA TTATCCGCAG ATTGAATTGA TTGTTGAAGC CCAACTCTCA
TCAGCCAACG ATCCATATGT GGTTGATCAT AGCTATCACG GCGAACAACT CTTCCCAACG
GTGATTGGTC TTGAAGCCAT GGCCCAAGTT GCCATGGCCT TAACTGGCTC ACAGCAGATT
CCAACATTTG AGCAGGTTGC GCTCCAACGG CCAATTGTTG TGCCTGCTAA CGAGCTGCTG
ACAATTCGGA TTTGTAGTTT ACAAGTTGCC AAGGGCGTGG TCAAATTGGC GATTCGCAGC
CAAGAAACCT TGTTCCAAGT CGATCACTTT AGCGCGATTG CGCGATTCGA TCAGCCTGCC
AACTTCGGTG CTGCACCTAA CCAAATTGAC TGGCCAGTAC TCACGCTTGA CCCTGTTGCT
GATATTTATG AGCCATTGCT GTTCCATCAA GGCCGCTTCC AACGCTTGCA AAACTATCGC
TATCTGACGG CGCGGCACTG CATTGCCAAC TTGGCGACGC GCAACGAGCC ATGGTTTGGG
CGCTATCTGC CCCAACGCAG CGTTTTAGGC GATGCTGGCA TGCGCGATGC CTTGATTCAT
GCCTTGCAAG TGTGTGTGCC TTATGCCCAA GTCTTGCCAG TCGCGGTCGA ACGGATTAGC
TGCCAAAGCC CCAGCCAGCC TAGCGATTGG ACGATTTATG CCCAAGAACG CGCTTGGGAT
GGCAACATGT TCACCTACGA TGTGATTGCT GTCGATCAAC AGGGCAATGT CGTAGAAGAA
TGGCAGGGCT TGCGTTTGCA GCTCGTCGAG GGCAGCGGCT ACAAAGACGC ATGGCCCGCC
AGCTTGCTTG GCGCATATCT GGAGCGTCAA GTGCGCCAAG TCTTGCCGCA TACCAGTTTG
ACCATTGCCG TCGAAACAGA TCCCGCACTC GAACGCCAAC AACGCAGCGA TTTGGCACTG
CAACGGGCCA TCGGCCAACG TCAGCCAATT CAACGGCGCA GCGATGGCAA ACCTGAGGTA
GCAGATTATG TGGTTTCGGC CAGCCATTAT GGCCAATTGA CGCTGGCGGT TGCCGCCAAA
GAACCGATTA GCTGTGATCT TGAGCCGATT AGCCCACGCA GCGTTGAGCA ATGGCTTGAT
CTGCTTGGCG CTGAGCGCAT GCAACTGGCC AAGCTCATTC AACAGCAAAC TGGCTGGACG
CTTGACCAAG CGGCAACCCA GATTTGGACA GCCTTGGAAT GTTTGAGCAA AGTTGGCGCA
GCTTTTGATA GCCCATTGCG TTTGGAACCA CAAGCCGCAA ATAATTGGTT GGTATTGCAG
ACTGGTCAGT ATCGAATTGT ATCGCAGCAA CTCAATGTGC GCGATACCGA GCTGCCAGTC
GTGGTTAGTC TGTTGGTAGG AGCCTAA
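
A GC content figure such as the one in the Gene Information section can be recomputed from the nucleotide sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks of ten as shown here. `gc_percent` is a hypothetical helper, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence above, used only as a small example.
print(gc_percent("ATGAGCCACG CTATTGCCAT"))  # prints 50.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.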
 
Protein sequence
MSHAIAIVGV ACRYPDADSP KALWEMALAK RRAFRRMPDQ RLNNLDYISS DPAALDSTYV 
EYAAVLRDYE FDRVRFRVPG HSFRSADMAH WLALDVADQA LRDAGFADGQ GLPRESTGVY
LGNTLTGEFS RANTLRLRWP FVRRVVAAGL AAEGWSSEQR GQFLQQLEQN YKAPFPIVDA
ETLAGGLSNT IAGRICNHFD LMGGGYSVDG ACSSSLLAIT TACTALASGE VDVALAGGVD
LSLDPFELVG FARAGALATD LMRIYDQRSA GFWPGEGCGF VTLMRAEDAY AEQRPIYAVI
RGWGISSDGS GGITRPEVAG QVLMLKRAYR RTGFNIDQVG YFEGHGTGTA VGDATELQAI
AKARLDVQNP NLPIAAVGSI KANIGHTKAA AGIAGFIKAT LAVHSAILPP TTGCEQPHSA
LETLRVLATP EAWPEQTPRR AGVSSMGFGG INAHIVLEQA EQAKPAYNWQ PFIGQNHSQD
LELIVVAKAD RQALLTELQQ LRQRAEQLSY AELGDLASHY AQTNSTGLAR AAVIANNPRQ
LAAKLDLLIA QLEAGVNQQL DFKQQIFIGI GKKQPTLGLL FPGQGAPQAN PKSAVFQRFA
ELNRFLSQAQ LSQAEQINTA NAQPNIVRAS LAGLHLLKQF KLHASAAVGH SLGELSALHW
AGAYDQASLI ELAQARGHAM ANYGQANGGM ASIGADPSTI KSLINGDQAV IAGYNGPQQT
VIAGSREAMN TLVERAQQQG LAATNLAVSH AFHSPMMQPA IPVLQAHVAN LSAQPLQSTV
YSTITGTKLS AQVELGSLLS QQLTDPVRFV EAIDGLSECD LLIEVGPGTI LSRLASECIG
VPAVSLEVES NSLAGLFKSL AAAFVLGSPL DLSYLAQTRF SRPFDLRHEP SFLTNPCETA
PAQLDDHQPS LSLTISPSAQ LASPTTSNAT DPLQIVRELV AARSELPLAS INDSDHVLGD
LHLNSLTVGQ IVSEAARLLG LQPPLAPTSY ANATVGQLAQ AISELQNSAN TPSVTPGYPG
IAPWVESFVI TLVEQPVPAP KISRLASQWQ LFHEPNYALA HALSSAFQQQ VGQGVVVCVG
ETVDAATIER LLEAARFALS QSNPQHFVLV QHGEGLAGFA RTLALENPQL AVAAVHVPID
APQACDWIVA EALASTGYLE AHYEHNGRRT SPILQLLHLG EDSELPLGPD DVILATGGGK
GITAESVYAI AKASGAKLAL LGRSQPSNDQ ELAQNLERMQ AAGITVGYWA VDVGDAASVQ
QAMNTIQAQL GIVTMVLHGA ARNVPSLIRN LDRASFEATL TPKVQGLNNV LAALDQQQLR
FVVGFGSIIG RMGLAGDADY AVANEQMRRI IEQGQHDYPN CRWLSIEWSI WSDVGMGVRL
GGVDQLLQAG ISPIPPDTGI NLLLRLLANP IASSHVVVTG RYGELPTLQT IQPELPFLRF
LERQCLYYPQ IELIVEAQLS SANDPYVVDH SYHGEQLFPT VIGLEAMAQV AMALTGSQQI
PTFEQVALQR PIVVPANELL TIRICSLQVA KGVVKLAIRS QETLFQVDHF SAIARFDQPA
NFGAAPNQID WPVLTLDPVA DIYEPLLFHQ GRFQRLQNYR YLTARHCIAN LATRNEPWFG
RYLPQRSVLG DAGMRDALIH ALQVCVPYAQ VLPVAVERIS CQSPSQPSDW TIYAQERAWD
GNMFTYDVIA VDQQGNVVEE WQGLRLQLVE GSGYKDAWPA SLLGAYLERQ VRQVLPHTSL
TIAVETDPAL ERQQRSDLAL QRAIGQRQPI QRRSDGKPEV ADYVVSASHY GQLTLAVAAK
EPISCDLEPI SPRSVEQWLD LLGAERMQLA KLIQQQTGWT LDQAATQIWT ALECLSKVGA
AFDSPLRLEP QAANNWLVLQ TGQYRIVSQQ LNVRDTELPV VVSLLVGA
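
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own method. It builds the codon table from the standard amino acid string (translation table 11 uses the same codon assignments as the standard code and differs only in its permitted start codons), and it assumes the gene starts with ATG, as this one does. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop the spaces between blocks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


first_line = "ATGAGCCACG CTATTGCCAT CGTCGGGGTG GCATGTCGCT ACCCCGATGC CGATTCACCC"
print(translate(first_line))  # prints "MSHAIAIVGVACRYPDADSP"
```

The output matches the first twenty residues of the protein sequence above. The last codons of the gene, GTA GGA GCC TAA, translate to V, G, A and a stop, which matches the final residues of the protein.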