Gene Haur_3962 details

Gene Information

Locus tag: Haur_3962
Symbol:
ID: 5735823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4990434
End bp: 4995182
Gene Length: 4749 bp
Protein Length: 1582 aa
Translation table: 11
GC content: 65%
IMG OID: 641281112
Product: Beta-ketoacyl synthase
Protein accession: YP_001546722
Protein GI: 159900475
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
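
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4990434, 4995182)
print(length)                  # 4749
print(protein_length(length))  # 1582
```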


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGATC GCGAATCACA GGATTTCGAA TTCGATACCG CTGTTGCCAT CATCGGCATG 
GCAGGGCGTT TCCCTGGCGC CAACACGCTG GATCAGTTCT GGCACAACAT GACCCAGGGC
GTGCAATCGA TCCGGTTTTT TTCCGATGAA GAACTGCTGG CCGCCGGGGT GGACCCGGAT
CTGATGAGCC AACCCGAGTA TGTGAAAGCC GGGACGGTCA TCGACAATAT CGATTCCTTT
GACTCCGCGT TCTTTGGCTT TACGCCGCGC GAGGCCGAGT TGATGGACCC GCAGTTGCGC
TTGTTTCTAG AATGCTCTTG GGAGGCGTTC GAGGACGCCG CTTATAGCCC GGAAACCTAC
CAAGGTCTGG TTGGGGTGTT CGCCGGATCG GCCATCTCGA CCTATATGTT GAATAATATC
TTTAACAACG CCGAGGTGTT CCGCAAAGCG GGCATGCTCC AGGTCGGCGT CCTGAACTCG
TCGGACTCGC TTTCGACCTG GGTCTCCTAC AAGCTTAACT TCCGCGGGCC GAGCGTGGTC
GTACAAACTT TTTGCTCGAC CTCGCTGGTG GCAGTCCACA TGGCCTGCCA GAGCCTGCTC
AACTACGAGT GCGATATGGC GCTGGCTGGC GGTGTTGCGA TCTCGGTACC CCATGGAACC
GGCTATGTGT ACCAGGAGGG CGGCATTGTT TCGCCCGACG GCCAGTGCCG CACCTTCGAC
GCCGACGGCC AGGGCAGCGT GATGAGCAAC GGCGCCGGCG TCGTCGCGCT CAAGCGCCTC
GATCAGGCGG TCGCAGATGG AGACCACGTG TACGCCGTCA TCCGCGGCTC GGCGGTCAAC
AACGACGGAA TCCGCAAAGT CGGCTACACC GCCCCTGGCC TAGAAGGGCA GTCGTCGGTG
ATCGCCGAAG CGCTGGCCCA CGCTGGCGTC GACCCTGCGA CGGTGGGCTA CCTTGAGGCC
CACGGCACCG CCACTGCGCT GGGCGACTCG ATCGAGCTCG CGGCGACGAT CAAGGCCTAC
AAGCAGCAGA CCGACCAGAC CCAGTACTGT GCGCTCGGTT CGGTCAAGCC CAACGTTGGT
CATCTCGACC GCGCCGCAGG CGTGACCGGC CTGATCAAAA CCGTTCTGGC GCTGAAGCAC
CGAGAGATTC CACCGAGTCT TAATTTCGAG CAGGCCAGCC CCGAGATTGA TTTGCCCAAT
AGCCCGTTTT TTGTCAATAC GACGTTGCGG CCCTGGGAGA CCGACGGTCG GACGCCGCGC
CGCGCCGGCG TCAGCTCGTT CGGCCTGGGC GGCACCAATG CCCACGTCGT GCTCCAGGAA
ACGCCGCTCG AGGCTCCGTC CGGTCGATCA TACCCGCAGC AGCTGTTGCT GCTGTCGGCC
AAAACCGACT CCGCCCTGCA AACAATGGCC GCCAACTTGG CCAGCTTCTT GCGTGCCCAT
CCCGAGGTGG ACTTGGCCGA CGTTGCCCAT ACCCTCCAGG TTGGGCGCAC CGCCTTCAAC
CACCGGCGCG CCCTGGTGGC CCGCGACCGT GACGATGCCA TCGCGCAGTT GGAGGCGGCT
GGCGCCCGCG GGCTGACCGC CAACCAGACC GACCGTGATC GGCCGGTGGC CTTCCTGTTC
CCGGGCGTCG GCGACCACTA TGCAGGGATG GCCGCGGACC TGTACACTCA CGAAGCGCGC
TTTCGCGCGG TGGTTGACGA GTGCTGCACG CTGCTGAATC CGCTGCTGGA TCAGGATTTG
CTGGCGGTGC TGTATCCAGA GTCCGGGCGC GGCAACGGGG CGCCGGCGGC CGGCCTGGAT
TTCCGCCAGT TGCTGGCAGG CCTGCCGGCC GGAACGCCTG CCGGGACGCT GCACCAGACG
GAGCTGGCCC AGCCGGCGGT GTTTGTAGTC GAGTACGCGC TGGTGCAATT GCTGGCGAGC
TGGGGCATCC GGCCGCAGGC GCTGCTGGGC TACAGCCTGG GCGAATACGT CGCGGCAACG
GTCGCCGGCG TGCTGAGCCT GGAGGACGCG CTGCGTCTGG TGGCCCTCCG CGCCAGGCTT
ATCCAGAATT TGCCTGCCGG CGCCATGCTG GCGGTCAGCC TAGGGGAGGA CGATGCGCGG
CGCTACGTCC GAGGCGATGT CGCCTTGGCA GCGGTTAACA GCCCGAGCGC CTCCATCCTG
GCCGGCCCGG CGGCGGCTCT TGAGGCCGTG GCCAGGCAAT GCGCCGCCGA TGAAGTTGCC
TGCCGCTGGC TGGAGACGAG CCATGCCTTC CATTCGGCGA TGCTGGAGCC GGCCCGCGCG
GCCCTGACCG ATCTGACCTG CTCGCTGACC CTCAACCCGC CGGCCATACC GTATGTCTCC
AACGTCACCG GCACCTGGAT TACCGTCGAA GAGGCCACCG ACCCGGGCTA CTGGGCGCGG
CACATGTGTC AGACCGTGCG GTTTGCCGCC GGAGCTGGCG CGCTGCTGGA GGGGGAGCCA
GCGTTGATCC TGGAAGTCGG CCCTGGTCAG GCGCTGGCGT CGTTTGTCAA GCAGCATTCA
GCCTGCTCCC GCGAGCGTAT GGGCCAGATC CTGAGCGCGC TGCCGGCGTC CCACGGTCGC
CAGGCCGAGT TGTCCCACGT GCTGGAAACC CTGGGCCGGC TGTGGCTTGC CGGGGTGAAC
ATTGACTGGG CCGCGTTCTC CGCGGGCGAA CAACGGCGGC GGCTTTCGCT GCCGACCTAT
CCGTTTGAGC GCCAGCGCTA CTGGGTGGAC GCCGATGCGC ACGGTAAGTC GGGCAGTTCG
CTAGCCAACG ACGAGTTCCT CAACAGTGCC GACCGCATCG CCGACGTTGG CGACTGGTTC
TTTGTGCCTT CCTGGAAACG GACCAGCCCG CCGTCGCCGA TCCTAGGCGC CCCACTTTTC
GCCGACCAAC ACACCTGGCT GCTCCTGGTT GATGACTCCG GCCTGGGCCT TGTGCTCGCC
GAGCGCTTGA AACAGCACGG CCAGACTGTG GTGACGATCG CGCCGGGTGC GGCATTCGCC
GCCATCGACC CGGCAAGCTA CACGGTACGT CCCGCCGAAC GCGAAGACTA CATCAGCGTG
ATTAAGAAGC TCGCCCGCGA AGGCATTGAA CCCAGCCGCG TGCTACATCT GTGGCTGGCC
TCGCCCGCTG AGTCGGCCGG CGAGTCGCCG GCCGAGGTGG ACGGCATATT GGAGCGCGGC
TTTTACAGCC TGTTGGCGCT GACCCAGGCG CTGGGCAACC AGGGCGTGGA GCGCTGCGAG
GTCAACGTCG TCACCACCGG CGTTCACGCG GTCGTCGGCC AGGAGGCGGT GAACGCAACG
AAATCGACAG TGATCGGTCC GTGCAAGATT ATTCCTCAGG AGCACCCCAA CCTGACGGCG
CGCTCCATCG ATGTGATCTG GGCACCCGAT CGGCAGGGCT GCGAGGAGTT GGTGGATCGC
CTGGTGGTGG AACTGGCCAG CACCCCGACC GGCGTCGTGA TCGCGCTGCG TGGGCAGCAC
CGTTGGGTCC AGGCATATGA GCAAATCCAC CTGCCTGAGT TCAGCCATCC CCACGCCCGT
TTGCGCGATC AGGGCGTGTA TGTCATAACT GGCGGCTTGG GCGGCATTGG CCTGGCCCTG
GCCGAGTACC TGGTGGCGAG TGTGCGGGCC AAAGTGGTCT TGATCGGGCG GACTGCCCTA
CCGTCGCGTG AGCGCTGGGA TGACATCATC GCCGCTGAAG GCACCGAGAG CGGCACCGGG
CATCGCGTCC ATTGCATCCG GCAGCTTGAG GCCAGCGGCG CCGAAGTGCT GGTGCTCCAG
GCGGACGTCG CCGACGCCGG CCAGATTGCT GCGGCCATCG ACCAGGCTGT CGCCCGCTTC
GGCGCCATCA ATGGGGTATT CCACGCCGCT GGCGTGCCGG GGGTGGGCCT GATGCAGCTT
AAGACCGCCG AAGCGGCAGC TAGCGTGCTC GCGCCAAAGG TCCAGGGAAC CCTGGCGATC
GCTCAGGCCG TGCGCTCCTT GCCGCTTGAC TTCTTGGTGC TGTTCTCGTC GGTTGCCGCG
GCGGCCGGCG GCGGCCAGGG CCAGGCCGAC TACTGCGCGG CGAACATCTT CCTGGACAGC
TACGCCCAGT TGCATCATCG CGACAACGGG GTGACGATCT CGATCGGATG GGGCGAATGG
CACTGGGACG CCTGGAGCGA AGGCTTACAA GGCTTCACCG AAGAAGTACG GGCGTTCTTC
ATAGCCTCGC GCCGAACCTT CGGCATCGAC TTCGCCGACG GCATGGAAGC ATTGCGCCGC
ATCTTGGCTT ACGATCTACC GCAGATCTTT GTCAGCCCGC GTGACCTGAC GTTCCTAGTT
GAGGCTAGCC AGCGCTCGTT CGCCGCGTTC CTCAAGATGC GCGAAGACCG CGAGCAGTCG
CGCTACCCGC GGCCAGCCCT GGCTGTGGCC TATGCCGCAC CACGCAACGA CCTCGAAACG
CGCATCGCCA CCATATGGAG CGATGTGCTG GGCATCGACC CGATCGGGAT CGACGATAAC
TTTTTCGACC TCGGCGGCAA TTCGCTGCTG GGCCTCGACC TGTTTGGGCG CATGCGCAAA
GCCCTGAAGC TCGACAACTT CCCCGCATAT GTGCTGTATG AGTCGCCCAC GGTTGAGTTG
CAAGCGGCCC ACATCACCAA CCTCCAACAG CCGGCCATCG CCCACGACGA CGGCGACGAG
CACGAGGACG AGCAGCGCAG GATGCAATTG AACTACTTTG TGGATTTGGA TGAAATGGGG
GATCTATGA
 
Protein sequence
MIDRESQDFE FDTAVAIIGM AGRFPGANTL DQFWHNMTQG VQSIRFFSDE ELLAAGVDPD 
LMSQPEYVKA GTVIDNIDSF DSAFFGFTPR EAELMDPQLR LFLECSWEAF EDAAYSPETY
QGLVGVFAGS AISTYMLNNI FNNAEVFRKA GMLQVGVLNS SDSLSTWVSY KLNFRGPSVV
VQTFCSTSLV AVHMACQSLL NYECDMALAG GVAISVPHGT GYVYQEGGIV SPDGQCRTFD
ADGQGSVMSN GAGVVALKRL DQAVADGDHV YAVIRGSAVN NDGIRKVGYT APGLEGQSSV
IAEALAHAGV DPATVGYLEA HGTATALGDS IELAATIKAY KQQTDQTQYC ALGSVKPNVG
HLDRAAGVTG LIKTVLALKH REIPPSLNFE QASPEIDLPN SPFFVNTTLR PWETDGRTPR
RAGVSSFGLG GTNAHVVLQE TPLEAPSGRS YPQQLLLLSA KTDSALQTMA ANLASFLRAH
PEVDLADVAH TLQVGRTAFN HRRALVARDR DDAIAQLEAA GARGLTANQT DRDRPVAFLF
PGVGDHYAGM AADLYTHEAR FRAVVDECCT LLNPLLDQDL LAVLYPESGR GNGAPAAGLD
FRQLLAGLPA GTPAGTLHQT ELAQPAVFVV EYALVQLLAS WGIRPQALLG YSLGEYVAAT
VAGVLSLEDA LRLVALRARL IQNLPAGAML AVSLGEDDAR RYVRGDVALA AVNSPSASIL
AGPAAALEAV ARQCAADEVA CRWLETSHAF HSAMLEPARA ALTDLTCSLT LNPPAIPYVS
NVTGTWITVE EATDPGYWAR HMCQTVRFAA GAGALLEGEP ALILEVGPGQ ALASFVKQHS
ACSRERMGQI LSALPASHGR QAELSHVLET LGRLWLAGVN IDWAAFSAGE QRRRLSLPTY
PFERQRYWVD ADAHGKSGSS LANDEFLNSA DRIADVGDWF FVPSWKRTSP PSPILGAPLF
ADQHTWLLLV DDSGLGLVLA ERLKQHGQTV VTIAPGAAFA AIDPASYTVR PAEREDYISV
IKKLAREGIE PSRVLHLWLA SPAESAGESP AEVDGILERG FYSLLALTQA LGNQGVERCE
VNVVTTGVHA VVGQEAVNAT KSTVIGPCKI IPQEHPNLTA RSIDVIWAPD RQGCEELVDR
LVVELASTPT GVVIALRGQH RWVQAYEQIH LPEFSHPHAR LRDQGVYVIT GGLGGIGLAL
AEYLVASVRA KVVLIGRTAL PSRERWDDII AAEGTESGTG HRVHCIRQLE ASGAEVLVLQ
ADVADAGQIA AAIDQAVARF GAINGVFHAA GVPGVGLMQL KTAEAAASVL APKVQGTLAI
AQAVRSLPLD FLVLFSSVAA AAGGGQGQAD YCAANIFLDS YAQLHHRDNG VTISIGWGEW
HWDAWSEGLQ GFTEEVRAFF IASRRTFGID FADGMEALRR ILAYDLPQIF VSPRDLTFLV
EASQRSFAAF LKMREDREQS RYPRPALAVA YAAPRNDLET RIATIWSDVL GIDPIGIDDN
FFDLGGNSLL GLDLFGRMRK ALKLDNFPAY VLYESPTVEL QAAHITNLQQ PAIAHDDGDE
HEDEQRRMQL NYFVDLDEMG DL
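
The protein sequence can be reproduced from the gene sequence by joining the 10-base blocks and translating codon by codon. The sketch below is a minimal Python illustration, assuming the gene sequence is given in coding orientation and that translation table 11 assigns the same amino acids as the standard code (it differs mainly in the permitted start codons, which this sketch does not handle). The names `clean`, `translate` and `CODON_TABLE` are hypothetical helpers, not part of any existing library.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, and its final block.
print(translate(clean("ATGATAGATC GCGAATCACA GGATTTCGAA")))  # MIDRESQDFE
print(translate(clean("GATCTATGA")))                         # DL
```

Passing the full gene sequence through `clean` and `translate` in the same way gives a direct check against the listed protein sequence and the protein length of 1582 aa.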