Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3962 |
Symbol | |
ID | 5735823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4990434 |
End bp | 4995182 |
Gene Length | 4749 bp |
Protein Length | 1582 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641281112 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001546722 |
Protein GI | 159900475 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATC GCGAATCACA GGATTTCGAA TTCGATACCG CTGTTGCCAT CATCGGCATG GCAGGGCGTT TCCCTGGCGC CAACACGCTG GATCAGTTCT GGCACAACAT GACCCAGGGC GTGCAATCGA TCCGGTTTTT TTCCGATGAA GAACTGCTGG CCGCCGGGGT GGACCCGGAT CTGATGAGCC AACCCGAGTA TGTGAAAGCC GGGACGGTCA TCGACAATAT CGATTCCTTT GACTCCGCGT TCTTTGGCTT TACGCCGCGC GAGGCCGAGT TGATGGACCC GCAGTTGCGC TTGTTTCTAG AATGCTCTTG GGAGGCGTTC GAGGACGCCG CTTATAGCCC GGAAACCTAC CAAGGTCTGG TTGGGGTGTT CGCCGGATCG GCCATCTCGA CCTATATGTT GAATAATATC TTTAACAACG CCGAGGTGTT CCGCAAAGCG GGCATGCTCC AGGTCGGCGT CCTGAACTCG TCGGACTCGC TTTCGACCTG GGTCTCCTAC AAGCTTAACT TCCGCGGGCC GAGCGTGGTC GTACAAACTT TTTGCTCGAC CTCGCTGGTG GCAGTCCACA TGGCCTGCCA GAGCCTGCTC AACTACGAGT GCGATATGGC GCTGGCTGGC GGTGTTGCGA TCTCGGTACC CCATGGAACC GGCTATGTGT ACCAGGAGGG CGGCATTGTT TCGCCCGACG GCCAGTGCCG CACCTTCGAC GCCGACGGCC AGGGCAGCGT GATGAGCAAC GGCGCCGGCG TCGTCGCGCT CAAGCGCCTC GATCAGGCGG TCGCAGATGG AGACCACGTG TACGCCGTCA TCCGCGGCTC GGCGGTCAAC AACGACGGAA TCCGCAAAGT CGGCTACACC GCCCCTGGCC TAGAAGGGCA GTCGTCGGTG ATCGCCGAAG CGCTGGCCCA CGCTGGCGTC GACCCTGCGA CGGTGGGCTA CCTTGAGGCC CACGGCACCG CCACTGCGCT GGGCGACTCG ATCGAGCTCG CGGCGACGAT CAAGGCCTAC AAGCAGCAGA CCGACCAGAC CCAGTACTGT GCGCTCGGTT CGGTCAAGCC CAACGTTGGT CATCTCGACC GCGCCGCAGG CGTGACCGGC CTGATCAAAA CCGTTCTGGC GCTGAAGCAC CGAGAGATTC CACCGAGTCT TAATTTCGAG CAGGCCAGCC CCGAGATTGA TTTGCCCAAT AGCCCGTTTT TTGTCAATAC GACGTTGCGG CCCTGGGAGA CCGACGGTCG GACGCCGCGC CGCGCCGGCG TCAGCTCGTT CGGCCTGGGC GGCACCAATG CCCACGTCGT GCTCCAGGAA ACGCCGCTCG AGGCTCCGTC CGGTCGATCA TACCCGCAGC AGCTGTTGCT GCTGTCGGCC AAAACCGACT CCGCCCTGCA AACAATGGCC GCCAACTTGG CCAGCTTCTT GCGTGCCCAT CCCGAGGTGG ACTTGGCCGA CGTTGCCCAT ACCCTCCAGG TTGGGCGCAC CGCCTTCAAC CACCGGCGCG CCCTGGTGGC CCGCGACCGT GACGATGCCA TCGCGCAGTT GGAGGCGGCT GGCGCCCGCG GGCTGACCGC CAACCAGACC GACCGTGATC GGCCGGTGGC CTTCCTGTTC CCGGGCGTCG GCGACCACTA TGCAGGGATG GCCGCGGACC TGTACACTCA CGAAGCGCGC TTTCGCGCGG TGGTTGACGA GTGCTGCACG CTGCTGAATC CGCTGCTGGA TCAGGATTTG CTGGCGGTGC TGTATCCAGA GTCCGGGCGC GGCAACGGGG CGCCGGCGGC CGGCCTGGAT TTCCGCCAGT TGCTGGCAGG CCTGCCGGCC GGAACGCCTG CCGGGACGCT GCACCAGACG GAGCTGGCCC AGCCGGCGGT GTTTGTAGTC GAGTACGCGC TGGTGCAATT GCTGGCGAGC TGGGGCATCC GGCCGCAGGC GCTGCTGGGC TACAGCCTGG GCGAATACGT CGCGGCAACG GTCGCCGGCG TGCTGAGCCT GGAGGACGCG CTGCGTCTGG TGGCCCTCCG CGCCAGGCTT ATCCAGAATT TGCCTGCCGG CGCCATGCTG GCGGTCAGCC TAGGGGAGGA CGATGCGCGG CGCTACGTCC GAGGCGATGT CGCCTTGGCA GCGGTTAACA GCCCGAGCGC CTCCATCCTG GCCGGCCCGG CGGCGGCTCT TGAGGCCGTG GCCAGGCAAT GCGCCGCCGA TGAAGTTGCC TGCCGCTGGC TGGAGACGAG CCATGCCTTC CATTCGGCGA TGCTGGAGCC GGCCCGCGCG GCCCTGACCG ATCTGACCTG CTCGCTGACC CTCAACCCGC CGGCCATACC GTATGTCTCC AACGTCACCG GCACCTGGAT TACCGTCGAA GAGGCCACCG ACCCGGGCTA CTGGGCGCGG CACATGTGTC AGACCGTGCG GTTTGCCGCC GGAGCTGGCG CGCTGCTGGA GGGGGAGCCA GCGTTGATCC TGGAAGTCGG CCCTGGTCAG GCGCTGGCGT CGTTTGTCAA GCAGCATTCA GCCTGCTCCC GCGAGCGTAT GGGCCAGATC CTGAGCGCGC TGCCGGCGTC CCACGGTCGC CAGGCCGAGT TGTCCCACGT GCTGGAAACC CTGGGCCGGC TGTGGCTTGC CGGGGTGAAC ATTGACTGGG CCGCGTTCTC CGCGGGCGAA CAACGGCGGC GGCTTTCGCT GCCGACCTAT CCGTTTGAGC GCCAGCGCTA CTGGGTGGAC GCCGATGCGC ACGGTAAGTC GGGCAGTTCG CTAGCCAACG ACGAGTTCCT CAACAGTGCC GACCGCATCG CCGACGTTGG CGACTGGTTC TTTGTGCCTT CCTGGAAACG GACCAGCCCG CCGTCGCCGA TCCTAGGCGC CCCACTTTTC GCCGACCAAC ACACCTGGCT GCTCCTGGTT GATGACTCCG GCCTGGGCCT TGTGCTCGCC GAGCGCTTGA AACAGCACGG CCAGACTGTG GTGACGATCG CGCCGGGTGC GGCATTCGCC GCCATCGACC CGGCAAGCTA CACGGTACGT CCCGCCGAAC GCGAAGACTA CATCAGCGTG ATTAAGAAGC TCGCCCGCGA AGGCATTGAA CCCAGCCGCG TGCTACATCT GTGGCTGGCC TCGCCCGCTG AGTCGGCCGG CGAGTCGCCG GCCGAGGTGG ACGGCATATT GGAGCGCGGC TTTTACAGCC TGTTGGCGCT GACCCAGGCG CTGGGCAACC AGGGCGTGGA GCGCTGCGAG GTCAACGTCG TCACCACCGG CGTTCACGCG GTCGTCGGCC AGGAGGCGGT GAACGCAACG AAATCGACAG TGATCGGTCC GTGCAAGATT ATTCCTCAGG AGCACCCCAA CCTGACGGCG CGCTCCATCG ATGTGATCTG GGCACCCGAT CGGCAGGGCT GCGAGGAGTT GGTGGATCGC CTGGTGGTGG AACTGGCCAG CACCCCGACC GGCGTCGTGA TCGCGCTGCG TGGGCAGCAC CGTTGGGTCC AGGCATATGA GCAAATCCAC CTGCCTGAGT TCAGCCATCC CCACGCCCGT TTGCGCGATC AGGGCGTGTA TGTCATAACT GGCGGCTTGG GCGGCATTGG CCTGGCCCTG GCCGAGTACC TGGTGGCGAG TGTGCGGGCC AAAGTGGTCT TGATCGGGCG GACTGCCCTA CCGTCGCGTG AGCGCTGGGA TGACATCATC GCCGCTGAAG GCACCGAGAG CGGCACCGGG CATCGCGTCC ATTGCATCCG GCAGCTTGAG GCCAGCGGCG CCGAAGTGCT GGTGCTCCAG GCGGACGTCG CCGACGCCGG CCAGATTGCT GCGGCCATCG ACCAGGCTGT CGCCCGCTTC GGCGCCATCA ATGGGGTATT CCACGCCGCT GGCGTGCCGG GGGTGGGCCT GATGCAGCTT AAGACCGCCG AAGCGGCAGC TAGCGTGCTC GCGCCAAAGG TCCAGGGAAC CCTGGCGATC GCTCAGGCCG TGCGCTCCTT GCCGCTTGAC TTCTTGGTGC TGTTCTCGTC GGTTGCCGCG GCGGCCGGCG GCGGCCAGGG CCAGGCCGAC TACTGCGCGG CGAACATCTT CCTGGACAGC TACGCCCAGT TGCATCATCG CGACAACGGG GTGACGATCT CGATCGGATG GGGCGAATGG CACTGGGACG CCTGGAGCGA AGGCTTACAA GGCTTCACCG AAGAAGTACG GGCGTTCTTC ATAGCCTCGC GCCGAACCTT CGGCATCGAC TTCGCCGACG GCATGGAAGC ATTGCGCCGC ATCTTGGCTT ACGATCTACC GCAGATCTTT GTCAGCCCGC GTGACCTGAC GTTCCTAGTT GAGGCTAGCC AGCGCTCGTT CGCCGCGTTC CTCAAGATGC GCGAAGACCG CGAGCAGTCG CGCTACCCGC GGCCAGCCCT GGCTGTGGCC TATGCCGCAC CACGCAACGA CCTCGAAACG CGCATCGCCA CCATATGGAG CGATGTGCTG GGCATCGACC CGATCGGGAT CGACGATAAC TTTTTCGACC TCGGCGGCAA TTCGCTGCTG GGCCTCGACC TGTTTGGGCG CATGCGCAAA GCCCTGAAGC TCGACAACTT CCCCGCATAT GTGCTGTATG AGTCGCCCAC GGTTGAGTTG CAAGCGGCCC ACATCACCAA CCTCCAACAG CCGGCCATCG CCCACGACGA CGGCGACGAG CACGAGGACG AGCAGCGCAG GATGCAATTG AACTACTTTG TGGATTTGGA TGAAATGGGG GATCTATGA
|
Protein sequence | MIDRESQDFE FDTAVAIIGM AGRFPGANTL DQFWHNMTQG VQSIRFFSDE ELLAAGVDPD LMSQPEYVKA GTVIDNIDSF DSAFFGFTPR EAELMDPQLR LFLECSWEAF EDAAYSPETY QGLVGVFAGS AISTYMLNNI FNNAEVFRKA GMLQVGVLNS SDSLSTWVSY KLNFRGPSVV VQTFCSTSLV AVHMACQSLL NYECDMALAG GVAISVPHGT GYVYQEGGIV SPDGQCRTFD ADGQGSVMSN GAGVVALKRL DQAVADGDHV YAVIRGSAVN NDGIRKVGYT APGLEGQSSV IAEALAHAGV DPATVGYLEA HGTATALGDS IELAATIKAY KQQTDQTQYC ALGSVKPNVG HLDRAAGVTG LIKTVLALKH REIPPSLNFE QASPEIDLPN SPFFVNTTLR PWETDGRTPR RAGVSSFGLG GTNAHVVLQE TPLEAPSGRS YPQQLLLLSA KTDSALQTMA ANLASFLRAH PEVDLADVAH TLQVGRTAFN HRRALVARDR DDAIAQLEAA GARGLTANQT DRDRPVAFLF PGVGDHYAGM AADLYTHEAR FRAVVDECCT LLNPLLDQDL LAVLYPESGR GNGAPAAGLD FRQLLAGLPA GTPAGTLHQT ELAQPAVFVV EYALVQLLAS WGIRPQALLG YSLGEYVAAT VAGVLSLEDA LRLVALRARL IQNLPAGAML AVSLGEDDAR RYVRGDVALA AVNSPSASIL AGPAAALEAV ARQCAADEVA CRWLETSHAF HSAMLEPARA ALTDLTCSLT LNPPAIPYVS NVTGTWITVE EATDPGYWAR HMCQTVRFAA GAGALLEGEP ALILEVGPGQ ALASFVKQHS ACSRERMGQI LSALPASHGR QAELSHVLET LGRLWLAGVN IDWAAFSAGE QRRRLSLPTY PFERQRYWVD ADAHGKSGSS LANDEFLNSA DRIADVGDWF FVPSWKRTSP PSPILGAPLF ADQHTWLLLV DDSGLGLVLA ERLKQHGQTV VTIAPGAAFA AIDPASYTVR PAEREDYISV IKKLAREGIE PSRVLHLWLA SPAESAGESP AEVDGILERG FYSLLALTQA LGNQGVERCE VNVVTTGVHA VVGQEAVNAT KSTVIGPCKI IPQEHPNLTA RSIDVIWAPD RQGCEELVDR LVVELASTPT GVVIALRGQH RWVQAYEQIH LPEFSHPHAR LRDQGVYVIT GGLGGIGLAL AEYLVASVRA KVVLIGRTAL PSRERWDDII AAEGTESGTG HRVHCIRQLE ASGAEVLVLQ ADVADAGQIA AAIDQAVARF GAINGVFHAA GVPGVGLMQL KTAEAAASVL APKVQGTLAI AQAVRSLPLD FLVLFSSVAA AAGGGQGQAD YCAANIFLDS YAQLHHRDNG VTISIGWGEW HWDAWSEGLQ GFTEEVRAFF IASRRTFGID FADGMEALRR ILAYDLPQIF VSPRDLTFLV EASQRSFAAF LKMREDREQS RYPRPALAVA YAAPRNDLET RIATIWSDVL GIDPIGIDDN FFDLGGNSLL GLDLFGRMRK ALKLDNFPAY VLYESPTVEL QAAHITNLQQ PAIAHDDGDE HEDEQRRMQL NYFVDLDEMG DL
|
| |