Gene Information |
Locus tag | Haur_2412 |
Symbol | |
ID | 5734293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3075562 |
End bp | 3081381 |
Gene Length | 5820 bp |
Protein Length | 1939 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279553 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001545180 |
Protein GI | 159898933 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
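The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (consistent with "Is gene spliced | No"). The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3075562
END_BP = 3081381
PROTEIN_LENGTH_AA = 1939


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len_bp):
    """Number of codons minus one stop codon; assumes a complete, unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                           # 5820, matches "Gene Length"
print(expected_protein_length(gene_len))  # 1939, matches "Protein Length"
```

Under these assumptions the 5820 bp gene corresponds to 1940 codons, of which the last is the stop codon, giving the listed 1939 aa.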
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACC TACCATCTGG AACGGATAGC ATCAAGCCTG AAACAATCGA AACAACCGCA GAAATATTAA CTGAGTGGTT AGTAAATCAC TTTTGTACCT ATTTAAACGT TTCCTCGAAT GAAATTGATG TGCGCTTGCC TTTTGCGCAC TATCAAATCG ATTCAATCCA AGCGCTTAGT TTAATCAGTA AGCTTGAGCA GTTTCTGGGG CGTTCGCTCT CACCAACATT GCTTTGGGAT TATCCAAGCA TTGCAACGTT GGTTCCAGCG CTGATCGGTA CTGAACAGCC TGAAGTCGCA ACCGTGGCTG ATTCCGACCA AAATGCTGAT GCAATTGCCA TTATCGGTAT GAGTTGTCGC TTTCCTGGTG CTCGCAATTT GGCCGAATAC TGGAATTTAC TGATTAATGG TCGCGATGCA ATTACCGAAA TTCCTGCCGA GCGCTGGAAT TTAGAGGCCG TGTATAACCC CGATCGCAGT GTGCCCGGCA CGATGTACAC CCGTTGGGGT GGCTTTGTTG ATCAGCCTGA TTATTTCGAT GCAGGCTTTT TTGGCATCTC GCCCCGCGAA GCCATTCACC TTGATCCGCA ACAGCGCATG TTGTTGGAAT GCACATGGGA AGCCTTCGAA GATGCTGGTC AATCGCCGCA AGCATTGGCT GGCAGCAAAA CTGGGGTCTT TATTGCTTCA GTCAGCGAAG ATTATGGCCG AATTTTATTC AGCCACCCCG AAATTATCGA TGCCTACACT GGCCCAGGTA CAGCCCACAG CATTTTGGCC AATCGGCTTT CGTATGTGCT GAATTTGCAA GGCCCAAGTA TCAACATCAA CACAGCTTGC TCGGGATCAC TCGTCGCAAT TCATATGGCC TGCCAAGCAT TGCAAACGGG CGAAGCCGAT CTGGTGGTTG CTGGCGGGGT CAATGCCAGC TTGTTGCCTG ATGGCAATCT ATTTTTCTCC AAAGCTGGAG CTTTATCGCC CGATGGCCGC TGCAAAACCT TCGATGCTCG CGCCAATGGA ATTGTGCGTA GCGACGGCGC AGGCATTGTA ATTCTCAAGC CTTTGCAAAA AGCCTTGGCC GATGGCAACC CAATTTATGC AGTCATTCGT GGCAGCGCCG TCAATAGCGA TGGCCGTACC AACGGGATTA TGGCACCCAA CCGTCAATCG CAAGAGGTTG TGCTGCAAGA AGCCTATCGT CGTGCTGGAG TTAACCCCGC AGCAGTGCAA TACATTGAAG CTCATGGCAC GGGCACAAGC CTTGGCGATG TGATTGAGGC CCAAGCGCTT GGCGCAGTTT TGGCAGTCGG ACGCTCAGCC CAACAACCCT GTGCGATTGG CTCGGTCAAA ACCAATATCG GACACAGTGA ATCTGCGGCG GGGATTGCCG GGGTGATTAA GGTCGCCCTG GCAATGAAGC ATCAGCTCTT GCCTGCAAGT TTACACTTTG AAACCCCCAA TCCCATGATT CCCTTTGAAG AACTCCGCTT GCAGGTTCAA GCCCAACGTG GGCCATGGCC CAATGCTAGT GGCTCGCTCT TGGCAGGGGT TAGTGGCTTT GGTTTTGGGG GTACAAATGC CCACGTGGTG CTCGAATCGG CTCCCATTCG CGAAACCCAA AGCCCAAGCA GCAACGATGC TTGGCCATTG TTATTGCCAT TATCAGCCCA AAGCGAGCCA GCTTTACGGC AACTTGCAGC CCGTTATGCC AGCCAGATTG CCGCCGCCAG CCCCAGCGAA GTTGCCAATA TTTGCTATAG CGCTAGCGTT GGGCGCAGCC AGCTTGATCA TCGCATTGCC 
GTATATGCAG CCAGCCCAGC TTTGCTAAGC GAACAGCTCA ACGATTTCGC CGAAGGCCGC TCCGCCACTG GTTTGATCAC CCAAGATCGT TCACAAGCTC ACAAATTGGT TTGGGTATTT TCGGGGCAAG GATCGCATTG GGTTGGCATG GGTCGCGGTC TGCTGGCCCA ACAACCAATC TTCCGCCAAA CCCTCGAAGC CTGCGATCAA GCTTTTCGTA ACTACGCTAG TTGGTCGTTG ATCGAGGCCT TGCGCGATGA TCAAGCTGCT GAGCAGATCA ACCAAACTGA TCGTGCTCAA CCGCTGATTT TTGCCTTGCA AGTCAGTCTC GCAGCGCTTT GGCGATCATG GGGCATTACC CCAGCGGCGA TTGTTGGGCA TAGTTTGGGC GAGGTCGCTG CGGCCTATGT TAGCGGCGTG CTGACGCTCG ATGAAGCGGT GCAAGTGGTT TATCATCGCA GCCGTTTGAT GAAGCAGGTT GCGGGCAAAG GCAAAACTGC CGCCGTCGAA TTGACCTTCG AACAAGCCCG CTTGCTGTTG GTTGGCCGCG AACAACAGGT AGCAATTGCT GGCATCAATA GCCCAACCTC GTGTATTTTG GCGGGCGATC CGAGCACGTT AGAGCAATTA GTTGCCTCGT TGCAGCATAA CGATGTGTTT GCTCGCTTGG TGCGCGGGGT TGATATTGCC TTTCATAGCC CACAGATGAA GCCACTAGTT CCTGAATTAA ACGCCGCTTT GGCCCAGCTC AAACCACAAG CACCTATAAT TCCCTTGGTT TCAACCGTGA CTGGCACATT TGCCGAGCAA GCACTCTACA GCGAGGGCTA TTGGGGTCGT AATCTGCGTG AACCATTCTT GTTTGCCAAC GCGATCAAAA ATTTGCTCGA CAAGGGCTAC GACACATTTT TGGAAGTCAG CCCACACCCG GTTTTGGGCG AATCGATGTT GCGCAGCATC CAACATTTCA AGCAGTCAGC CCAAGTGTTT AGCTCGTTAC GCCGCGATCA AGCTGAATTG GATTTACTGT TTGAGACGCT TGGGCGCTTA TTTGTGGCTG GTTATAGCCC CGATTGGCAG CAAGTTTATC CCGAGCCACG CCAGCGCAGC ACCTTGCCCA GCTATCCTTG GCAACGTGAA CGCTACTGGT TCGATCAGCT TTTGCCAGCA AGCAGCAATC AAACGTCGGC CCGTGGTGGG TTTGTGCCAG CGCTCTTAGC GGGCAACCTT CAGGCTACCA CCAGCGCTCA ACACCCGTTA CTGGGTCTTT CGATCGCTTC AGCAGTCAAT CAAAGTCAGT TTTGGCAAAC CAATTTAGCC GCCAACTATC CGGCTTATTT GGCCGACCAT GTGGTGCAAG AACAGGTGTT GTTGCCTGGC GCGGCCTATG TTGAGATGAT CGTTTCCGCC CTGCGGAGTC GAGGGCAACA CCATGTCACA ATCAATAACT TGGTGTTCAA ACAGCCATTG ATCTTGCCAA ACCAAGGCCA ACGCACCGTC CAACTGGTGT GCAACGCTGA TGATAATGGC GTTAATCTCC AAATTTTGAG CCAAGCCGCC GAGCCAGATA GCCCTTGGGA ATTGCATGCA ACCGCCAATG CGCTTGATAA TGCCACACCT GTAAATCATT CAGCCTATCT TGCCCTCGAC GAATTGCAAG CCCGCTGTAC CGAAAGCATC GCAGTGAGTG AACATTATGC TCGCATGCAA GCAGTACAAT TGGTGTATGG CCCAGCGTTT CAATCGCTCA GCCAAATTTG GCGCGGCGAG GCTGAGGCCT TGGCCCAATT GCAATTAGCA CCAGCAATCG 
GCCAACTTGC CCAGCACGAT CAGCTGCATC CGGCCTTGCT TGATGCCACT TTCCAGCTCG TTGCGGTGAT GCTTGCCCAA CACACCAACG ATCAAACCTA CTTGCCAATT GCGCTTGAGC GCTTGAATGT GCTTGATCGA ATTCCCGCCG AAGCTTGGTG TCATGCGGCG TTGCGCTCGG CTCCTGCTGA TGAAACCCTG ATGTATGAAG CAGATTTGGT GATTGCTGAT GCCCAAGGGC GAGTTGTGGT AGAAATCGCT GGCTTGAAGT TATTCCAAGT GGCTGCCGCG CGGAATGCCA ACCAACCCCA ACAAGCGTTG TATGACTATC GCTGGCAACC GATCGAAATC CAAACTATTG AGCATCCCGC TGAACGTTGG CTAATTTTGG CCAATACCCA CGATCAATTC GCCAAACAGT TGAGCAACAG CTTGGCAGCC CATGGCCAGC AGGTCGATTG CCTCGAACAA TCAATCGAAG CCAATGGCTT AGGCGATTGG CTCAAAACCC AGTTACAAGC CAATTATCAA CAAATCGTTT GTTTATGGCC ACTGGCCGCT AGCAATGATC ACGCGCCAGT CACAAGCGCA ACCCAGCAAA GCTTGGCAAT GTTAACCCTG CTGCAAACAC TCAGCGATTC AGGCAGCGCC ACGCCACGCT TGTGGTGTGT CACGCGCGGA GCACAGGCCG TGCTCGATCA CGAAGTTATT AATCTGGCAC AAGCACCGCT TTGGGGAATG GTGCGCAGCG CCGCCTTAGA ACACCCCGAA CTAGCGCCAA GCTTGATCGA CCTTGCACCA ATGGCTGAAA GCAACGAAGC AAGCCAACTG GCCAAGACAC TGTTGCAACG CGCCAACGAG CATCAACAGG CCTTGCGCAA TCAACAGCAG TTGGTAGCTC GCTTGCAACA ACGCCCAATC ACCAAGTCAA CCACGCTCAA GTTGAGCAAT CAAGCGGCCT ATCTGATTAC TGGTGGTAGC GGTGGTTTGG GCTTGGAGAT TGCCCATTGG ATGCTGGCCA AGGGTGCGAG CAACTTGATT ATTCTTGGTC GCCGACCTTT GCAACCAAGC CATAACGCTG TATCCGAGCA GCAATCGCAA CTTGTGAATG CACTCAGCCA ACTCGAACAA GCTGGGGCGA ACCTGCGGTA TGCCGCAATC AACGTAGCCG ATCAGGCGGC CTTGGCCGAA TTTTTGCAGC AATATCGCGC CGAAACTGGC CTTGCCATTC GTGGGATTGT GCATGCTGCG GGTGTGCTTG ATGATCAAAT GCTCTATCGC ATGCAGCCAA GTGCCCTGAC CAGCGTTTTT GCGCCCAAAG TTGCTGGCGC ATGGGCCTTG CACGAAGTTT TCAGCCAAGA ACCACTTGAT TTTATGATCT TCTGTTCATC GCTGGCGGCC AGCATTGGCT CGGTTGGCCA AGCGCATTAT GCCGCCGCCA ACAGCTTTAT GGATAGCCTG GCAGCCTATC GTCGCAGCCA AGGCTTGGTT GGGCTAAGCA TTAATTGGGG GCCATGGGCC GAGGTGGGAA TGGCCGCCAA ACTTAATCCA CAACTTTTTG AAGCCCATGG CGTGCAACTA CTGCAACCCC AACAAGCCTT AGTTGCAATG GAGCAGCTGA TCAACGATCA AGCAATTCAG ACGACGATTG CTGAGATCGA TTGGGCAACG TGGCTCAAAA ATAATCAGGT TGTCGCAACC CTACCCTTCT TTGCGGCACT TGCGCCGTCA ACCACCATTG CTCAAACAGC CAGCAATTTA ACCCAAGAGC ATGAATTTCG CCAACGGGTT TTACAGACCC AGCCCAGCGA 
ACGTCAAGCG TTGATAACTC AACAACTCAA ACAATTGATT GCCAAGGTTA TGCAGCTTGA TCCATCAAAA CTGGACAGCC AACTAGCACT CCATACCCTC GGCCTCGACT CGATTATGGC AATTGAACTC AAAACCAGTA TTAGCCAAAA TCTTGGCGTA ACCTTGTCGG TTGCCTATCT GATTCAGGGT CCCAGCATTG ATGAAATTGT TGCCAATGTC AATCAACAAC TATCCCTAGA ACTATCATCG GAGATGTTTG CATCGCCTGA AACCCGCGAT GATGCGCTTC AGGTGCTACT TGAACAAGTA CAGCAAAGCG ATCACGACCA GATCGCTCAA ATACTTGCTG AATTAGAACA ACTCTCCACT GATGAGGCTA AATCTCGTCT GGTTGGGTGA
|
Protein sequence | MNNLPSGTDS IKPETIETTA EILTEWLVNH FCTYLNVSSN EIDVRLPFAH YQIDSIQALS LISKLEQFLG RSLSPTLLWD YPSIATLVPA LIGTEQPEVA TVADSDQNAD AIAIIGMSCR FPGARNLAEY WNLLINGRDA ITEIPAERWN LEAVYNPDRS VPGTMYTRWG GFVDQPDYFD AGFFGISPRE AIHLDPQQRM LLECTWEAFE DAGQSPQALA GSKTGVFIAS VSEDYGRILF SHPEIIDAYT GPGTAHSILA NRLSYVLNLQ GPSININTAC SGSLVAIHMA CQALQTGEAD LVVAGGVNAS LLPDGNLFFS KAGALSPDGR CKTFDARANG IVRSDGAGIV ILKPLQKALA DGNPIYAVIR GSAVNSDGRT NGIMAPNRQS QEVVLQEAYR RAGVNPAAVQ YIEAHGTGTS LGDVIEAQAL GAVLAVGRSA QQPCAIGSVK TNIGHSESAA GIAGVIKVAL AMKHQLLPAS LHFETPNPMI PFEELRLQVQ AQRGPWPNAS GSLLAGVSGF GFGGTNAHVV LESAPIRETQ SPSSNDAWPL LLPLSAQSEP ALRQLAARYA SQIAAASPSE VANICYSASV GRSQLDHRIA VYAASPALLS EQLNDFAEGR SATGLITQDR SQAHKLVWVF SGQGSHWVGM GRGLLAQQPI FRQTLEACDQ AFRNYASWSL IEALRDDQAA EQINQTDRAQ PLIFALQVSL AALWRSWGIT PAAIVGHSLG EVAAAYVSGV LTLDEAVQVV YHRSRLMKQV AGKGKTAAVE LTFEQARLLL VGREQQVAIA GINSPTSCIL AGDPSTLEQL VASLQHNDVF ARLVRGVDIA FHSPQMKPLV PELNAALAQL KPQAPIIPLV STVTGTFAEQ ALYSEGYWGR NLREPFLFAN AIKNLLDKGY DTFLEVSPHP VLGESMLRSI QHFKQSAQVF SSLRRDQAEL DLLFETLGRL FVAGYSPDWQ QVYPEPRQRS TLPSYPWQRE RYWFDQLLPA SSNQTSARGG FVPALLAGNL QATTSAQHPL LGLSIASAVN QSQFWQTNLA ANYPAYLADH VVQEQVLLPG AAYVEMIVSA LRSRGQHHVT INNLVFKQPL ILPNQGQRTV QLVCNADDNG VNLQILSQAA EPDSPWELHA TANALDNATP VNHSAYLALD ELQARCTESI AVSEHYARMQ AVQLVYGPAF QSLSQIWRGE AEALAQLQLA PAIGQLAQHD QLHPALLDAT FQLVAVMLAQ HTNDQTYLPI ALERLNVLDR IPAEAWCHAA LRSAPADETL MYEADLVIAD AQGRVVVEIA GLKLFQVAAA RNANQPQQAL YDYRWQPIEI QTIEHPAERW LILANTHDQF AKQLSNSLAA HGQQVDCLEQ SIEANGLGDW LKTQLQANYQ QIVCLWPLAA SNDHAPVTSA TQQSLAMLTL LQTLSDSGSA TPRLWCVTRG AQAVLDHEVI NLAQAPLWGM VRSAALEHPE LAPSLIDLAP MAESNEASQL AKTLLQRANE HQQALRNQQQ LVARLQQRPI TKSTTLKLSN QAAYLITGGS GGLGLEIAHW MLAKGASNLI ILGRRPLQPS HNAVSEQQSQ LVNALSQLEQ AGANLRYAAI NVADQAALAE FLQQYRAETG LAIRGIVHAA GVLDDQMLYR MQPSALTSVF APKVAGAWAL HEVFSQEPLD FMIFCSSLAA SIGSVGQAHY AAANSFMDSL AAYRRSQGLV GLSINWGPWA EVGMAAKLNP QLFEAHGVQL LQPQQALVAM EQLINDQAIQ TTIAEIDWAT WLKNNQVVAT LPFFAALAPS TTIAQTASNL TQEHEFRQRV 
LQTQPSERQA LITQQLKQLI AKVMQLDPSK LDSQLALHTL GLDSIMAIEL KTSISQNLGV TLSVAYLIQG PSIDEIVANV NQQLSLELSS EMFASPETRD DALQVLLEQV QQSDHDQIAQ ILAELEQLST DEAKSRLVG
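The sequences above are grouped in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to do that and to recompute simple properties such as GC content. It assumes GC content means the fraction of G and C bases over the full gene sequence (the record does not state how the 52% figure was derived), and it uses the stop codons of translation table 11 (TAA, TAG, TGA). All function names are hypothetical.

```python
def clean_sequence(raw):
    """Drop the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq, stop_codons=("TAA", "TAG", "TGA")):
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in stop_codons


# Only the first three blocks of the gene sequence, as a short demonstration;
# paste the full "Gene sequence" field to recompute the record-level values.
fragment = clean_sequence("ATGAACAACC TACCATCTGG AACGGATAGC")
print(len(fragment), round(gc_fraction(fragment), 3))
```

Applied to the full gene sequence, `looks_like_complete_cds` would be expected to return `True`, since the sequence is 5820 bp long and ends in TGA.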