Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2416 |
Symbol | |
ID | 5734297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3095212 |
End bp | 3099120 |
Gene Length | 3909 bp |
Protein Length | 1302 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279557 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001545184 |
Protein GI | 159898937 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATG ATGAAACCGA TGGCAATTTT GATATTGCTG TGGTAGGTTT GGCGGGGCGC TGGGCTGGCG CTCCCGATAT TGATCACTTT TGGCAGCAGC TTTGTGCTGG AGCCGAGGGC ATTAGCTTTT TTGATGACGA GCAATTGCTA GCCGCAGGCG TAACGCCTGA GCAACTAGCC CATCCACGCT ATGTCAAAGC TGGCTCGGTG CTCGAAGGGG TCGAGCTATT CGATGCAGCC TTTTTTGGCT ATTCGCCACG CGAGGCCGAA TTGCTCGATC CGCAGCAGCG GATTTTTCTG GAATGTGCTT GGCAAGCCTT GGAACATGCA GGCTACAACG CCTCAACCTA TCCCGGCTTA ATCGGCGTAT TTGCTGGTTC AAGCCTCAAT ACCTACTTGC TGCGTAACCT AGCCAGCCAA CAAGGCCGAG GCACAATTCC CGATCTGTTT CAACTGATTA TGGGCAACGA CAAAGACTTT TTGCCCACTC GAGTCAGCTA TAAGTTGAAT TTGCGCGGGC CAAGTTTTAG CGTGCAAAGT GCCTGCTCAA CCTCATTGGT CGCAACCCAT TTGGCCTGCC AAAGCCTGTT GGGCTATCAA TGCGATATCG CGCTGGCTGG CGGAATTTCG ATCACCGTGC CGCAACATCA AGGCTATCTG GCCCAAGAAG GCGGCATTCT CGCCCCAGAC GGCCATTGCC GCACATTCGA TGCCCAAGCC CAAGGCACAC TCAACGGCAA CGGTGCAGGT ATCGTGGTTT TGAAACGGCT CGACGATGCC TTGGCTGATG GCGATTCGAT CTATGCGGTA ATCAAAGGCT CAGCCGTCAA CAACGATGGT GCGCTCAAAA TTGGCTACAC GGCACCGAGC ATTGATGGGC AGGCCGCTGT GATTCAGGCC GCTCAAGCGG TGGCTGATGT TGATCCAGTA ACGATTTCCT ATATCGAAGC CCATGGAACG GCCACTGCCC TCGGCGATCC AATCGAAGTT GCCGCTTTAA CCAAGGCCTT CCGTCAACAA ACTGATAAAA CGCAGTTTTG TATGCTTGGC TCGGTCAAAT CCAATTTTGG GCATCTTGAT ACTGCGGCGG GCGTAACCAG CTTGATTAAA ACCGTGCTGG CCCTGCATCA CAATAGAATT CCCGCCAGCT TGCATTTTCA AACCCCCAAT CCGCAACTTG AGCTAGAAAG CTCACCATTT TATGTCAACA CTAAATTAAG TGATTGGCCA AGCGATCAGC CAATTCGGCG GGCGGGGGTC AGTTCATTTG GGATTGGCGG CACGAATGCG CATATTGTGC TCGAAGAAGC GCCCATGCTC GAACCAACCG AGGAAACTGA CGCTTGGCAC ATGCTGGTGA TTTCGGGGCG TAATCGGGCT ACCTTGAATG CGGCCACCAA AAATCTGGCC GAGCACCTTG AGGCTAATCC CCAATTGGCC TTGGCTGATG TTGCCTACAC CTTACAAGTT GGCCGCCAAG CCTTTAATCA TCGGCGGGTG TTGCTGTGCC GTTCGCTGGA TGAAGCCAGC CAAATTTTGC GCCAACGCGA TAAGCGCATG ATCAGCGGCC AAGTCAGCGC GGCTACGCCT GCGGTGGTGT TTATGTTTCC GGGCGGCGGC GTACAATATC CCAGCATGGG CCAAGAACTC TATCAAACTC AGCCGATCTT CCGTGCGGCG GTTGAACGCT GTTTGGCCAT GCTTAAACCT GAAATCAGCG CCAATCTACG CCAGTTGGTC TATAGCGAAA ACGCCAAAGT TCATGAGCAA CAATTGGCGA CAATGCTCTA CAGTTTGTTG GCAATTTTCA TCAGTGAGTA TGCGCTTGCT CAATTATGGC TGGCATGGGG CATTCAGCCT GTCGCCTTGA TTGGCCATAG TTTGGGTGAA TATACCGCGG CCCATTTGGC TGGCTCGATC TCACTTGAGG CGGCTTTGCG CTTAGTTGAG CTACGTGGCC GCTTGATGGA TCAGCTTGAA GATGGCGCGA TGATCAACAT TGCCCTTGCT GAAGCTGAGG TTTTGCCGTT GCTCGGCGAA CAACTTTCGC TAGCGGCAGT CAATGGGCCT GAACATTCGG TCGTGGCTGG CTCGATTGCA GCAATTCAAG CCCTTGAGTT TGAGCTAGAA CAGCGGGCGA TCAAATATCG ACGCTTGCCA ATTCGCGTGG CAGCCCATTC CAGTTTGCTC AAGCCAATTG TGGCCGAATT TGCGACATTT GCCCAGACGA TCAGCATGCA ACCAGCCACA ATTCCCTATA TTTCCAATGT AACTGGCGGT TGGATGAGCC ATGAACAATG GTCAAACCCG AACTATTGGA CTGAGCATTT ACAATCGACC GTGCGATTTA GCGCTGGGAT GAGCCAACTT TTGCAAAATC CAGCTCATCT ATTTCTCGAA GTTGGGCCTG GCCAAACGCT CACAACCTTA ACTCGCGCTC AAGCCAACTT TGGAGCCGAA CGGGTGGTAG CGCAGTCGAT GCGCCATCCC CAAGATCAAC AAACTGACAC CCAATGTTTG TTGACTGCGG TCGGGCGGTT ATGGCTGGCT GGCGTGGCGA TCGATTGGGC CAAACTAAGC GCCAGCAAGC GTCGCCGCGT GGCCCTACCA ACTTATCCCT TCGAACGCAA ACGCTACTGG GTTGAGCAAT TTATCAATAA TGAAACCCAA GGCCCGACCT TGCTCGAAGC GGCGCAAGGT GGTTATAGCT TGCCCGAAAG CAGCGAGCCA GCCGAGGCCA GCCCTGGCTA CGAACGGCCT AATCTCACGA CTGAATATGT CGCGCCCAGC AATAACCTTG AGCATATGAT CACCGCGCTG TGGGGTGCTG TCTTGGGCGT ACCGCTGATT GGTATTCACG ACAATTTCTT TGAGTTGGGC GGCGATTCGC TGCTAGCCTT ACAAGTTGCG ACCCATCTCA CCGAAAAAGT GCATACCACA ATCGGCGTGC GCAGTTTATT CGAAGCCCCA ACGATCGCCG AGCTGGCCCA GCTTGTGCAA GCCCAAACCG CCGAACAAGC AGGCGAATTA TCGTCGTTGG TACGCTTGCA ACCACAAGGC CAAGCAGCGC CGTTCTTTTG TATTCACCCC ATGAGCGGTA TGGCCAATGT CTATGCAGCT TTGGCCCAAT TGCTTGGCAC GCAACGCCCA TTCTACGGGG TGCAGGCCTT TGGCTTGGAA TACCCTGAAA TGCCACTTGA CGATATTACA GTTATGGCGC AACGCTATCT CAGCGATATC CGTCAGGTTC AGCCGCAAGG GCCATATCTG CTTGGCGGCT GGTCGATGGG TGGCTCGATT GCCTTTGAAA TTGCCAGCCA ATTAGTGGCT CAAGGCGAAA CGGTGAGCTT GCTGGCCTTG ATCGACACGC CCGCCAAACT AACAGGCACA TCACCAGCAA CACTAACCGA TTTGGAGCTG TTGATCGCCA TGCTTGGGGT CGATGCCAAC ATTTTAGGGA TCGACGATCC CAAGGCTGAG GCCAGCGAAG CAGTTTGGAA CGAATTACTG AGCATTATCA AGCAACATCT TGGCTTGCCC GAAAGCTACA CCTTACAGCG TTTGCGCCAC CTGGTCAGCA CCTTCCGCAC CCACTCGCAA GCAGTTTGGA ATTATCAGCC AGCTAACTAT CCCAACGATG TGCTAATTTT ACGGGCCGCC GATTTGGCAG GCGAGCACGA CGACCAACTG AACGAGGCAT ATCGCATAGC CGATTTAGGC TGGAGCCAGT TTGTCACAGG CCAAATTCAA GTCCAAACCA TTCCAGGCAC GCACAACACC CTGTTGAACG AACCGTCGTT GCCAATCTTG GCTAGTCATC TCGCGGTAGC GTTACACAAC GTTGCCGCAA ATACATCTTT GACACCATCA CACCATGAAG GATCAAATGA AATTGCCATT ATGGATTGA
|
Protein sequence | MSYDETDGNF DIAVVGLAGR WAGAPDIDHF WQQLCAGAEG ISFFDDEQLL AAGVTPEQLA HPRYVKAGSV LEGVELFDAA FFGYSPREAE LLDPQQRIFL ECAWQALEHA GYNASTYPGL IGVFAGSSLN TYLLRNLASQ QGRGTIPDLF QLIMGNDKDF LPTRVSYKLN LRGPSFSVQS ACSTSLVATH LACQSLLGYQ CDIALAGGIS ITVPQHQGYL AQEGGILAPD GHCRTFDAQA QGTLNGNGAG IVVLKRLDDA LADGDSIYAV IKGSAVNNDG ALKIGYTAPS IDGQAAVIQA AQAVADVDPV TISYIEAHGT ATALGDPIEV AALTKAFRQQ TDKTQFCMLG SVKSNFGHLD TAAGVTSLIK TVLALHHNRI PASLHFQTPN PQLELESSPF YVNTKLSDWP SDQPIRRAGV SSFGIGGTNA HIVLEEAPML EPTEETDAWH MLVISGRNRA TLNAATKNLA EHLEANPQLA LADVAYTLQV GRQAFNHRRV LLCRSLDEAS QILRQRDKRM ISGQVSAATP AVVFMFPGGG VQYPSMGQEL YQTQPIFRAA VERCLAMLKP EISANLRQLV YSENAKVHEQ QLATMLYSLL AIFISEYALA QLWLAWGIQP VALIGHSLGE YTAAHLAGSI SLEAALRLVE LRGRLMDQLE DGAMINIALA EAEVLPLLGE QLSLAAVNGP EHSVVAGSIA AIQALEFELE QRAIKYRRLP IRVAAHSSLL KPIVAEFATF AQTISMQPAT IPYISNVTGG WMSHEQWSNP NYWTEHLQST VRFSAGMSQL LQNPAHLFLE VGPGQTLTTL TRAQANFGAE RVVAQSMRHP QDQQTDTQCL LTAVGRLWLA GVAIDWAKLS ASKRRRVALP TYPFERKRYW VEQFINNETQ GPTLLEAAQG GYSLPESSEP AEASPGYERP NLTTEYVAPS NNLEHMITAL WGAVLGVPLI GIHDNFFELG GDSLLALQVA THLTEKVHTT IGVRSLFEAP TIAELAQLVQ AQTAEQAGEL SSLVRLQPQG QAAPFFCIHP MSGMANVYAA LAQLLGTQRP FYGVQAFGLE YPEMPLDDIT VMAQRYLSDI RQVQPQGPYL LGGWSMGGSI AFEIASQLVA QGETVSLLAL IDTPAKLTGT SPATLTDLEL LIAMLGVDAN ILGIDDPKAE ASEAVWNELL SIIKQHLGLP ESYTLQRLRH LVSTFRTHSQ AVWNYQPANY PNDVLILRAA DLAGEHDDQL NEAYRIADLG WSQFVTGQIQ VQTIPGTHNT LLNEPSLPIL ASHLAVALHN VAANTSLTPS HHEGSNEIAI MD
|
| |