Gene Haur_2416 details

Gene Information

Locus tag: Haur_2416
Symbol:
ID: 5734297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3095212
End bp: 3099120
Gene Length: 3909 bp
Protein Length: 1302 aa
Translation table: 11
GC content: 53%
IMG OID: 641279557
Product: Beta-ketoacyl synthase
Protein accession: YP_001545184
Protein GI: 159898937
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
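
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(3095212, 3099120)
print(length)                     # 3909
print(protein_length_aa(length))  # 1302
```

Both results agree with the Gene Length (3909 bp) and Protein Length (1302 aa) fields of this record.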


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTATG ATGAAACCGA TGGCAATTTT GATATTGCTG TGGTAGGTTT GGCGGGGCGC 
TGGGCTGGCG CTCCCGATAT TGATCACTTT TGGCAGCAGC TTTGTGCTGG AGCCGAGGGC
ATTAGCTTTT TTGATGACGA GCAATTGCTA GCCGCAGGCG TAACGCCTGA GCAACTAGCC
CATCCACGCT ATGTCAAAGC TGGCTCGGTG CTCGAAGGGG TCGAGCTATT CGATGCAGCC
TTTTTTGGCT ATTCGCCACG CGAGGCCGAA TTGCTCGATC CGCAGCAGCG GATTTTTCTG
GAATGTGCTT GGCAAGCCTT GGAACATGCA GGCTACAACG CCTCAACCTA TCCCGGCTTA
ATCGGCGTAT TTGCTGGTTC AAGCCTCAAT ACCTACTTGC TGCGTAACCT AGCCAGCCAA
CAAGGCCGAG GCACAATTCC CGATCTGTTT CAACTGATTA TGGGCAACGA CAAAGACTTT
TTGCCCACTC GAGTCAGCTA TAAGTTGAAT TTGCGCGGGC CAAGTTTTAG CGTGCAAAGT
GCCTGCTCAA CCTCATTGGT CGCAACCCAT TTGGCCTGCC AAAGCCTGTT GGGCTATCAA
TGCGATATCG CGCTGGCTGG CGGAATTTCG ATCACCGTGC CGCAACATCA AGGCTATCTG
GCCCAAGAAG GCGGCATTCT CGCCCCAGAC GGCCATTGCC GCACATTCGA TGCCCAAGCC
CAAGGCACAC TCAACGGCAA CGGTGCAGGT ATCGTGGTTT TGAAACGGCT CGACGATGCC
TTGGCTGATG GCGATTCGAT CTATGCGGTA ATCAAAGGCT CAGCCGTCAA CAACGATGGT
GCGCTCAAAA TTGGCTACAC GGCACCGAGC ATTGATGGGC AGGCCGCTGT GATTCAGGCC
GCTCAAGCGG TGGCTGATGT TGATCCAGTA ACGATTTCCT ATATCGAAGC CCATGGAACG
GCCACTGCCC TCGGCGATCC AATCGAAGTT GCCGCTTTAA CCAAGGCCTT CCGTCAACAA
ACTGATAAAA CGCAGTTTTG TATGCTTGGC TCGGTCAAAT CCAATTTTGG GCATCTTGAT
ACTGCGGCGG GCGTAACCAG CTTGATTAAA ACCGTGCTGG CCCTGCATCA CAATAGAATT
CCCGCCAGCT TGCATTTTCA AACCCCCAAT CCGCAACTTG AGCTAGAAAG CTCACCATTT
TATGTCAACA CTAAATTAAG TGATTGGCCA AGCGATCAGC CAATTCGGCG GGCGGGGGTC
AGTTCATTTG GGATTGGCGG CACGAATGCG CATATTGTGC TCGAAGAAGC GCCCATGCTC
GAACCAACCG AGGAAACTGA CGCTTGGCAC ATGCTGGTGA TTTCGGGGCG TAATCGGGCT
ACCTTGAATG CGGCCACCAA AAATCTGGCC GAGCACCTTG AGGCTAATCC CCAATTGGCC
TTGGCTGATG TTGCCTACAC CTTACAAGTT GGCCGCCAAG CCTTTAATCA TCGGCGGGTG
TTGCTGTGCC GTTCGCTGGA TGAAGCCAGC CAAATTTTGC GCCAACGCGA TAAGCGCATG
ATCAGCGGCC AAGTCAGCGC GGCTACGCCT GCGGTGGTGT TTATGTTTCC GGGCGGCGGC
GTACAATATC CCAGCATGGG CCAAGAACTC TATCAAACTC AGCCGATCTT CCGTGCGGCG
GTTGAACGCT GTTTGGCCAT GCTTAAACCT GAAATCAGCG CCAATCTACG CCAGTTGGTC
TATAGCGAAA ACGCCAAAGT TCATGAGCAA CAATTGGCGA CAATGCTCTA CAGTTTGTTG
GCAATTTTCA TCAGTGAGTA TGCGCTTGCT CAATTATGGC TGGCATGGGG CATTCAGCCT
GTCGCCTTGA TTGGCCATAG TTTGGGTGAA TATACCGCGG CCCATTTGGC TGGCTCGATC
TCACTTGAGG CGGCTTTGCG CTTAGTTGAG CTACGTGGCC GCTTGATGGA TCAGCTTGAA
GATGGCGCGA TGATCAACAT TGCCCTTGCT GAAGCTGAGG TTTTGCCGTT GCTCGGCGAA
CAACTTTCGC TAGCGGCAGT CAATGGGCCT GAACATTCGG TCGTGGCTGG CTCGATTGCA
GCAATTCAAG CCCTTGAGTT TGAGCTAGAA CAGCGGGCGA TCAAATATCG ACGCTTGCCA
ATTCGCGTGG CAGCCCATTC CAGTTTGCTC AAGCCAATTG TGGCCGAATT TGCGACATTT
GCCCAGACGA TCAGCATGCA ACCAGCCACA ATTCCCTATA TTTCCAATGT AACTGGCGGT
TGGATGAGCC ATGAACAATG GTCAAACCCG AACTATTGGA CTGAGCATTT ACAATCGACC
GTGCGATTTA GCGCTGGGAT GAGCCAACTT TTGCAAAATC CAGCTCATCT ATTTCTCGAA
GTTGGGCCTG GCCAAACGCT CACAACCTTA ACTCGCGCTC AAGCCAACTT TGGAGCCGAA
CGGGTGGTAG CGCAGTCGAT GCGCCATCCC CAAGATCAAC AAACTGACAC CCAATGTTTG
TTGACTGCGG TCGGGCGGTT ATGGCTGGCT GGCGTGGCGA TCGATTGGGC CAAACTAAGC
GCCAGCAAGC GTCGCCGCGT GGCCCTACCA ACTTATCCCT TCGAACGCAA ACGCTACTGG
GTTGAGCAAT TTATCAATAA TGAAACCCAA GGCCCGACCT TGCTCGAAGC GGCGCAAGGT
GGTTATAGCT TGCCCGAAAG CAGCGAGCCA GCCGAGGCCA GCCCTGGCTA CGAACGGCCT
AATCTCACGA CTGAATATGT CGCGCCCAGC AATAACCTTG AGCATATGAT CACCGCGCTG
TGGGGTGCTG TCTTGGGCGT ACCGCTGATT GGTATTCACG ACAATTTCTT TGAGTTGGGC
GGCGATTCGC TGCTAGCCTT ACAAGTTGCG ACCCATCTCA CCGAAAAAGT GCATACCACA
ATCGGCGTGC GCAGTTTATT CGAAGCCCCA ACGATCGCCG AGCTGGCCCA GCTTGTGCAA
GCCCAAACCG CCGAACAAGC AGGCGAATTA TCGTCGTTGG TACGCTTGCA ACCACAAGGC
CAAGCAGCGC CGTTCTTTTG TATTCACCCC ATGAGCGGTA TGGCCAATGT CTATGCAGCT
TTGGCCCAAT TGCTTGGCAC GCAACGCCCA TTCTACGGGG TGCAGGCCTT TGGCTTGGAA
TACCCTGAAA TGCCACTTGA CGATATTACA GTTATGGCGC AACGCTATCT CAGCGATATC
CGTCAGGTTC AGCCGCAAGG GCCATATCTG CTTGGCGGCT GGTCGATGGG TGGCTCGATT
GCCTTTGAAA TTGCCAGCCA ATTAGTGGCT CAAGGCGAAA CGGTGAGCTT GCTGGCCTTG
ATCGACACGC CCGCCAAACT AACAGGCACA TCACCAGCAA CACTAACCGA TTTGGAGCTG
TTGATCGCCA TGCTTGGGGT CGATGCCAAC ATTTTAGGGA TCGACGATCC CAAGGCTGAG
GCCAGCGAAG CAGTTTGGAA CGAATTACTG AGCATTATCA AGCAACATCT TGGCTTGCCC
GAAAGCTACA CCTTACAGCG TTTGCGCCAC CTGGTCAGCA CCTTCCGCAC CCACTCGCAA
GCAGTTTGGA ATTATCAGCC AGCTAACTAT CCCAACGATG TGCTAATTTT ACGGGCCGCC
GATTTGGCAG GCGAGCACGA CGACCAACTG AACGAGGCAT ATCGCATAGC CGATTTAGGC
TGGAGCCAGT TTGTCACAGG CCAAATTCAA GTCCAAACCA TTCCAGGCAC GCACAACACC
CTGTTGAACG AACCGTCGTT GCCAATCTTG GCTAGTCATC TCGCGGTAGC GTTACACAAC
GTTGCCGCAA ATACATCTTT GACACCATCA CACCATGAAG GATCAAATGA AATTGCCATT
ATGGATTGA
 
Protein sequence
MSYDETDGNF DIAVVGLAGR WAGAPDIDHF WQQLCAGAEG ISFFDDEQLL AAGVTPEQLA 
HPRYVKAGSV LEGVELFDAA FFGYSPREAE LLDPQQRIFL ECAWQALEHA GYNASTYPGL
IGVFAGSSLN TYLLRNLASQ QGRGTIPDLF QLIMGNDKDF LPTRVSYKLN LRGPSFSVQS
ACSTSLVATH LACQSLLGYQ CDIALAGGIS ITVPQHQGYL AQEGGILAPD GHCRTFDAQA
QGTLNGNGAG IVVLKRLDDA LADGDSIYAV IKGSAVNNDG ALKIGYTAPS IDGQAAVIQA
AQAVADVDPV TISYIEAHGT ATALGDPIEV AALTKAFRQQ TDKTQFCMLG SVKSNFGHLD
TAAGVTSLIK TVLALHHNRI PASLHFQTPN PQLELESSPF YVNTKLSDWP SDQPIRRAGV
SSFGIGGTNA HIVLEEAPML EPTEETDAWH MLVISGRNRA TLNAATKNLA EHLEANPQLA
LADVAYTLQV GRQAFNHRRV LLCRSLDEAS QILRQRDKRM ISGQVSAATP AVVFMFPGGG
VQYPSMGQEL YQTQPIFRAA VERCLAMLKP EISANLRQLV YSENAKVHEQ QLATMLYSLL
AIFISEYALA QLWLAWGIQP VALIGHSLGE YTAAHLAGSI SLEAALRLVE LRGRLMDQLE
DGAMINIALA EAEVLPLLGE QLSLAAVNGP EHSVVAGSIA AIQALEFELE QRAIKYRRLP
IRVAAHSSLL KPIVAEFATF AQTISMQPAT IPYISNVTGG WMSHEQWSNP NYWTEHLQST
VRFSAGMSQL LQNPAHLFLE VGPGQTLTTL TRAQANFGAE RVVAQSMRHP QDQQTDTQCL
LTAVGRLWLA GVAIDWAKLS ASKRRRVALP TYPFERKRYW VEQFINNETQ GPTLLEAAQG
GYSLPESSEP AEASPGYERP NLTTEYVAPS NNLEHMITAL WGAVLGVPLI GIHDNFFELG
GDSLLALQVA THLTEKVHTT IGVRSLFEAP TIAELAQLVQ AQTAEQAGEL SSLVRLQPQG
QAAPFFCIHP MSGMANVYAA LAQLLGTQRP FYGVQAFGLE YPEMPLDDIT VMAQRYLSDI
RQVQPQGPYL LGGWSMGGSI AFEIASQLVA QGETVSLLAL IDTPAKLTGT SPATLTDLEL
LIAMLGVDAN ILGIDDPKAE ASEAVWNELL SIIKQHLGLP ESYTLQRLRH LVSTFRTHSQ
AVWNYQPANY PNDVLILRAA DLAGEHDDQL NEAYRIADLG WSQFVTGQIQ VQTIPGTHNT
LLNEPSLPIL ASHLAVALHN VAANTSLTPS HHEGSNEIAI MD
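
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it uses the standard amino acid assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here). The names `translate`, `gc_fraction` and `clean` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; stop codons are shown as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence in this record.
first_row = "ATGAGTTATG ATGAAACCGA TGGCAATTTT GATATTGCTG TGGTAGGTTT GGCGGGGCGC"
print(translate(first_row))    # MSYDETDGNFDIAVVGLAGR
print(translate("ATGGATTGA"))  # MD*
```

The first output matches the first 20 residues of the protein sequence, and the last row translates to the final two residues followed by the stop codon. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.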