Gene Haur_1860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1860 
Symbol 
ID5733749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2182669 
End bp2186919 
Gene Length4251 bp 
Protein Length1416 aa 
Translation table11 
GC content54% 
IMG OID641279004 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001544631 
Protein GI159898384 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.699288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGCCT ATACCGACAC TGATATTGCA ATCATTGGGA TGGCTGGACG CTTCCCAGGG 
GCCAACAATC TGACAACCTT TTGGCATAAT ATCTGCCAAG GAATTGTCTG CATTACCGAT
CTAGATCTGC GACAACTTCA GGCCAGCGGA ATCGACCAAG CCCTGCTCAA CGATCCCGCC
TACGTCAAAC GTGCGCCCTT GCTGCTTGAC GATATTGCTG GCTTTGATCA TAACTTTTGG
GGCATTACGC CCAATGAGGC CGCCTTGCTT GATCCACAGC AGCGGATGTT GCTGGAATGT
GCATGGGAAG CCTTGGAGCA GGCGGGCTAT GCGCTTAAAC AGCCACAAAA AGTTGGGGTT
TTTGTGGGAG TCGGCGCAAA TAACTATTTG TTGCAACAGG TGCTAGCCAA TCCAAGCGCA
ATCGAGCGCA ATGGCGATTA TGCCGTGATG CTCGCCAATG ATAAAGATTA TGCGCCAACC
CGAATTTCAT TCAAGCTGAA TTTGCTCGGG CCGAGCGTCA GCGTGCAAAC CGCCTGCTCA
ACCTCGTTGG TGGCAACCCA TATGGCAATT CAGAGCATTC TGAATGGCGA AAGTGCGATG
GCGCTGGCTG GTGGCGCAGC GCTGCGGATT CCCCAAATGC AAGGCCACCT CTATCAAGCT
GGCATGATCC ACTCGCCTGA TGGCAATTGC AAGCCTTTTA CCAGCGAGGC GGCGGGCACA
ATTGGCAGTA GCGGCGCTGG CATGGTGTTG CTCAAACGAG CGATTGATGC AGTAGCTGAT
GGCGATCCAA TTTGGGCAGT GATCAAGGGG AGCGCAATTA ATAATGATGG CGCTCAGAAA
GTTGGCTTTA CCGCACCAAG CATCAAGGGT CAGGCCCAGG TTGTGGCCGA AGCCTTGGCC
TTGGCCGAAG TTGAGCCGAG CATGGTTGGC TACCTTGAAA CCCATGGCAC CGCGACCAAC
CTTGGCGATC CAATCGAGTT AGCAGCGCTC AAAAAGGTTT TTGGGGCAAC TGCTGCACAA
GCCTGGTGCG CCTTAGGCAC GCTCAAAGCT AATATTGGTC ATCTCGATAA TGCGGCTGGG
GTGGCAGGCT TGATCAAAAC TGCCTTGGCC CTTCATCATC GCCAACGACC ACCAACCGCC
TATGCCACCA TTCCACATAG CAATTTGCAA CAATCACCAT TTTATTTGAA CCCGACTGCG
AGCGCTTGGG ATGCAAATCT TTATGCTGGA GTCAGTTCAT TTGGCATTGG CGGTACGAAT
GCCCATGTGG TGCTGGCGGC AGCCGAACCA ATCGCCAAGC AAGCTCAGGC TGAATCATGG
CAATTGCTGC CAATTTCGGC GGCCAGCGTT TGGTCGCTAG AGCAGCAAAC CGAGCGGCTG
GCGGCGCATC TGCAAACTAA CCCAGCGCAA TCGTTGGCTG ATGTAGCCTA TACCCTGCAA
ACTGCACGGC AAGCCTTTGC TCAACGCAGT TTTGTGGTTG CCAAAAGCCA TGAACAAGCC
AGCCAAGCCT TGTTGACCAG CCCAATCCGT GAGTTTGCCA GCCAACCGCC CAAGGTTGCT
TGGCTCTTTT CGGGGCAGGG CAGCCAATGT GCTGGGATGG CCGCTGAACT CTATCAGCAA
GCACCCGCTT TTCGCCAAGC AATTGATCTG GTGAATCGCT ATGCTCAACC CTTACTCGGC
TACGATCTGC GGCAAATGAT GTTCGATCAT TCGGGCGATC TGACGGCGAC GAACGTCGCG
CAGCCATGTT TGTTTATGCT GGAATATGCC TTGGCGCAAC AATGGTTGGC GTGGGGCATT
CAACCACAAG CCTTGTTTGG CCATAGCATT GGCGAATATG TGGCCGCTTG TGTTGCCAAT
GTGCTGGATT TACCGACTGC TGTGCGCTTG GTGGTGGTGC GTGGCCAGTT GATGAACCAA
TTGCCTAGCG GAGCGATGCT CAGCATTAAC GCCAGCCGCG ATCAGATTCA GCCAATCTTG
CCTGCTGAAC TTGATTTGGC GGCGATCAAT ACCAATCAGC TGTTGGTGGT CGCTGGTGAG
CATGTGGCGA TCCAAGCCTT TCAGCAGCAA TTGAACGAGC GTGGCATCGA ATCACGCATA
CTGCACACCT CGCATGCCTT TCATTCGCGA GCAATGCAGC CGATGTTGGC AGCGTTTCGC
CAGCACTTTG CCGATGTGCA ACTTCAAGCG CCAACCATCC CGATCCTCTC GAATGTGAGT
GGTACATGGC TGACGGCGGT CCAAGCTACC AGTGTTGACT ATTGGCTTGA ACAGATTGTT
AATCCAGTGC AGTGTGCTGC TTGTTTGGCC CAGTTGTTGG CCGATGATGC TTGGATTGTG
CAGGAGCTTG GGCCTGGGCA TACGCTGGCA ACCTTTGCCC GTGCCAGCCA GGCTGCTCAA
ACGCCGTTGA TTCTCACCAG TTTGCCGCAT CCACAGGTTA AGCAAGCCGA TTTAGCCGTG
ATGTTGCAAA GTTTAGGTCA ATTGTGGCAA GCAGGCATGA CGGTGCTGTG GCAGGCTCTA
CATCAAACAT CATGTCATAA AGTTTGGTTG CCAACCTATG CGTTCGATCA CCAACGCCAT
TGGATTGAGG CAGAACAGGC CCACGCACCC CGCGCAAGCA ATCAAATAAG CATCAATACC
CCAACCTGGC AGGCTCAAAG CTTGGCAACC ACCCCAATCC AAGCCAGCAC CTGGCTGGTG
TGGGGCAGCC AACCAAGCAC TATCGAACGG CTGCTCGAAC AATTTGCCGC CGATCAGCAG
CTCTATATTT GCAATCAAGA TACGTTTGCC CAACAGAGCA AACAACTTAA TTCTACGACC
AGCCAAATCA TCTATTTCAG CAACTTCAGC AGCCTCTCAG CATGGTCAGC CGCCGCTGAT
CTGTTGACGA TCGCCAAGCG GGTGCGCGAG TTGGGGCTAG CCCAGGTACA TTTGCATGTT
GTAAGCAGCC AGAGTTTGGC GATTGCTCCG CATGAAGATC TCAACCCGCA TGATGCTGCA
TTGCGCGGGG TGGCGATGGT TTTGGCCCAA GAATATCCCG AAATCAGCAG CCATTGGCTT
GACCTTGATT TTACCCATCA AGCGTGGCCG AGTATGTTGG CCCAAGAGCT ACAAAGTGCC
AGCGAGCCAG TGGTGGCATT GCGTGGTTTC AATCGGTTTG TGCCGCAGGC CAGGGTTGCA
AGCTTACCAA CCAATCAGCA GGGCTTCCTC GTTGGTGGTT GCTATTTGAT TACGGGTGGC
AACGGCGGAC TGGGTCGTCA AATTGCGCTC CAACTAGCCC AACAATATCA GGCCAAGATC
GCAATTTTGA GCCGTTCGCT CGTGCCTGAG TCGCCCGCTG CCTATGATTT GCAACAGCAG
ATTACCGAAC GTGGTGGTGA GGCCTTAATT ATTCAGGCCG ATGTGGCACA GCCTAAACAA
TTAGCCGAAG CCTTGGCGCT GGTCGAAACC CATTGGGGCC AATGTCATGG CCTGATTCAC
GCAGCGGGCA GTGCCGCCCA AATTGCTTGT ACTGACACCA CCCAAGCAAC TTGGGATGCG
CTGATGGCGG CCAAGCTCCA GGGAACCCAA AACCTTGCCA TGATGGTGCG GCCTTGGAAA
CTCGATTGGG CGGTGGTGAT GTCGTCGTTG GCCAGTGATT TGGGTGGGCT GGGCTTTGCC
TGCTATGCCA GCGCCAACAT TGCCCAACAT GCCATCGTGC ATCAACTCAA TCAATCAAGC
GCTGTACCGT GGTATTGCAT CAACTGGGAT GGTTGGCAAA GCGCCGATGA AGCCGATCCT
CAACGCTTGA GCTTTGAACA AGGCTGGGCT GCGCTAAGTG CGATTGTCCA ACAACGCGGC
GTACTCAACC CGCAAGTAGT TGTGGGTGAT GTTGCTGCAC GTCGTCAGCG CTGGCTCTAT
CCACAACCGA CAGTTGCTGC GCCAATCACG ACGCATCGTC CACGTTCAGC CAATTTCGAG
GCTCCACGCA GCCAACTTGA ACAGCAACTT GCGGCAATTT GGCACGATTT ACTCGGCGTA
AACGAACTTA GCATTCATGA TAATTTCTTC GATTTGGGCG GCCATTCGTT GTTAGCAACC
CAAGTGCTTG GCCGAATTCG TCAGCAACTT CAACTTGATC TACCGCTACA GCTGCTTTTT
GAAGCGCCAA CCCTTCAAGG CTTGGCCGAG CATATTCGGC AGCAGCAGCT TAGCCAAACG
TTACAAGCCC AAGCAGTTGG CGGTGGCGAG CGTGAGGAGA TTGAGTTATG A
 
Protein sequence
MHAYTDTDIA IIGMAGRFPG ANNLTTFWHN ICQGIVCITD LDLRQLQASG IDQALLNDPA 
YVKRAPLLLD DIAGFDHNFW GITPNEAALL DPQQRMLLEC AWEALEQAGY ALKQPQKVGV
FVGVGANNYL LQQVLANPSA IERNGDYAVM LANDKDYAPT RISFKLNLLG PSVSVQTACS
TSLVATHMAI QSILNGESAM ALAGGAALRI PQMQGHLYQA GMIHSPDGNC KPFTSEAAGT
IGSSGAGMVL LKRAIDAVAD GDPIWAVIKG SAINNDGAQK VGFTAPSIKG QAQVVAEALA
LAEVEPSMVG YLETHGTATN LGDPIELAAL KKVFGATAAQ AWCALGTLKA NIGHLDNAAG
VAGLIKTALA LHHRQRPPTA YATIPHSNLQ QSPFYLNPTA SAWDANLYAG VSSFGIGGTN
AHVVLAAAEP IAKQAQAESW QLLPISAASV WSLEQQTERL AAHLQTNPAQ SLADVAYTLQ
TARQAFAQRS FVVAKSHEQA SQALLTSPIR EFASQPPKVA WLFSGQGSQC AGMAAELYQQ
APAFRQAIDL VNRYAQPLLG YDLRQMMFDH SGDLTATNVA QPCLFMLEYA LAQQWLAWGI
QPQALFGHSI GEYVAACVAN VLDLPTAVRL VVVRGQLMNQ LPSGAMLSIN ASRDQIQPIL
PAELDLAAIN TNQLLVVAGE HVAIQAFQQQ LNERGIESRI LHTSHAFHSR AMQPMLAAFR
QHFADVQLQA PTIPILSNVS GTWLTAVQAT SVDYWLEQIV NPVQCAACLA QLLADDAWIV
QELGPGHTLA TFARASQAAQ TPLILTSLPH PQVKQADLAV MLQSLGQLWQ AGMTVLWQAL
HQTSCHKVWL PTYAFDHQRH WIEAEQAHAP RASNQISINT PTWQAQSLAT TPIQASTWLV
WGSQPSTIER LLEQFAADQQ LYICNQDTFA QQSKQLNSTT SQIIYFSNFS SLSAWSAAAD
LLTIAKRVRE LGLAQVHLHV VSSQSLAIAP HEDLNPHDAA LRGVAMVLAQ EYPEISSHWL
DLDFTHQAWP SMLAQELQSA SEPVVALRGF NRFVPQARVA SLPTNQQGFL VGGCYLITGG
NGGLGRQIAL QLAQQYQAKI AILSRSLVPE SPAAYDLQQQ ITERGGEALI IQADVAQPKQ
LAEALALVET HWGQCHGLIH AAGSAAQIAC TDTTQATWDA LMAAKLQGTQ NLAMMVRPWK
LDWAVVMSSL ASDLGGLGFA CYASANIAQH AIVHQLNQSS AVPWYCINWD GWQSADEADP
QRLSFEQGWA ALSAIVQQRG VLNPQVVVGD VAARRQRWLY PQPTVAAPIT THRPRSANFE
APRSQLEQQL AAIWHDLLGV NELSIHDNFF DLGGHSLLAT QVLGRIRQQL QLDLPLQLLF
EAPTLQGLAE HIRQQQLSQT LQAQAVGGGE REEIEL