Gene Information |
Locus tag | Haur_1860 |
Symbol | |
ID | 5733749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2182669 |
End bp | 2186919 |
Gene Length | 4251 bp |
Protein Length | 1416 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279004 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001544631 |
Protein GI | 159898384 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
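The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the stated gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the record above.
START_BP = 2182669
END_BP = 2186919


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the CDS length
    includes one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4251, matching "Gene Length"
print(protein_length(length_bp))  # 1416, matching "Protein Length"
```

Under those assumptions, 4251 bp corresponds to 1417 codons, of which the last is the stop codon, leaving the 1416 residues listed.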
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.699288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGCCT ATACCGACAC TGATATTGCA ATCATTGGGA TGGCTGGACG CTTCCCAGGG GCCAACAATC TGACAACCTT TTGGCATAAT ATCTGCCAAG GAATTGTCTG CATTACCGAT CTAGATCTGC GACAACTTCA GGCCAGCGGA ATCGACCAAG CCCTGCTCAA CGATCCCGCC TACGTCAAAC GTGCGCCCTT GCTGCTTGAC GATATTGCTG GCTTTGATCA TAACTTTTGG GGCATTACGC CCAATGAGGC CGCCTTGCTT GATCCACAGC AGCGGATGTT GCTGGAATGT GCATGGGAAG CCTTGGAGCA GGCGGGCTAT GCGCTTAAAC AGCCACAAAA AGTTGGGGTT TTTGTGGGAG TCGGCGCAAA TAACTATTTG TTGCAACAGG TGCTAGCCAA TCCAAGCGCA ATCGAGCGCA ATGGCGATTA TGCCGTGATG CTCGCCAATG ATAAAGATTA TGCGCCAACC CGAATTTCAT TCAAGCTGAA TTTGCTCGGG CCGAGCGTCA GCGTGCAAAC CGCCTGCTCA ACCTCGTTGG TGGCAACCCA TATGGCAATT CAGAGCATTC TGAATGGCGA AAGTGCGATG GCGCTGGCTG GTGGCGCAGC GCTGCGGATT CCCCAAATGC AAGGCCACCT CTATCAAGCT GGCATGATCC ACTCGCCTGA TGGCAATTGC AAGCCTTTTA CCAGCGAGGC GGCGGGCACA ATTGGCAGTA GCGGCGCTGG CATGGTGTTG CTCAAACGAG CGATTGATGC AGTAGCTGAT GGCGATCCAA TTTGGGCAGT GATCAAGGGG AGCGCAATTA ATAATGATGG CGCTCAGAAA GTTGGCTTTA CCGCACCAAG CATCAAGGGT CAGGCCCAGG TTGTGGCCGA AGCCTTGGCC TTGGCCGAAG TTGAGCCGAG CATGGTTGGC TACCTTGAAA CCCATGGCAC CGCGACCAAC CTTGGCGATC CAATCGAGTT AGCAGCGCTC AAAAAGGTTT TTGGGGCAAC TGCTGCACAA GCCTGGTGCG CCTTAGGCAC GCTCAAAGCT AATATTGGTC ATCTCGATAA TGCGGCTGGG GTGGCAGGCT TGATCAAAAC TGCCTTGGCC CTTCATCATC GCCAACGACC ACCAACCGCC TATGCCACCA TTCCACATAG CAATTTGCAA CAATCACCAT TTTATTTGAA CCCGACTGCG AGCGCTTGGG ATGCAAATCT TTATGCTGGA GTCAGTTCAT TTGGCATTGG CGGTACGAAT GCCCATGTGG TGCTGGCGGC AGCCGAACCA ATCGCCAAGC AAGCTCAGGC TGAATCATGG CAATTGCTGC CAATTTCGGC GGCCAGCGTT TGGTCGCTAG AGCAGCAAAC CGAGCGGCTG GCGGCGCATC TGCAAACTAA CCCAGCGCAA TCGTTGGCTG ATGTAGCCTA TACCCTGCAA ACTGCACGGC AAGCCTTTGC TCAACGCAGT TTTGTGGTTG CCAAAAGCCA TGAACAAGCC AGCCAAGCCT TGTTGACCAG CCCAATCCGT GAGTTTGCCA GCCAACCGCC CAAGGTTGCT TGGCTCTTTT CGGGGCAGGG CAGCCAATGT GCTGGGATGG CCGCTGAACT CTATCAGCAA GCACCCGCTT TTCGCCAAGC AATTGATCTG GTGAATCGCT ATGCTCAACC CTTACTCGGC TACGATCTGC GGCAAATGAT GTTCGATCAT TCGGGCGATC TGACGGCGAC GAACGTCGCG CAGCCATGTT TGTTTATGCT GGAATATGCC TTGGCGCAAC AATGGTTGGC GTGGGGCATT 
CAACCACAAG CCTTGTTTGG CCATAGCATT GGCGAATATG TGGCCGCTTG TGTTGCCAAT GTGCTGGATT TACCGACTGC TGTGCGCTTG GTGGTGGTGC GTGGCCAGTT GATGAACCAA TTGCCTAGCG GAGCGATGCT CAGCATTAAC GCCAGCCGCG ATCAGATTCA GCCAATCTTG CCTGCTGAAC TTGATTTGGC GGCGATCAAT ACCAATCAGC TGTTGGTGGT CGCTGGTGAG CATGTGGCGA TCCAAGCCTT TCAGCAGCAA TTGAACGAGC GTGGCATCGA ATCACGCATA CTGCACACCT CGCATGCCTT TCATTCGCGA GCAATGCAGC CGATGTTGGC AGCGTTTCGC CAGCACTTTG CCGATGTGCA ACTTCAAGCG CCAACCATCC CGATCCTCTC GAATGTGAGT GGTACATGGC TGACGGCGGT CCAAGCTACC AGTGTTGACT ATTGGCTTGA ACAGATTGTT AATCCAGTGC AGTGTGCTGC TTGTTTGGCC CAGTTGTTGG CCGATGATGC TTGGATTGTG CAGGAGCTTG GGCCTGGGCA TACGCTGGCA ACCTTTGCCC GTGCCAGCCA GGCTGCTCAA ACGCCGTTGA TTCTCACCAG TTTGCCGCAT CCACAGGTTA AGCAAGCCGA TTTAGCCGTG ATGTTGCAAA GTTTAGGTCA ATTGTGGCAA GCAGGCATGA CGGTGCTGTG GCAGGCTCTA CATCAAACAT CATGTCATAA AGTTTGGTTG CCAACCTATG CGTTCGATCA CCAACGCCAT TGGATTGAGG CAGAACAGGC CCACGCACCC CGCGCAAGCA ATCAAATAAG CATCAATACC CCAACCTGGC AGGCTCAAAG CTTGGCAACC ACCCCAATCC AAGCCAGCAC CTGGCTGGTG TGGGGCAGCC AACCAAGCAC TATCGAACGG CTGCTCGAAC AATTTGCCGC CGATCAGCAG CTCTATATTT GCAATCAAGA TACGTTTGCC CAACAGAGCA AACAACTTAA TTCTACGACC AGCCAAATCA TCTATTTCAG CAACTTCAGC AGCCTCTCAG CATGGTCAGC CGCCGCTGAT CTGTTGACGA TCGCCAAGCG GGTGCGCGAG TTGGGGCTAG CCCAGGTACA TTTGCATGTT GTAAGCAGCC AGAGTTTGGC GATTGCTCCG CATGAAGATC TCAACCCGCA TGATGCTGCA TTGCGCGGGG TGGCGATGGT TTTGGCCCAA GAATATCCCG AAATCAGCAG CCATTGGCTT GACCTTGATT TTACCCATCA AGCGTGGCCG AGTATGTTGG CCCAAGAGCT ACAAAGTGCC AGCGAGCCAG TGGTGGCATT GCGTGGTTTC AATCGGTTTG TGCCGCAGGC CAGGGTTGCA AGCTTACCAA CCAATCAGCA GGGCTTCCTC GTTGGTGGTT GCTATTTGAT TACGGGTGGC AACGGCGGAC TGGGTCGTCA AATTGCGCTC CAACTAGCCC AACAATATCA GGCCAAGATC GCAATTTTGA GCCGTTCGCT CGTGCCTGAG TCGCCCGCTG CCTATGATTT GCAACAGCAG ATTACCGAAC GTGGTGGTGA GGCCTTAATT ATTCAGGCCG ATGTGGCACA GCCTAAACAA TTAGCCGAAG CCTTGGCGCT GGTCGAAACC CATTGGGGCC AATGTCATGG CCTGATTCAC GCAGCGGGCA GTGCCGCCCA AATTGCTTGT ACTGACACCA CCCAAGCAAC TTGGGATGCG CTGATGGCGG CCAAGCTCCA GGGAACCCAA AACCTTGCCA TGATGGTGCG GCCTTGGAAA CTCGATTGGG 
CGGTGGTGAT GTCGTCGTTG GCCAGTGATT TGGGTGGGCT GGGCTTTGCC TGCTATGCCA GCGCCAACAT TGCCCAACAT GCCATCGTGC ATCAACTCAA TCAATCAAGC GCTGTACCGT GGTATTGCAT CAACTGGGAT GGTTGGCAAA GCGCCGATGA AGCCGATCCT CAACGCTTGA GCTTTGAACA AGGCTGGGCT GCGCTAAGTG CGATTGTCCA ACAACGCGGC GTACTCAACC CGCAAGTAGT TGTGGGTGAT GTTGCTGCAC GTCGTCAGCG CTGGCTCTAT CCACAACCGA CAGTTGCTGC GCCAATCACG ACGCATCGTC CACGTTCAGC CAATTTCGAG GCTCCACGCA GCCAACTTGA ACAGCAACTT GCGGCAATTT GGCACGATTT ACTCGGCGTA AACGAACTTA GCATTCATGA TAATTTCTTC GATTTGGGCG GCCATTCGTT GTTAGCAACC CAAGTGCTTG GCCGAATTCG TCAGCAACTT CAACTTGATC TACCGCTACA GCTGCTTTTT GAAGCGCCAA CCCTTCAAGG CTTGGCCGAG CATATTCGGC AGCAGCAGCT TAGCCAAACG TTACAAGCCC AAGCAGTTGG CGGTGGCGAG CGTGAGGAGA TTGAGTTATG A
|
Protein sequence | MHAYTDTDIA IIGMAGRFPG ANNLTTFWHN ICQGIVCITD LDLRQLQASG IDQALLNDPA YVKRAPLLLD DIAGFDHNFW GITPNEAALL DPQQRMLLEC AWEALEQAGY ALKQPQKVGV FVGVGANNYL LQQVLANPSA IERNGDYAVM LANDKDYAPT RISFKLNLLG PSVSVQTACS TSLVATHMAI QSILNGESAM ALAGGAALRI PQMQGHLYQA GMIHSPDGNC KPFTSEAAGT IGSSGAGMVL LKRAIDAVAD GDPIWAVIKG SAINNDGAQK VGFTAPSIKG QAQVVAEALA LAEVEPSMVG YLETHGTATN LGDPIELAAL KKVFGATAAQ AWCALGTLKA NIGHLDNAAG VAGLIKTALA LHHRQRPPTA YATIPHSNLQ QSPFYLNPTA SAWDANLYAG VSSFGIGGTN AHVVLAAAEP IAKQAQAESW QLLPISAASV WSLEQQTERL AAHLQTNPAQ SLADVAYTLQ TARQAFAQRS FVVAKSHEQA SQALLTSPIR EFASQPPKVA WLFSGQGSQC AGMAAELYQQ APAFRQAIDL VNRYAQPLLG YDLRQMMFDH SGDLTATNVA QPCLFMLEYA LAQQWLAWGI QPQALFGHSI GEYVAACVAN VLDLPTAVRL VVVRGQLMNQ LPSGAMLSIN ASRDQIQPIL PAELDLAAIN TNQLLVVAGE HVAIQAFQQQ LNERGIESRI LHTSHAFHSR AMQPMLAAFR QHFADVQLQA PTIPILSNVS GTWLTAVQAT SVDYWLEQIV NPVQCAACLA QLLADDAWIV QELGPGHTLA TFARASQAAQ TPLILTSLPH PQVKQADLAV MLQSLGQLWQ AGMTVLWQAL HQTSCHKVWL PTYAFDHQRH WIEAEQAHAP RASNQISINT PTWQAQSLAT TPIQASTWLV WGSQPSTIER LLEQFAADQQ LYICNQDTFA QQSKQLNSTT SQIIYFSNFS SLSAWSAAAD LLTIAKRVRE LGLAQVHLHV VSSQSLAIAP HEDLNPHDAA LRGVAMVLAQ EYPEISSHWL DLDFTHQAWP SMLAQELQSA SEPVVALRGF NRFVPQARVA SLPTNQQGFL VGGCYLITGG NGGLGRQIAL QLAQQYQAKI AILSRSLVPE SPAAYDLQQQ ITERGGEALI IQADVAQPKQ LAEALALVET HWGQCHGLIH AAGSAAQIAC TDTTQATWDA LMAAKLQGTQ NLAMMVRPWK LDWAVVMSSL ASDLGGLGFA CYASANIAQH AIVHQLNQSS AVPWYCINWD GWQSADEADP QRLSFEQGWA ALSAIVQQRG VLNPQVVVGD VAARRQRWLY PQPTVAAPIT THRPRSANFE APRSQLEQQL AAIWHDLLGV NELSIHDNFF DLGGHSLLAT QVLGRIRQQL QLDLPLQLLF EAPTLQGLAE HIRQQQLSQT LQAQAVGGGE REEIEL
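The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It uses only the Python standard library and the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the two short fragments are the first and last blocks of the gene sequence as printed in this record.

```python
from itertools import product

# Compact encoding of the codon table, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.
    Raises KeyError on codons containing characters other than A, C, G, T."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGCACGCCT ATACCGACAC TGATATTGCA"  # first 30 bases of the gene
tail = "CGTGAGGAGA TTGAGTTATG A"           # last 21 bases, ending in TGA

print(translate(head))  # MHAYTDTDIA
print(translate(tail))  # REEIEL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` applied to it can be compared against the GC content field. Both checks depend on the sequence having been copied without transcription errors.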