Gene Information |
Locus tag | Haur_1573 |
Symbol | |
ID | 5733460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1824589 |
End bp | 1826274 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278712 |
Product | thioesterase |
Protein accession | YP_001544344 |
Protein GI | 159898097 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
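
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1824589
END_BP = 1826274
GENE_LENGTH_BP = 1686
PROTEIN_LENGTH_AA = 561


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (1686 bp) and protein length (561 aa) are consistent with the listed start and end positions.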
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATTT ACAATAGCCA ACTCTTAATT CTGAACGATG GTCTACAGCC AGTTCCCGAT GGATGTATTG GCCAAATTTG GATTAGTGGG GCTGGAGTTG GCAATGGCTA TACTGGCCAG CCAGGGTTGA CGGCTGAGCG TTTTCGGCCT AACCCCTTTG ATTATATCCC TGGCTCGCGG ATGTATTGCA GCGGTGATCT GGGTCGGCGT AACCAGAATG GGGCAATCGA ATTTCTCGGG CGTACTGATC ATCAGGTAAA AATTCGCGGC TTTCGGGTTG AATTAACCGA AATTGCACAG CGCTTACAGC AGCATCCGGC GGTTCGCGAG GCGCTTGTTA TGCGGCACGA GCATCCGCTG CGGGGCGAAT ATCTTGTTGC ATATGTTGTG CTGATCCAGC CGGCTCAAGC CGATGTTGGT GCGTTGCGGC GCTATCTTGC AAAGCATAGT CCACCCTATC TCGTTCCAGC TGAAATTGTG CTACTGGAGG CATTTCCGCT TAGCCCAAAT GGCAAGATCG ATCGGCAGGC GTTGCCGCAT CCCACGGCGA ATGTTCATGC TGGTGAGGCT CGCCAACCCG CTGCTCGCTC ATTTGACCCA CTTGAGTTTC AAATACGCAC AATCTGGGAA GAGACGCTTG GCATTGAACC GATTGGGGCA CAAGCGAATT TTTTTGAGCT TGGTGGGCAT TCGTTGCTTG CGATTGCATT GATGGCACGG ATCGAAGAAC GGCTTGGTAA ACGCTTGCCA ATTACCATCC TCTTTGATGC CCCCACGATC GAGCAAATGG TCGGCTTGCT TCGGCAGGAG GGCTATGCGC CGCGATGGTC GCCGATGGTT TTGATGCAAG CAGGGCGTAA GGGGTTTCCT GCACTATTTT GTGTCCATGC ACTTAGTGGC AGCGCCTTTG CCTACACAGT ATTGCCCAAA CATATCTCAC CCGACCAACC ATTTTATGGG ATTGAATCGC GCGGTCGCGA TATTAACCAA GCCCCTGATC GCTCGATTGA GGCCATGGCG GCCTATTATA TTGAGCATAT GCGGCTGATT CAACCTAGTG GACCATATTG TATTGCTGGA TGGTCATTAG GTGGGCCAGT TGCCTTTGAA ATGGCTCAGC AATTATATCG TGCTGGTGAA ACAGTGGCGT TGCTGGCGAT TGTTGATACG GGAGCGCCCT TGGCTGGGCG ATTACCAATT GAACCAGCTG CCATTGATGA TCTTTCTTTA TTGGTGCCAC GCTTACGCCA TTTTGGAATT AATCTCGATT TGGACTTCAT TCAAGCACTT CCGGCAGATC AGCAGCTTCC TTATATTATG CAACAAGCAG TGAGTGGTGG TTTTTGGTCG GCAAACATTA CGCTTGCCGA AATTAAGCGT AAAGGTGAGC TGATTCGCAC AAATTTAGCC GCACGACGCA ACTATAGCGC CCAGCCCTAC CCTGGGACAA TCAACCTCTT CCGCACCAAA CAACATCCAG GTTATAGCGA CGACGAAGTT ACGCTTGGTT GGGATCAATT GGCTTTAACG GGAGTTAAGG TGTGGGAAAT CCCTGGTGAT CACCTCTCAA TTTTTCATAG TCCTTATGTT GAGGTGTTGG GGCACGCATT ATGGGAGTGT CTTCAGGGTT GTACGGATCA GCTAGACGAA CTCGATTACG CGCATGAACG ATATCGAGCG CAATGA
|
Protein sequence | MPIYNSQLLI LNDGLQPVPD GCIGQIWISG AGVGNGYTGQ PGLTAERFRP NPFDYIPGSR MYCSGDLGRR NQNGAIEFLG RTDHQVKIRG FRVELTEIAQ RLQQHPAVRE ALVMRHEHPL RGEYLVAYVV LIQPAQADVG ALRRYLAKHS PPYLVPAEIV LLEAFPLSPN GKIDRQALPH PTANVHAGEA RQPAARSFDP LEFQIRTIWE ETLGIEPIGA QANFFELGGH SLLAIALMAR IEERLGKRLP ITILFDAPTI EQMVGLLRQE GYAPRWSPMV LMQAGRKGFP ALFCVHALSG SAFAYTVLPK HISPDQPFYG IESRGRDINQ APDRSIEAMA AYYIEHMRLI QPSGPYCIAG WSLGGPVAFE MAQQLYRAGE TVALLAIVDT GAPLAGRLPI EPAAIDDLSL LVPRLRHFGI NLDLDFIQAL PADQQLPYIM QQAVSGGFWS ANITLAEIKR KGELIRTNLA ARRNYSAQPY PGTINLFRTK QHPGYSDDEV TLGWDQLALT GVKVWEIPGD HLSIFHSPYV EVLGHALWEC LQGCTDQLDE LDYAHERYRA Q