Gene Haur_1573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1573 
Symbol 
ID5733460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1824589 
End bp1826274 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content51% 
IMG OID641278712 
Productthioesterase 
Protein accessionYP_001544344 
Protein GI159898097 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATTT ACAATAGCCA ACTCTTAATT CTGAACGATG GTCTACAGCC AGTTCCCGAT 
GGATGTATTG GCCAAATTTG GATTAGTGGG GCTGGAGTTG GCAATGGCTA TACTGGCCAG
CCAGGGTTGA CGGCTGAGCG TTTTCGGCCT AACCCCTTTG ATTATATCCC TGGCTCGCGG
ATGTATTGCA GCGGTGATCT GGGTCGGCGT AACCAGAATG GGGCAATCGA ATTTCTCGGG
CGTACTGATC ATCAGGTAAA AATTCGCGGC TTTCGGGTTG AATTAACCGA AATTGCACAG
CGCTTACAGC AGCATCCGGC GGTTCGCGAG GCGCTTGTTA TGCGGCACGA GCATCCGCTG
CGGGGCGAAT ATCTTGTTGC ATATGTTGTG CTGATCCAGC CGGCTCAAGC CGATGTTGGT
GCGTTGCGGC GCTATCTTGC AAAGCATAGT CCACCCTATC TCGTTCCAGC TGAAATTGTG
CTACTGGAGG CATTTCCGCT TAGCCCAAAT GGCAAGATCG ATCGGCAGGC GTTGCCGCAT
CCCACGGCGA ATGTTCATGC TGGTGAGGCT CGCCAACCCG CTGCTCGCTC ATTTGACCCA
CTTGAGTTTC AAATACGCAC AATCTGGGAA GAGACGCTTG GCATTGAACC GATTGGGGCA
CAAGCGAATT TTTTTGAGCT TGGTGGGCAT TCGTTGCTTG CGATTGCATT GATGGCACGG
ATCGAAGAAC GGCTTGGTAA ACGCTTGCCA ATTACCATCC TCTTTGATGC CCCCACGATC
GAGCAAATGG TCGGCTTGCT TCGGCAGGAG GGCTATGCGC CGCGATGGTC GCCGATGGTT
TTGATGCAAG CAGGGCGTAA GGGGTTTCCT GCACTATTTT GTGTCCATGC ACTTAGTGGC
AGCGCCTTTG CCTACACAGT ATTGCCCAAA CATATCTCAC CCGACCAACC ATTTTATGGG
ATTGAATCGC GCGGTCGCGA TATTAACCAA GCCCCTGATC GCTCGATTGA GGCCATGGCG
GCCTATTATA TTGAGCATAT GCGGCTGATT CAACCTAGTG GACCATATTG TATTGCTGGA
TGGTCATTAG GTGGGCCAGT TGCCTTTGAA ATGGCTCAGC AATTATATCG TGCTGGTGAA
ACAGTGGCGT TGCTGGCGAT TGTTGATACG GGAGCGCCCT TGGCTGGGCG ATTACCAATT
GAACCAGCTG CCATTGATGA TCTTTCTTTA TTGGTGCCAC GCTTACGCCA TTTTGGAATT
AATCTCGATT TGGACTTCAT TCAAGCACTT CCGGCAGATC AGCAGCTTCC TTATATTATG
CAACAAGCAG TGAGTGGTGG TTTTTGGTCG GCAAACATTA CGCTTGCCGA AATTAAGCGT
AAAGGTGAGC TGATTCGCAC AAATTTAGCC GCACGACGCA ACTATAGCGC CCAGCCCTAC
CCTGGGACAA TCAACCTCTT CCGCACCAAA CAACATCCAG GTTATAGCGA CGACGAAGTT
ACGCTTGGTT GGGATCAATT GGCTTTAACG GGAGTTAAGG TGTGGGAAAT CCCTGGTGAT
CACCTCTCAA TTTTTCATAG TCCTTATGTT GAGGTGTTGG GGCACGCATT ATGGGAGTGT
CTTCAGGGTT GTACGGATCA GCTAGACGAA CTCGATTACG CGCATGAACG ATATCGAGCG
CAATGA
 
Protein sequence
MPIYNSQLLI LNDGLQPVPD GCIGQIWISG AGVGNGYTGQ PGLTAERFRP NPFDYIPGSR 
MYCSGDLGRR NQNGAIEFLG RTDHQVKIRG FRVELTEIAQ RLQQHPAVRE ALVMRHEHPL
RGEYLVAYVV LIQPAQADVG ALRRYLAKHS PPYLVPAEIV LLEAFPLSPN GKIDRQALPH
PTANVHAGEA RQPAARSFDP LEFQIRTIWE ETLGIEPIGA QANFFELGGH SLLAIALMAR
IEERLGKRLP ITILFDAPTI EQMVGLLRQE GYAPRWSPMV LMQAGRKGFP ALFCVHALSG
SAFAYTVLPK HISPDQPFYG IESRGRDINQ APDRSIEAMA AYYIEHMRLI QPSGPYCIAG
WSLGGPVAFE MAQQLYRAGE TVALLAIVDT GAPLAGRLPI EPAAIDDLSL LVPRLRHFGI
NLDLDFIQAL PADQQLPYIM QQAVSGGFWS ANITLAEIKR KGELIRTNLA ARRNYSAQPY
PGTINLFRTK QHPGYSDDEV TLGWDQLALT GVKVWEIPGD HLSIFHSPYV EVLGHALWEC
LQGCTDQLDE LDYAHERYRA Q