Gene Information |
Locus tag | Haur_4109 |
Symbol | |
ID | 5735970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5246700 |
End bp | 5248133 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281263 |
Product | PucR family transcriptional regulator |
Protein accession | YP_001546869 |
Protein GI | 159900622 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [T] Signal transduction mechanisms |
COG ID | [COG2508] Regulator of polyketide synthase expression |
TIGRFAM ID | |
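The coordinate and length fields in this record agree with one another, and a short check can confirm that. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_4109.
length = gene_length_bp(5246700, 5248133)
print(length, protein_length_aa(length))
```

With the record's values, the computed lengths match the "Gene Length" (1434 bp) and "Protein Length" (477 aa) fields.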
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.222567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAACTC TATATGAAAT TTGGCGCTTG GCCTTACCAC CAACAACCAG CTTGCGAGCT GGCGAGGCCA ACACCCTTGC AGTGCGGGCA GTTGTGCTGG CACGTCCGAC CCAACCAGCG CTGCCCGACC TTGCTGGCTC CGAAGTTGTG CTCGTTAGTA CAACCGTCTT GGATTCGTTG CGGCTTTCGT TGGCTCGCTT GATTGAGCGC TTGAACGGGA CTTCGGTGCT GGCGGTTGGC CTCACCGGAA TGGTTGATGA ACGAGCAGTA GCGGCGGCGG AAACCGCTAA TATTACCTTG TTTGAATTGC CACATAACGC CGATTTGCGC ATGGTACAAC GTGAAAGCGA GCGCTTACTT TCCGATCCCG AGGCACAATA TGAGCGTCGT GCGGCGCAAC TTTATAGTGC TCTAACCACG AATGGCCTGA GTGAAGGCCG CACAACGCTT TTACGCATGC TTGAACTCTG GACTGGCCAT AGTGTGGTTT TTCCGGCTGA TGCGGGGATG CCCACCACCG TACCAGTGCT GCTTGATGGC CATCGCGTTG GTTTTTTGGG CAGTATTGGC AGCCATCCGT GGGATGCAGG GGCGCTCGAA CAAGGCTCAG CCGCATTATC GTTGCTGCTC GATAAAGAAC GGGCAATCGA AGCCACCGAG GATCGTTTGC GCGGCAGCGT GCTTGAATCG TTGCTGGCAG GGATTCCCTT GGATGTTCCT GGGCAACGGC GGGCAGCGGA GCAAGGCATT TTGCTCGATT CAGCCTATGC CCTAGCTGCT TTACGCCCGC AAGATTCCTT GCAGATCGAT CGGGTGATGG CGGCAGTGCG CCGAGCCTGC GATCGCTTGC GCTATCCAGC GTTTATTGCT GATCACGATG GAATTATTGT GCTGGCCATG CCGATCGATA GCCTTGATAA TCCTGAGCAG CGTTTGCGTG AAGTGCATAG TGCTTTGCAT GAAGCCAGTT GGGTACTTGA TGGCGGCTTT GGCATTGCTT CGGAAAACGG TGCATGGTCG GGGGCTTGGG CCGAGGCAAT TGGTGCATTA CGGTTGGGCC GCGAATTACT AGGGGCAGGC GTGTTGGCTG GCGGAGCCGA ATTAGGCGTT TATCGGCTAC TACTGAGTGT GGCAGACTCA GCTCGTGCTA GAATGTTTTA TGATCGGACG ATTGGCCCAT TAGCTGCCCA CGATGCCAAA CAAGATGGCG ACCTGTTGTA CACCCTACAA ATGTTCTTTG CCTATCTTGG CAACCATAGT CAGGCCGCAG CAGCGCTGCA TATTCACCGT AATACCCTCC TCTATCGGCT CGGTCGAATC GAAAATATTA CATCGCATCA TCTCGACCGT GCGCCTGATC GGTTGGCATT ACAGTTGGGT TTAGCCCTTC ATCGCATCTA TCAGAGCCAA AAGCCTGATC TAAAAAAGGC GTAG
|
Protein sequence | MATLYEIWRL ALPPTTSLRA GEANTLAVRA VVLARPTQPA LPDLAGSEVV LVSTTVLDSL RLSLARLIER LNGTSVLAVG LTGMVDERAV AAAETANITL FELPHNADLR MVQRESERLL SDPEAQYERR AAQLYSALTT NGLSEGRTTL LRMLELWTGH SVVFPADAGM PTTVPVLLDG HRVGFLGSIG SHPWDAGALE QGSAALSLLL DKERAIEATE DRLRGSVLES LLAGIPLDVP GQRRAAEQGI LLDSAYALAA LRPQDSLQID RVMAAVRRAC DRLRYPAFIA DHDGIIVLAM PIDSLDNPEQ RLREVHSALH EASWVLDGGF GIASENGAWS GAWAEAIGAL RLGRELLGAG VLAGGAELGV YRLLLSVADS ARARMFYDRT IGPLAAHDAK QDGDLLYTLQ MFFAYLGNHS QAAAALHIHR NTLLYRLGRI ENITSHHLDR APDRLALQLG LALHRIYQSQ KPDLKKA
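The gene sequence begins with GTG, while the protein begins with M. That is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first seven codons only. It assumes the gene sequence is given in the coding orientation (the record lists the gene on the minus strand), and it treats only ATG, GTG and TTG as start codons, which is a simplification. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M
    and stopping at the first stop codon."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("GTGGCAACTC TATATGAAAT T"))
```

The result for this prefix is MATLYEI, which matches the start of the listed protein sequence.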