Gene Haur_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4109 
Symbol 
ID5735970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5246700 
End bp5248133 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content55% 
IMG OID641281263 
ProductPucR family transcriptional regulator 
Protein accessionYP_001546869 
Protein GI159900622 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.222567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAACTC TATATGAAAT TTGGCGCTTG GCCTTACCAC CAACAACCAG CTTGCGAGCT 
GGCGAGGCCA ACACCCTTGC AGTGCGGGCA GTTGTGCTGG CACGTCCGAC CCAACCAGCG
CTGCCCGACC TTGCTGGCTC CGAAGTTGTG CTCGTTAGTA CAACCGTCTT GGATTCGTTG
CGGCTTTCGT TGGCTCGCTT GATTGAGCGC TTGAACGGGA CTTCGGTGCT GGCGGTTGGC
CTCACCGGAA TGGTTGATGA ACGAGCAGTA GCGGCGGCGG AAACCGCTAA TATTACCTTG
TTTGAATTGC CACATAACGC CGATTTGCGC ATGGTACAAC GTGAAAGCGA GCGCTTACTT
TCCGATCCCG AGGCACAATA TGAGCGTCGT GCGGCGCAAC TTTATAGTGC TCTAACCACG
AATGGCCTGA GTGAAGGCCG CACAACGCTT TTACGCATGC TTGAACTCTG GACTGGCCAT
AGTGTGGTTT TTCCGGCTGA TGCGGGGATG CCCACCACCG TACCAGTGCT GCTTGATGGC
CATCGCGTTG GTTTTTTGGG CAGTATTGGC AGCCATCCGT GGGATGCAGG GGCGCTCGAA
CAAGGCTCAG CCGCATTATC GTTGCTGCTC GATAAAGAAC GGGCAATCGA AGCCACCGAG
GATCGTTTGC GCGGCAGCGT GCTTGAATCG TTGCTGGCAG GGATTCCCTT GGATGTTCCT
GGGCAACGGC GGGCAGCGGA GCAAGGCATT TTGCTCGATT CAGCCTATGC CCTAGCTGCT
TTACGCCCGC AAGATTCCTT GCAGATCGAT CGGGTGATGG CGGCAGTGCG CCGAGCCTGC
GATCGCTTGC GCTATCCAGC GTTTATTGCT GATCACGATG GAATTATTGT GCTGGCCATG
CCGATCGATA GCCTTGATAA TCCTGAGCAG CGTTTGCGTG AAGTGCATAG TGCTTTGCAT
GAAGCCAGTT GGGTACTTGA TGGCGGCTTT GGCATTGCTT CGGAAAACGG TGCATGGTCG
GGGGCTTGGG CCGAGGCAAT TGGTGCATTA CGGTTGGGCC GCGAATTACT AGGGGCAGGC
GTGTTGGCTG GCGGAGCCGA ATTAGGCGTT TATCGGCTAC TACTGAGTGT GGCAGACTCA
GCTCGTGCTA GAATGTTTTA TGATCGGACG ATTGGCCCAT TAGCTGCCCA CGATGCCAAA
CAAGATGGCG ACCTGTTGTA CACCCTACAA ATGTTCTTTG CCTATCTTGG CAACCATAGT
CAGGCCGCAG CAGCGCTGCA TATTCACCGT AATACCCTCC TCTATCGGCT CGGTCGAATC
GAAAATATTA CATCGCATCA TCTCGACCGT GCGCCTGATC GGTTGGCATT ACAGTTGGGT
TTAGCCCTTC ATCGCATCTA TCAGAGCCAA AAGCCTGATC TAAAAAAGGC GTAG
 
Protein sequence
MATLYEIWRL ALPPTTSLRA GEANTLAVRA VVLARPTQPA LPDLAGSEVV LVSTTVLDSL 
RLSLARLIER LNGTSVLAVG LTGMVDERAV AAAETANITL FELPHNADLR MVQRESERLL
SDPEAQYERR AAQLYSALTT NGLSEGRTTL LRMLELWTGH SVVFPADAGM PTTVPVLLDG
HRVGFLGSIG SHPWDAGALE QGSAALSLLL DKERAIEATE DRLRGSVLES LLAGIPLDVP
GQRRAAEQGI LLDSAYALAA LRPQDSLQID RVMAAVRRAC DRLRYPAFIA DHDGIIVLAM
PIDSLDNPEQ RLREVHSALH EASWVLDGGF GIASENGAWS GAWAEAIGAL RLGRELLGAG
VLAGGAELGV YRLLLSVADS ARARMFYDRT IGPLAAHDAK QDGDLLYTLQ MFFAYLGNHS
QAAAALHIHR NTLLYRLGRI ENITSHHLDR APDRLALQLG LALHRIYQSQ KPDLKKA