Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2768 |
Symbol | |
ID | 5734649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3522599 |
End bp | 3525751 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279911 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001545534 |
Protein GI | 159899287 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATGTCA CGCATCAGCC TGATGATTCG GCTCAATCGG TGCCAAGTTT ACGGATTCAC GTTTTTGGCG GCTTGCGGAT CTTTATCGAC AATCAACGCA TTAGTGATCT AGCAACTCGC AAAGTGGAAG CACTGCTGAT CTACCTTGTC GTCAATCCTT ACCCCCACGA ACGTGAGGTT TTGGCACAAC TGTTATGGAA TGATTTATCG GCAGAACGCA CGGCGGGCAA CTTACGCCTC ACACTCAATC AATTACGCAA AGTGATGGAC CCGTTTATTG AGGTCACGCG CCATACGATC GGGTTGCATC CCCAAGCCCG TTATTGGCTC GACCTTCAGC ATTGCACCCA AATTTTTGAG AACCCAGCTA GCCCAACCAA CGAATTAGCC AGCGCGATCG AACATTATCA AGGCGATTTT CTCAATGGGT TTTATCTCCG CGATGCCGAT GGCTTCAGCA GCTGGCAACT CCAACAGGTA GAATATTGGC GACAACAAGC GCTGGTCGCG ATTCGCCGCC TGATCGAACG CTATACCGTG GTAAGCAACT ACAACGAGGC AATTCGTTGG CTTCAGCGCT TGGTGACGCT TGATGAATGT GATGAGGTCG CCCATCGCCA GTTGATGTTG CTGTTGATGC GCACCGGCCA ACGCCACACT GCCCTGCGCC AATATCGGCT GCTCGAACAA GTTTTAGAGC GTGAGTATGG ATTAAAACCC GAGCCAGCGA GTGAGGCCTT GCAGCGCCAA ATTATGGCAA CACCCGCCCA ACGTCCGCAT CGGTTGATTC GGCCAACTCC AATGCTCTAT GGTCGCCAAA CTGAGCTAGA ACGCATGACC CATTGGCTGG CGGCTGAACG TAACCAACTG TTGACGATTA TGGGCGCTGG CGGGAGCGGC AAAACCCAAT TAGCCCTGAC CTTTGGCTGG AAGGTGGTCA ACGAATATTT AGGGGCTTCG AGCAACGGAG TTTTTTATAT TTCCTTGGTC AGTGCCGATC AACAGCCGCG CCTGTTGGAT GCAGAACCTG TGTTGTTGGC AATTGTGCAG ACCTTAAACC TACCACCACC ACGCACCAAC GATCTGGTTG AACACCTGAT TCTCCAGTTG CAACAGCATG AATTGATCAT CATCATTGAT AATGGCGAAT TGTTGGCGAC TAGCGCTCGC TTAGCCCTCA GCAGTCTGAT TCAACATATT CCGCAATTGC GCTTAATTAT TGGCTCGCGC GAGCGCATGC GCCTGCAAAA CGAATATGTA TTGGAATTAG CAGGCTTGGC CTATCCCCAG ATTAATGATG ATTCAGGCTA TTCGCCGCTG TTGGCCGAGC AACTTCAGCA TTTTGCAGCG GTCGATTTAT TTGTGCATTG TTTGCAGCGG CAAGGCAAAT CGGGCGATTT GGCCGATTAT AGCCACGCTG ATCGTCAAGC GATTGGCCAA ATTTGCCAGA TGGTCCATGG CCTGCCGTTG GCAATTGAAT TGATCGCGCC ATGGATGACC ATTCGTAGTG GCCATGAAAT TGTCGAGGCA CTCAGCAATG ACATGCAGCT TTTTCATAGC GATGTGGTCG ATATTCCAAC CCGCCATCGC AGCATCCAGG CCGCCTTTGA TTATTCGTGG CAATTGCTTG ATCCGCATGA ACAGGCTTGT CTAGCACGCT TGGCGGTATT TCCTAGTAGC TTTAATGCTG AAACCGCCAC AATCATTGCC GAGGTCGATT TGAACGTGCT GGCCAATTTA CGGGCCAAAT CGTTGCTCAC GCTTGAAATT TTGCAACAAC AAACCCGCTA TGCCTTACAC CCATTGCTGC ATCAATTTGC CCAAACCAAA CTCCAAGGCT TGGCCGCCGA TCAGCAAACG CTGTATCAAC GCCATGCCCG CTACTTTGGC CAATTTAGCC GTCAGCAAGA GCAATTAATT CATGGCAACG CCAGCCAACA AGGGTTAATG CTCTTAGAAC AAGAGCTTGA TAATATTCGG GGTGGTTGGC TCTGGGCGGT TCAAGCCCAA CAAATTCATA TTTTAGGCGA TTATTGTATT GCTTTACACG ATTTCTTTGC GATTCGCAAC CGCGAGATCG AAGGCCAACA GCTTTTTGCT CCAGCCGCAG CGTTATTAAG CACGCTTGAC CATCAGGATG TTGAGGCTGA GATCGTTTTA ATTGTAGTGC GAATTGTTTC GTGTTACGCT GAGTTTCATT ATATTTTGGG CGAATTAGCA CGTTCTGAGG CATTATTGCA ACAGTGTATC GATGTCTTGT ATCGAAGGCA ATTGCAGAAT GTGGCTGAAT TGTTGTTTAT TTATAAGCAA CTTGGGGTGA TTACCCAACG CCGAGGCGAA TATACCCGCG CCCTTGATCT ATTACAACGC TGTTTGGCTC AGGCTGAAAG CGTCAACGAT CCAATTAAAG TCAGTGATAC ATGGTTATCG ATCGGCGCGG TGTTGTTGGC ACAAGGCAAT TGGCAAGCCG CCGAACAAGC CTTCCAGACC TGTAGCGATC ACTATCAAGC CCAAAAACAT CTGTGGGGGT TATGCCATAG TCAACGCTTT TTGGGGGTGG TGGCGCTGGC ACAAGCCAAT TATAATCTGG CTCAGCATCA TTTTGAGGCC AGCCTTGATT TGGCCTATCA GCTCAAAACC CCGCTTGGCG AGGCATTAAT TCGCGATCAA ATGGGCATGT TGGCGCTGCG CCGCGAAGAT TTTGCCCAAA GTGCGGGCTA TTTGCACAAA GCCTTTGCCA TTTTTCAAGA ACTAGGGGTT GAGGCCATGA TTGGACGGGC GGCGATTCAT CTAGCCCAAC TCGAATTGGC GCAACGTCAT TTTCAATATG TGCAGCCATG GTTGGCGCGG GCGATTATGG TTGCTCAGCA ACGCCAAGAA ATCCCGCTAT TGCTCGAATG TTATGCGACA GTTTTGCAAT TTTGGGCGGC GGTTTGCGAT GGCGAGCCAA GCCATTGGTT CAAACTATGG CATAGCTTGA GCCAACATCC AGCCTGTAGT GCCGAAACCA AAAGTTTGCT CAATCGTTTA CAGTTGCACC AGCATTCCGC AGGCCAAAGC AGGCCGACGA TTGAGGTTGA ACTGTCAGGA ATTGAACGCT ATGTAGCCCT TTGCTTAAGC CAAATTGAGC AAAGCCAGAG TATTGCGGTT TAG
|
Protein sequence | MNVTHQPDDS AQSVPSLRIH VFGGLRIFID NQRISDLATR KVEALLIYLV VNPYPHEREV LAQLLWNDLS AERTAGNLRL TLNQLRKVMD PFIEVTRHTI GLHPQARYWL DLQHCTQIFE NPASPTNELA SAIEHYQGDF LNGFYLRDAD GFSSWQLQQV EYWRQQALVA IRRLIERYTV VSNYNEAIRW LQRLVTLDEC DEVAHRQLML LLMRTGQRHT ALRQYRLLEQ VLEREYGLKP EPASEALQRQ IMATPAQRPH RLIRPTPMLY GRQTELERMT HWLAAERNQL LTIMGAGGSG KTQLALTFGW KVVNEYLGAS SNGVFYISLV SADQQPRLLD AEPVLLAIVQ TLNLPPPRTN DLVEHLILQL QQHELIIIID NGELLATSAR LALSSLIQHI PQLRLIIGSR ERMRLQNEYV LELAGLAYPQ INDDSGYSPL LAEQLQHFAA VDLFVHCLQR QGKSGDLADY SHADRQAIGQ ICQMVHGLPL AIELIAPWMT IRSGHEIVEA LSNDMQLFHS DVVDIPTRHR SIQAAFDYSW QLLDPHEQAC LARLAVFPSS FNAETATIIA EVDLNVLANL RAKSLLTLEI LQQQTRYALH PLLHQFAQTK LQGLAADQQT LYQRHARYFG QFSRQQEQLI HGNASQQGLM LLEQELDNIR GGWLWAVQAQ QIHILGDYCI ALHDFFAIRN REIEGQQLFA PAAALLSTLD HQDVEAEIVL IVVRIVSCYA EFHYILGELA RSEALLQQCI DVLYRRQLQN VAELLFIYKQ LGVITQRRGE YTRALDLLQR CLAQAESVND PIKVSDTWLS IGAVLLAQGN WQAAEQAFQT CSDHYQAQKH LWGLCHSQRF LGVVALAQAN YNLAQHHFEA SLDLAYQLKT PLGEALIRDQ MGMLALRRED FAQSAGYLHK AFAIFQELGV EAMIGRAAIH LAQLELAQRH FQYVQPWLAR AIMVAQQRQE IPLLLECYAT VLQFWAAVCD GEPSHWFKLW HSLSQHPACS AETKSLLNRL QLHQHSAGQS RPTIEVELSG IERYVALCLS QIEQSQSIAV
|
| |