Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0910 |
Symbol | |
ID | 5732811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1043386 |
End bp | 1046580 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278042 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001543686 |
Protein GI | 159897439 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCTGCG CGACGAATAA GTTTTATGCT AAACTTAGGG TTGTGCAATA TGCAATTGAT TACCCCCAAC TGAATGATGA TTTGATTCAA CGCCCACGCC TTCAAGCTCA ATTGGAGGCG TGTTGGTCAT GTCGCTTGAC GATGGTCGTG GCGGCGGCTG GCTATGGCAA AACCACTGCG CTAGCCCAGT GGATGCAGCA AACGCTGGGT GGTGATTGGC TGTGGTATGG CCTGCACAGC CCAACCGAGG CTGATCAGCT TCAGCGTTTG TTGAGCAGCC TAACCAAAGC CCGCTTCCCT CAAATCAATC CAATCAATAG CTTGGCCGAT TTGTTTGATT GCTGGGCCAC CTTGGCTCCG CGCCGAATTG CCTTGGTGTT TGATGATTGT CAACATTTGC AGGCTAGTGC TTGGCGTTTG TTGAATGAAT TGGCCCGTTT TGCGCCAACC AACCTGCATC TTATTTTGAG CAGCCGCCAA TTGCCACAGC TTGATTGGGC TGGCTTGGCG GCACGGGGGC AGTATCGCCA ATTAACTGCC GCTGAATTGC GATTTAGCCA AGCCGAATTA CAGGAGCTAT TGCCTACCCA AAGCCCGAGC CAACGCCAAG CAGCTTGGCA AGCAACCAAT GGCTGGCCTG CCGGTTTGCG CTTGTGGCGA GCCTTGCCCT CAGGCTATCA AGGCACGGTC AGCGATTTTC TTGCCCAAGA TGTGTTGGGG CAACTGCCTG AGGCGGTGCG GCGCACTGCC CAATTTGCCG CGCTCTTGCC CTTTTTTAAT CAAGCGGTGC TCACAGCGAT GGCCGCGCCT GATCCAGCTA GTTTGTGGCA TTACAATCTT TTTGTACTGC CTGATCACGC CGAATGGTGG CGCTTTGAGC CATTTTGGTT GGAAGTGCTG CGCCAACAAC CGTTGGCTGA AATCACCCAA TTGTTGGCCC AAGCGGCCAC ATGGTTTGAG CAGCAGGGGC ATCTGGCGGC AGCCTTGGAT GTTTGGTGTC GTTTGGGACA ATGGGCGCAG GTGGCCGAGC AGCTTAAGCA CCATGGTTTG CAGCTGTTGG CTCAACATCA GCCAGTCTTG CAATGGCTCC AACAACTGCC AGCAACCGAG CGCCAAACTG CCGAATTGCT GCATCTGGAT GGTCTGGCCT TGCGCGAACA CGATCCTGGG TTGGCCGCCC AAACCTTGGC GCAAGCGGCT CAACGCTATC GAGCTGAGCA ACGTTATGCC GAGGCTTTTC AGGTAGTTGG CGAACAATGT TTGATCTATT TTTGGCAGGG CGATGAGCAA GCCTTGATTG CGGTGGCGCG TGAAAGTTTT AGCCTCAAAA GCTTCGTTTG GTATCGCCAA CGCCGTGATT TGTTGAAATT TCCCTTGCTC TTGTTTCAAA TTAAGCGTGG CCGCTATATC AAGGCCTTGG CGACGGCTCA AAGTTTAGCC CAAAGCGAAT TGCCCTTCTT TTGGCGTTGG GTGGCGGCCT GCGTCGTTGG CGGTTTGTAT ACGATTTTGA CCTTGCCGAA TGAGGGCATT CAATGGCTCG AAGTTTGGTT GCAACACCCC CAAGTTCAAG CCGAACCAGC TATGCGCATG AGTTTGCTCG ATTTGTTGGC AACCTGCTTG ATGAGCCGTG CCGCGCCCGA CGATCGCATC GCTGCCCAAG ATTTAGCTGA TCAAGCCAGC CGTTTAAGCG AACGCTATGG CGTGCGTTTG ACTCGTTTGC AAACTCGCGG GACGAAACTC GGTTTGGCCT TGTTGGAAGC TGATCGGCCA ACGACTGAGC GTTTGATTCA GCAATTGCTC TTGCCCAGCG ATGAGCCATT GCCCTCGATT ATGCGCAATC GCTTGCTTGC ACTACGCGCC TTTGCTTGGG CTAGTTTGGG CGAAATTAAG CTGGCGCAGC ATGATGCAAC GCTCAGCGAA CGCGGCTTGA TTCGCGATGA TGCGCTGTAT GGAGCTGATC CACGCATGTG GTTGATGTTG GCGCAAGCCT GGTTTTGCTG TGGCGAATAT CAACGGGCCT TGGCGGCTTT GGAGCGTGCC AAACCGCTGA TCGAGGCCGC TCAATCGCCG ATTTTACGCT TGCGAGCAGG CTTGGTGCAG CTGGCTAGTC GTTGGCAACT TCAGCCTAGC CCGCAGTTAA TTGCTGAAGC GACCACGATT TGGCGCGATT ACTTATCCGA TGGCGACCGC CATATGAACG CAACGCCGCT GCATTTAACC ACGCTGTTGG TAGAATTGGG CTTGCGCACG GGAATCGCGC CACAGCGAAT CGCCCAATTT TTAGCTGAGC GTGATCGAGC GGCTTTAGAA GAGATTTGTT GGAAACTGTA TGCCGAGCAG CCGCAACAAC AAGCCGCGCT CTTGCAACTT TTGGGCTTAT ATGGCTCAGC AGCCAGCCTT GAGCGTTTGC AAGAAGTGAT TAAACAAGCC TCAACCCAGC AGCGCAAAAT TGCCCAACAC AGCTTAACTA GCATTCGCCA GCGCCCAGCC TATGCGCTGA AAATCCAGCT TTTTGGTAGT TTGCAACTTT GGCGTGGCAC GGAATTGGTC GATCCTAACG AGTGGTCACG CGAGAAGGCA CGCCAACTAT TGGCCTTATT GGTGCTGCAA CGCCCAAGAA TTATCAGCCG TGAAGCCTTG ATCGAGCACT TTTGGCCTGA TCTTACGCCG CAAGCCGCTG ATGGAGCCTT GCGCGTGACC TTGAATGCCC TGTTGCATGT GCTAGAGCCG GAACGCAGCG GCGGCGCTAA TTCGGCCTTT GTCTTGAGCG AAGCAGCGGG CTTGCGTTTA AATCCACAGG CACAGATCGA TACTGATTAT GCTGAATTTC AAACCTTGCT CCAAACTGCC GCCAAGCAAC GCCAGCAGGG AGCCATGCCC GCCGCTTTGC AAGCCTATCA AGCAGCCTTG GCCTTATATC AGGACGATTT ATTAAGCGAT ATTGCTTACG CCGAATGGGT GCTGGATTGG CGTGAGCAAG CTTTGAGCCA ATTTGTGGCT GCAACCAGCG ATTATTTGGA ATTATTGTTG GCTTATGGCC CGACTGCTGA GGCCATTCCC TATGCTGAGC GTTTGTTGAG CTATGACCCC TATCACGAAC CAACTTACTT GCGCCTGATC GACATCTACC ACATGTTGGG CAATGCTAGT GCTGCTGAGC GCATTCGAAA ACGCCTCGAA CGTATTTCCT TATAA
|
Protein sequence | MCCATNKFYA KLRVVQYAID YPQLNDDLIQ RPRLQAQLEA CWSCRLTMVV AAAGYGKTTA LAQWMQQTLG GDWLWYGLHS PTEADQLQRL LSSLTKARFP QINPINSLAD LFDCWATLAP RRIALVFDDC QHLQASAWRL LNELARFAPT NLHLILSSRQ LPQLDWAGLA ARGQYRQLTA AELRFSQAEL QELLPTQSPS QRQAAWQATN GWPAGLRLWR ALPSGYQGTV SDFLAQDVLG QLPEAVRRTA QFAALLPFFN QAVLTAMAAP DPASLWHYNL FVLPDHAEWW RFEPFWLEVL RQQPLAEITQ LLAQAATWFE QQGHLAAALD VWCRLGQWAQ VAEQLKHHGL QLLAQHQPVL QWLQQLPATE RQTAELLHLD GLALREHDPG LAAQTLAQAA QRYRAEQRYA EAFQVVGEQC LIYFWQGDEQ ALIAVARESF SLKSFVWYRQ RRDLLKFPLL LFQIKRGRYI KALATAQSLA QSELPFFWRW VAACVVGGLY TILTLPNEGI QWLEVWLQHP QVQAEPAMRM SLLDLLATCL MSRAAPDDRI AAQDLADQAS RLSERYGVRL TRLQTRGTKL GLALLEADRP TTERLIQQLL LPSDEPLPSI MRNRLLALRA FAWASLGEIK LAQHDATLSE RGLIRDDALY GADPRMWLML AQAWFCCGEY QRALAALERA KPLIEAAQSP ILRLRAGLVQ LASRWQLQPS PQLIAEATTI WRDYLSDGDR HMNATPLHLT TLLVELGLRT GIAPQRIAQF LAERDRAALE EICWKLYAEQ PQQQAALLQL LGLYGSAASL ERLQEVIKQA STQQRKIAQH SLTSIRQRPA YALKIQLFGS LQLWRGTELV DPNEWSREKA RQLLALLVLQ RPRIISREAL IEHFWPDLTP QAADGALRVT LNALLHVLEP ERSGGANSAF VLSEAAGLRL NPQAQIDTDY AEFQTLLQTA AKQRQQGAMP AALQAYQAAL ALYQDDLLSD IAYAEWVLDW REQALSQFVA ATSDYLELLL AYGPTAEAIP YAERLLSYDP YHEPTYLRLI DIYHMLGNAS AAERIRKRLE RISL
|
| |