Gene Information |
Locus tag | Haur_1995 |
Symbol | |
ID | 5733884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2461972 |
End bp | 2465031 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279139 |
Product | ATP-dependent transcription regulator LuxR |
Protein accession | YP_001544766 |
Protein GI | 159898519 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.686742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTTCC GTGCGATTAG CCATCGAACC ACACCCATTT GTGATACTGA ATGGCTTGAA TATACGATCA ATGGTCGAAT CCAACGGGTA GCGGTTGAAT CAAGGGCGTG GTACGAATGG CTGCATGCTC CGGCGCACAC AGCATTCGCT TTTATGTGCA GTAGCGGAAC CTTTACCGCC CGCCGTGAAG CGCGTCGTGC CCGCGCATAT TGGTATGCCT ATCGTAAAAA GGCCGGAAAA ATAGCTAAGG TGTATCTCGG ATCAGCCGAG CAACTCACGC TCGAACGGCT GATCCGAGCG GCTGCAACGT TGGCCAAGCA GCCAAGCGTA CCCGTACCAA CTCCGTTGCT TCACACCAAG TTGAGCATTC CAGCAGTGCG ATCAAGGTTC GTTTCACGCC GATCGATCTT CACAGTTTTC AACCAAATGC TGCCGTTAAC ACTGGTAAGT GCCTCGGCTG GCTTTGGCAA AACGACGCTG ATCGCTGAAT GGGCGCGAAC ACAGCCGTAT CCGTTGGCCT GGCTTACGCT TGATTCAAAT GATAACGAGC CTAGTCGTTT CTGGGGCTAT AGCCTGACAG CACTGCATAT GGTTGCGCCA CTGCTCACAA CCGAGGCTTT GGCACTGCTC CATGCGCCAC AAGCAATTGA TCTATCATTT GTGCTAACGA ATCTGATTAA TAACCTAATG CTATTGAATC AGCCTCTTAC CCTCGTGCTC GATGACTATC ACACAATCAA CAATCAAGCG ATTCACAATC AGCTGACATG GCTGCTCGAT CATGCGCCGC AGGCGTTTCG GCTTGTGCTG ATCAGCCGTA CCGAGCCACC AATGCCACTC GCTCGTTGGC AGGCAGCAGG TCGATTGAAT ACGATTTTGG TAGATGAGTT ACGGTTCAAC AATTCCGATA TTCATAGGTT TTTCCACACC ACCATGCAGC TCGATCTGGC AGCTGATGTG CTCGCAAGCC TAGCGGCACG TACCGAGGGC TGGATCGCAG GGCTACAACT AGCGGCGCTC TCACTTCAAG GTCAGCCCGA GTTAACCATG AGTGAAGGAC TAGATTCCAG CGTTAGCAAT CAACGGGCGC TATTTGATTA TTTTTCGCAC GAGGTGCTCC AACAGCAACC GAGCGCAATC CAGCAATTTT TGCTGCAAAC GGCGATCCTT GATCAACTAT GCGAGCCACT CTGTGCCGCT GTGACCGATC ACGGTGCAAC AGCGGGAATG CTCGATTACC TTGAACGCTC GCATTTATTT GTGGTAGCAC TTGATCGCAA GCACCACTGG TATCGCTACC ATCAGCTTTT TCGCGAAAGC TTGCTGCACC ATGCCAAGCA GCAATGGGGT GCAGCAGGAA TCGCACAACT TCACAAGCGG GCTAGTTGTT GGTTTGAACA AGCGGGCTAT CCAGCAGAGG CGATCAACCA TGCGTTGGCC GCCGCCGATT TTGAACGGGC TGGGCGATTG ATTGCCAAAA TCGGTTTTCG CATGTTGTGG CGCGGTGAAC ATACAATCTT GAAGGGATGG TTACATGCGT TACCCGCCAC GATCATTGAG CATAATGCCT ATCTCTGTCT TTGGTCGGCA TGGCTTCTCG TCGAGCAAAA CCAGCTAGAA GCAAGTGGCT ATTATCTGAG CCTGATTGAC GAATTGCTCA GCCACACCAT GAGCGACGAA GCTGCCGAAA CCAGGGCGAT AGATGGTCAC CGCAAGGCAC TTCAGGCTAG TATCGCCCGT CGGCGGGGCG ATATGCCAAC CACGTTGGCA CTAACCCACC AAGCCTTAAA CGCGCTGCCA 
CGAGATAGTG CCTTGCTGCG CAGCATGATT ACGCGCAATC TCTGTGCTGG CTACATTATC AGTGGCGATA CGGTAGCAGC AGAGGCAGCA CTACACCAAG CGTTATGTGA ACAGGAATTA CTTGAGGCCT CCATCGCATC AGATCATCAT CACCCCAGCG AACGCCAGCA TGCTCATACA ATTCGGTTGC TCTTGTGGAT TGAGACAACT TCACTGCGTT GGCTCCAAGG CCAATTTCAT GCTGCTGCCG ATTTGTATCG CCAAACCTTG CATCTAGCCC GTGAACAGCA CCAACATGCC GTAAGCGCCA TCGCCTGCGT AAATTTAGGT CAGATTTTGC GGCAATGGAA CAACCTAGCC GAAGCCCGCG AGTATCTGCA ACAAGGTATT GGCTATAGCC TGAACGTTGG TGCGGATGTA ACCCGGCGCA ATGGGTTAAT TGAACTTGCT CGCATCCAAC AAGCCCATGG TGAACCAGCG CAAGCACTGG CAACGATGGC CCAAGCGGTT GCGCTTGCCC AGACCCTCCC TTCACCTCGT GGCTTGCTCT GGGCTACAAC CTGGCAAGCA CGGCTCCAAC TAGCACAGGG CGATCTAGCG GCAGCAACCC GCTGGGCGCA GGAATACCAA CGGCTAGCAA ATCCATTTCC GCAGTTTAAT ATATATGATG CCGAAGATTT GACGCTGGCG CGTATCCTGA TCGCCCAAGG TCAGCATCAG CAAGCCAGCG CCCTGCTTGA GCAACTGCTC CCTGCATATC AAGCCGCAGG ACGGCTTCCT AGTGTGATCG AAGTTTATCT GCTGCAAGCA CTCAATCTTG CCGCACAGCA GGATTGGTCA GTTGCGGGCA GGGTGCTCAT TCAAGCCCTA CGCTTGGCAG AACCAGAGAA TTATCTACGC CTATTTGTTG ATGAAGGCCC AGCACTTTCC AACCTGTTGG TGCAGATCGA ACCACAGGTG CAGGCAACGT TGCGCCAGTA TGTACAACGT TTATTGGTTG TTTGTGAACT GCCTAGAAGC ACCACGCCAG AGCAGCTTAG CCCAATCTAT CGCTTAATCG AGCCGCTCAG CGAACGCGAA CTTACGGTTA TACGATTGCT AGCGGCGGGT TTCTCGAATC AAGAGATTGC CCAGCAGCTC GTTGTAACGC TCAACACGAT CAAAACCCAT CTAAAGAATA TTTATAGCAA ATTGGCGGTC ACTAGCCGTA CCCAAGCAAT TGCTCGTGCC CGTAGGCTCA ACCTGATCGC CAATCCTTAA
|
Protein sequence | MVFRAISHRT TPICDTEWLE YTINGRIQRV AVESRAWYEW LHAPAHTAFA FMCSSGTFTA RREARRARAY WYAYRKKAGK IAKVYLGSAE QLTLERLIRA AATLAKQPSV PVPTPLLHTK LSIPAVRSRF VSRRSIFTVF NQMLPLTLVS ASAGFGKTTL IAEWARTQPY PLAWLTLDSN DNEPSRFWGY SLTALHMVAP LLTTEALALL HAPQAIDLSF VLTNLINNLM LLNQPLTLVL DDYHTINNQA IHNQLTWLLD HAPQAFRLVL ISRTEPPMPL ARWQAAGRLN TILVDELRFN NSDIHRFFHT TMQLDLAADV LASLAARTEG WIAGLQLAAL SLQGQPELTM SEGLDSSVSN QRALFDYFSH EVLQQQPSAI QQFLLQTAIL DQLCEPLCAA VTDHGATAGM LDYLERSHLF VVALDRKHHW YRYHQLFRES LLHHAKQQWG AAGIAQLHKR ASCWFEQAGY PAEAINHALA AADFERAGRL IAKIGFRMLW RGEHTILKGW LHALPATIIE HNAYLCLWSA WLLVEQNQLE ASGYYLSLID ELLSHTMSDE AAETRAIDGH RKALQASIAR RRGDMPTTLA LTHQALNALP RDSALLRSMI TRNLCAGYII SGDTVAAEAA LHQALCEQEL LEASIASDHH HPSERQHAHT IRLLLWIETT SLRWLQGQFH AAADLYRQTL HLAREQHQHA VSAIACVNLG QILRQWNNLA EAREYLQQGI GYSLNVGADV TRRNGLIELA RIQQAHGEPA QALATMAQAV ALAQTLPSPR GLLWATTWQA RLQLAQGDLA AATRWAQEYQ RLANPFPQFN IYDAEDLTLA RILIAQGQHQ QASALLEQLL PAYQAAGRLP SVIEVYLLQA LNLAAQQDWS VAGRVLIQAL RLAEPENYLR LFVDEGPALS NLLVQIEPQV QATLRQYVQR LLVVCELPRS TTPEQLSPIY RLIEPLSERE LTVIRLLAAG FSNQEIAQQL VVTLNTIKTH LKNIYSKLAV TSRTQAIARA RRLNLIANP
|
| |
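The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2461972
END_BP = 2465031
GENE_LENGTH_BP = 3060
PROTEIN_LENGTH_AA = 1019


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not affect this arithmetic: for a gene on the minus strand the span length is the same, only the reading direction differs.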