Gene Haur_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1995 
Symbol 
ID5733884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2461972 
End bp2465031 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content53% 
IMG OID641279139 
ProductATP-dependent transcription regulator LuxR 
Protein accessionYP_001544766 
Protein GI159898519 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.686742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTTCC GTGCGATTAG CCATCGAACC ACACCCATTT GTGATACTGA ATGGCTTGAA 
TATACGATCA ATGGTCGAAT CCAACGGGTA GCGGTTGAAT CAAGGGCGTG GTACGAATGG
CTGCATGCTC CGGCGCACAC AGCATTCGCT TTTATGTGCA GTAGCGGAAC CTTTACCGCC
CGCCGTGAAG CGCGTCGTGC CCGCGCATAT TGGTATGCCT ATCGTAAAAA GGCCGGAAAA
ATAGCTAAGG TGTATCTCGG ATCAGCCGAG CAACTCACGC TCGAACGGCT GATCCGAGCG
GCTGCAACGT TGGCCAAGCA GCCAAGCGTA CCCGTACCAA CTCCGTTGCT TCACACCAAG
TTGAGCATTC CAGCAGTGCG ATCAAGGTTC GTTTCACGCC GATCGATCTT CACAGTTTTC
AACCAAATGC TGCCGTTAAC ACTGGTAAGT GCCTCGGCTG GCTTTGGCAA AACGACGCTG
ATCGCTGAAT GGGCGCGAAC ACAGCCGTAT CCGTTGGCCT GGCTTACGCT TGATTCAAAT
GATAACGAGC CTAGTCGTTT CTGGGGCTAT AGCCTGACAG CACTGCATAT GGTTGCGCCA
CTGCTCACAA CCGAGGCTTT GGCACTGCTC CATGCGCCAC AAGCAATTGA TCTATCATTT
GTGCTAACGA ATCTGATTAA TAACCTAATG CTATTGAATC AGCCTCTTAC CCTCGTGCTC
GATGACTATC ACACAATCAA CAATCAAGCG ATTCACAATC AGCTGACATG GCTGCTCGAT
CATGCGCCGC AGGCGTTTCG GCTTGTGCTG ATCAGCCGTA CCGAGCCACC AATGCCACTC
GCTCGTTGGC AGGCAGCAGG TCGATTGAAT ACGATTTTGG TAGATGAGTT ACGGTTCAAC
AATTCCGATA TTCATAGGTT TTTCCACACC ACCATGCAGC TCGATCTGGC AGCTGATGTG
CTCGCAAGCC TAGCGGCACG TACCGAGGGC TGGATCGCAG GGCTACAACT AGCGGCGCTC
TCACTTCAAG GTCAGCCCGA GTTAACCATG AGTGAAGGAC TAGATTCCAG CGTTAGCAAT
CAACGGGCGC TATTTGATTA TTTTTCGCAC GAGGTGCTCC AACAGCAACC GAGCGCAATC
CAGCAATTTT TGCTGCAAAC GGCGATCCTT GATCAACTAT GCGAGCCACT CTGTGCCGCT
GTGACCGATC ACGGTGCAAC AGCGGGAATG CTCGATTACC TTGAACGCTC GCATTTATTT
GTGGTAGCAC TTGATCGCAA GCACCACTGG TATCGCTACC ATCAGCTTTT TCGCGAAAGC
TTGCTGCACC ATGCCAAGCA GCAATGGGGT GCAGCAGGAA TCGCACAACT TCACAAGCGG
GCTAGTTGTT GGTTTGAACA AGCGGGCTAT CCAGCAGAGG CGATCAACCA TGCGTTGGCC
GCCGCCGATT TTGAACGGGC TGGGCGATTG ATTGCCAAAA TCGGTTTTCG CATGTTGTGG
CGCGGTGAAC ATACAATCTT GAAGGGATGG TTACATGCGT TACCCGCCAC GATCATTGAG
CATAATGCCT ATCTCTGTCT TTGGTCGGCA TGGCTTCTCG TCGAGCAAAA CCAGCTAGAA
GCAAGTGGCT ATTATCTGAG CCTGATTGAC GAATTGCTCA GCCACACCAT GAGCGACGAA
GCTGCCGAAA CCAGGGCGAT AGATGGTCAC CGCAAGGCAC TTCAGGCTAG TATCGCCCGT
CGGCGGGGCG ATATGCCAAC CACGTTGGCA CTAACCCACC AAGCCTTAAA CGCGCTGCCA
CGAGATAGTG CCTTGCTGCG CAGCATGATT ACGCGCAATC TCTGTGCTGG CTACATTATC
AGTGGCGATA CGGTAGCAGC AGAGGCAGCA CTACACCAAG CGTTATGTGA ACAGGAATTA
CTTGAGGCCT CCATCGCATC AGATCATCAT CACCCCAGCG AACGCCAGCA TGCTCATACA
ATTCGGTTGC TCTTGTGGAT TGAGACAACT TCACTGCGTT GGCTCCAAGG CCAATTTCAT
GCTGCTGCCG ATTTGTATCG CCAAACCTTG CATCTAGCCC GTGAACAGCA CCAACATGCC
GTAAGCGCCA TCGCCTGCGT AAATTTAGGT CAGATTTTGC GGCAATGGAA CAACCTAGCC
GAAGCCCGCG AGTATCTGCA ACAAGGTATT GGCTATAGCC TGAACGTTGG TGCGGATGTA
ACCCGGCGCA ATGGGTTAAT TGAACTTGCT CGCATCCAAC AAGCCCATGG TGAACCAGCG
CAAGCACTGG CAACGATGGC CCAAGCGGTT GCGCTTGCCC AGACCCTCCC TTCACCTCGT
GGCTTGCTCT GGGCTACAAC CTGGCAAGCA CGGCTCCAAC TAGCACAGGG CGATCTAGCG
GCAGCAACCC GCTGGGCGCA GGAATACCAA CGGCTAGCAA ATCCATTTCC GCAGTTTAAT
ATATATGATG CCGAAGATTT GACGCTGGCG CGTATCCTGA TCGCCCAAGG TCAGCATCAG
CAAGCCAGCG CCCTGCTTGA GCAACTGCTC CCTGCATATC AAGCCGCAGG ACGGCTTCCT
AGTGTGATCG AAGTTTATCT GCTGCAAGCA CTCAATCTTG CCGCACAGCA GGATTGGTCA
GTTGCGGGCA GGGTGCTCAT TCAAGCCCTA CGCTTGGCAG AACCAGAGAA TTATCTACGC
CTATTTGTTG ATGAAGGCCC AGCACTTTCC AACCTGTTGG TGCAGATCGA ACCACAGGTG
CAGGCAACGT TGCGCCAGTA TGTACAACGT TTATTGGTTG TTTGTGAACT GCCTAGAAGC
ACCACGCCAG AGCAGCTTAG CCCAATCTAT CGCTTAATCG AGCCGCTCAG CGAACGCGAA
CTTACGGTTA TACGATTGCT AGCGGCGGGT TTCTCGAATC AAGAGATTGC CCAGCAGCTC
GTTGTAACGC TCAACACGAT CAAAACCCAT CTAAAGAATA TTTATAGCAA ATTGGCGGTC
ACTAGCCGTA CCCAAGCAAT TGCTCGTGCC CGTAGGCTCA ACCTGATCGC CAATCCTTAA
 
Protein sequence
MVFRAISHRT TPICDTEWLE YTINGRIQRV AVESRAWYEW LHAPAHTAFA FMCSSGTFTA 
RREARRARAY WYAYRKKAGK IAKVYLGSAE QLTLERLIRA AATLAKQPSV PVPTPLLHTK
LSIPAVRSRF VSRRSIFTVF NQMLPLTLVS ASAGFGKTTL IAEWARTQPY PLAWLTLDSN
DNEPSRFWGY SLTALHMVAP LLTTEALALL HAPQAIDLSF VLTNLINNLM LLNQPLTLVL
DDYHTINNQA IHNQLTWLLD HAPQAFRLVL ISRTEPPMPL ARWQAAGRLN TILVDELRFN
NSDIHRFFHT TMQLDLAADV LASLAARTEG WIAGLQLAAL SLQGQPELTM SEGLDSSVSN
QRALFDYFSH EVLQQQPSAI QQFLLQTAIL DQLCEPLCAA VTDHGATAGM LDYLERSHLF
VVALDRKHHW YRYHQLFRES LLHHAKQQWG AAGIAQLHKR ASCWFEQAGY PAEAINHALA
AADFERAGRL IAKIGFRMLW RGEHTILKGW LHALPATIIE HNAYLCLWSA WLLVEQNQLE
ASGYYLSLID ELLSHTMSDE AAETRAIDGH RKALQASIAR RRGDMPTTLA LTHQALNALP
RDSALLRSMI TRNLCAGYII SGDTVAAEAA LHQALCEQEL LEASIASDHH HPSERQHAHT
IRLLLWIETT SLRWLQGQFH AAADLYRQTL HLAREQHQHA VSAIACVNLG QILRQWNNLA
EAREYLQQGI GYSLNVGADV TRRNGLIELA RIQQAHGEPA QALATMAQAV ALAQTLPSPR
GLLWATTWQA RLQLAQGDLA AATRWAQEYQ RLANPFPQFN IYDAEDLTLA RILIAQGQHQ
QASALLEQLL PAYQAAGRLP SVIEVYLLQA LNLAAQQDWS VAGRVLIQAL RLAEPENYLR
LFVDEGPALS NLLVQIEPQV QATLRQYVQR LLVVCELPRS TTPEQLSPIY RLIEPLSERE
LTVIRLLAAG FSNQEIAQQL VVTLNTIKTH LKNIYSKLAV TSRTQAIARA RRLNLIANP