Gene Haur_0910 details

Gene Information

Locus tag: Haur_0910
Symbol:
ID: 5732811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1043386
End bp: 1046580
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 54%
IMG OID: 641278042
Product: transcriptional activator domain-containing protein
Protein accession: YP_001543686
Protein GI: 159897439
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:
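
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1043386
end_bp = 1046580
protein_length_aa = 1064


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                           # 3195
print(expected_protein_length(length))  # 1064
```

Under those assumptions both values agree with the listed gene length (3195 bp) and protein length (1064 aa).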


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCTGCG CGACGAATAA GTTTTATGCT AAACTTAGGG TTGTGCAATA TGCAATTGAT 
TACCCCCAAC TGAATGATGA TTTGATTCAA CGCCCACGCC TTCAAGCTCA ATTGGAGGCG
TGTTGGTCAT GTCGCTTGAC GATGGTCGTG GCGGCGGCTG GCTATGGCAA AACCACTGCG
CTAGCCCAGT GGATGCAGCA AACGCTGGGT GGTGATTGGC TGTGGTATGG CCTGCACAGC
CCAACCGAGG CTGATCAGCT TCAGCGTTTG TTGAGCAGCC TAACCAAAGC CCGCTTCCCT
CAAATCAATC CAATCAATAG CTTGGCCGAT TTGTTTGATT GCTGGGCCAC CTTGGCTCCG
CGCCGAATTG CCTTGGTGTT TGATGATTGT CAACATTTGC AGGCTAGTGC TTGGCGTTTG
TTGAATGAAT TGGCCCGTTT TGCGCCAACC AACCTGCATC TTATTTTGAG CAGCCGCCAA
TTGCCACAGC TTGATTGGGC TGGCTTGGCG GCACGGGGGC AGTATCGCCA ATTAACTGCC
GCTGAATTGC GATTTAGCCA AGCCGAATTA CAGGAGCTAT TGCCTACCCA AAGCCCGAGC
CAACGCCAAG CAGCTTGGCA AGCAACCAAT GGCTGGCCTG CCGGTTTGCG CTTGTGGCGA
GCCTTGCCCT CAGGCTATCA AGGCACGGTC AGCGATTTTC TTGCCCAAGA TGTGTTGGGG
CAACTGCCTG AGGCGGTGCG GCGCACTGCC CAATTTGCCG CGCTCTTGCC CTTTTTTAAT
CAAGCGGTGC TCACAGCGAT GGCCGCGCCT GATCCAGCTA GTTTGTGGCA TTACAATCTT
TTTGTACTGC CTGATCACGC CGAATGGTGG CGCTTTGAGC CATTTTGGTT GGAAGTGCTG
CGCCAACAAC CGTTGGCTGA AATCACCCAA TTGTTGGCCC AAGCGGCCAC ATGGTTTGAG
CAGCAGGGGC ATCTGGCGGC AGCCTTGGAT GTTTGGTGTC GTTTGGGACA ATGGGCGCAG
GTGGCCGAGC AGCTTAAGCA CCATGGTTTG CAGCTGTTGG CTCAACATCA GCCAGTCTTG
CAATGGCTCC AACAACTGCC AGCAACCGAG CGCCAAACTG CCGAATTGCT GCATCTGGAT
GGTCTGGCCT TGCGCGAACA CGATCCTGGG TTGGCCGCCC AAACCTTGGC GCAAGCGGCT
CAACGCTATC GAGCTGAGCA ACGTTATGCC GAGGCTTTTC AGGTAGTTGG CGAACAATGT
TTGATCTATT TTTGGCAGGG CGATGAGCAA GCCTTGATTG CGGTGGCGCG TGAAAGTTTT
AGCCTCAAAA GCTTCGTTTG GTATCGCCAA CGCCGTGATT TGTTGAAATT TCCCTTGCTC
TTGTTTCAAA TTAAGCGTGG CCGCTATATC AAGGCCTTGG CGACGGCTCA AAGTTTAGCC
CAAAGCGAAT TGCCCTTCTT TTGGCGTTGG GTGGCGGCCT GCGTCGTTGG CGGTTTGTAT
ACGATTTTGA CCTTGCCGAA TGAGGGCATT CAATGGCTCG AAGTTTGGTT GCAACACCCC
CAAGTTCAAG CCGAACCAGC TATGCGCATG AGTTTGCTCG ATTTGTTGGC AACCTGCTTG
ATGAGCCGTG CCGCGCCCGA CGATCGCATC GCTGCCCAAG ATTTAGCTGA TCAAGCCAGC
CGTTTAAGCG AACGCTATGG CGTGCGTTTG ACTCGTTTGC AAACTCGCGG GACGAAACTC
GGTTTGGCCT TGTTGGAAGC TGATCGGCCA ACGACTGAGC GTTTGATTCA GCAATTGCTC
TTGCCCAGCG ATGAGCCATT GCCCTCGATT ATGCGCAATC GCTTGCTTGC ACTACGCGCC
TTTGCTTGGG CTAGTTTGGG CGAAATTAAG CTGGCGCAGC ATGATGCAAC GCTCAGCGAA
CGCGGCTTGA TTCGCGATGA TGCGCTGTAT GGAGCTGATC CACGCATGTG GTTGATGTTG
GCGCAAGCCT GGTTTTGCTG TGGCGAATAT CAACGGGCCT TGGCGGCTTT GGAGCGTGCC
AAACCGCTGA TCGAGGCCGC TCAATCGCCG ATTTTACGCT TGCGAGCAGG CTTGGTGCAG
CTGGCTAGTC GTTGGCAACT TCAGCCTAGC CCGCAGTTAA TTGCTGAAGC GACCACGATT
TGGCGCGATT ACTTATCCGA TGGCGACCGC CATATGAACG CAACGCCGCT GCATTTAACC
ACGCTGTTGG TAGAATTGGG CTTGCGCACG GGAATCGCGC CACAGCGAAT CGCCCAATTT
TTAGCTGAGC GTGATCGAGC GGCTTTAGAA GAGATTTGTT GGAAACTGTA TGCCGAGCAG
CCGCAACAAC AAGCCGCGCT CTTGCAACTT TTGGGCTTAT ATGGCTCAGC AGCCAGCCTT
GAGCGTTTGC AAGAAGTGAT TAAACAAGCC TCAACCCAGC AGCGCAAAAT TGCCCAACAC
AGCTTAACTA GCATTCGCCA GCGCCCAGCC TATGCGCTGA AAATCCAGCT TTTTGGTAGT
TTGCAACTTT GGCGTGGCAC GGAATTGGTC GATCCTAACG AGTGGTCACG CGAGAAGGCA
CGCCAACTAT TGGCCTTATT GGTGCTGCAA CGCCCAAGAA TTATCAGCCG TGAAGCCTTG
ATCGAGCACT TTTGGCCTGA TCTTACGCCG CAAGCCGCTG ATGGAGCCTT GCGCGTGACC
TTGAATGCCC TGTTGCATGT GCTAGAGCCG GAACGCAGCG GCGGCGCTAA TTCGGCCTTT
GTCTTGAGCG AAGCAGCGGG CTTGCGTTTA AATCCACAGG CACAGATCGA TACTGATTAT
GCTGAATTTC AAACCTTGCT CCAAACTGCC GCCAAGCAAC GCCAGCAGGG AGCCATGCCC
GCCGCTTTGC AAGCCTATCA AGCAGCCTTG GCCTTATATC AGGACGATTT ATTAAGCGAT
ATTGCTTACG CCGAATGGGT GCTGGATTGG CGTGAGCAAG CTTTGAGCCA ATTTGTGGCT
GCAACCAGCG ATTATTTGGA ATTATTGTTG GCTTATGGCC CGACTGCTGA GGCCATTCCC
TATGCTGAGC GTTTGTTGAG CTATGACCCC TATCACGAAC CAACTTACTT GCGCCTGATC
GACATCTACC ACATGTTGGG CAATGCTAGT GCTGCTGAGC GCATTCGAAA ACGCCTCGAA
CGTATTTCCT TATAA
 
Protein sequence
MCCATNKFYA KLRVVQYAID YPQLNDDLIQ RPRLQAQLEA CWSCRLTMVV AAAGYGKTTA 
LAQWMQQTLG GDWLWYGLHS PTEADQLQRL LSSLTKARFP QINPINSLAD LFDCWATLAP
RRIALVFDDC QHLQASAWRL LNELARFAPT NLHLILSSRQ LPQLDWAGLA ARGQYRQLTA
AELRFSQAEL QELLPTQSPS QRQAAWQATN GWPAGLRLWR ALPSGYQGTV SDFLAQDVLG
QLPEAVRRTA QFAALLPFFN QAVLTAMAAP DPASLWHYNL FVLPDHAEWW RFEPFWLEVL
RQQPLAEITQ LLAQAATWFE QQGHLAAALD VWCRLGQWAQ VAEQLKHHGL QLLAQHQPVL
QWLQQLPATE RQTAELLHLD GLALREHDPG LAAQTLAQAA QRYRAEQRYA EAFQVVGEQC
LIYFWQGDEQ ALIAVARESF SLKSFVWYRQ RRDLLKFPLL LFQIKRGRYI KALATAQSLA
QSELPFFWRW VAACVVGGLY TILTLPNEGI QWLEVWLQHP QVQAEPAMRM SLLDLLATCL
MSRAAPDDRI AAQDLADQAS RLSERYGVRL TRLQTRGTKL GLALLEADRP TTERLIQQLL
LPSDEPLPSI MRNRLLALRA FAWASLGEIK LAQHDATLSE RGLIRDDALY GADPRMWLML
AQAWFCCGEY QRALAALERA KPLIEAAQSP ILRLRAGLVQ LASRWQLQPS PQLIAEATTI
WRDYLSDGDR HMNATPLHLT TLLVELGLRT GIAPQRIAQF LAERDRAALE EICWKLYAEQ
PQQQAALLQL LGLYGSAASL ERLQEVIKQA STQQRKIAQH SLTSIRQRPA YALKIQLFGS
LQLWRGTELV DPNEWSREKA RQLLALLVLQ RPRIISREAL IEHFWPDLTP QAADGALRVT
LNALLHVLEP ERSGGANSAF VLSEAAGLRL NPQAQIDTDY AEFQTLLQTA AKQRQQGAMP
AALQAYQAAL ALYQDDLLSD IAYAEWVLDW REQALSQFVA ATSDYLELLL AYGPTAEAIP
YAERLLSYDP YHEPTYLRLI DIYHMLGNAS AAERIRKRLE RISL
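
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, which does not matter here because this gene begins with ATG. The `translate` helper is a hypothetical name written for this example, and only the first and last sequence lines are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGTGCTGCG CGACGAATAA GTTTTATGCT"  # start of the gene sequence
last_15 = "CGTATTTCCT TATAA"                   # final line of the gene sequence

print(translate(first_30))  # MCCATNKFYA
print(translate(last_15))   # RISL
```

The two outputs match the first ten and last four residues of the protein sequence listed above. The final fragment is in frame because the 3180 bases preceding it are a multiple of three.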