Gene Haur_4768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4768 
Symbol 
ID5736612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6080389 
End bp6084831 
Gene Length4443 bp 
Protein Length1480 aa 
Translation table11 
GC content52% 
IMG OID641281933 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_001547527 
Protein GI159901280 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAC GTACTCGTAA ACCCACGCTG AAAAATCGGG TGGAAGCCCT CGAGGCTGAA 
CGCCATCAAA CCCAACGTGC TCTCGATGCG CTCTATCGCA TTGGGCTAGC TTGTCGTGGG
CAGCAAGATT CGTTCGAGTT GTTGCGCACC ATTTATCAAG AATTGCAGAC AGTTTGGAAT
TTTGATGCCT GTTTTATTGC CCTCTCCGAC CAATACGACG ATAGCGCGTA TCGTATCGCG
ATGTTGGCCG ATGAGGGCGT GATTGAATTT AGTGAGCATG ATCCAATTGG CCCATTGACT
GGCTATCTGA TTCGCCAACG TCAGCCCTTG CTCTTTCGTG ATTTGGCGAT TGAACGTGAG
ACGATTGGCT TACCGCGCCT TTTGCAATTT GGCAGCGATA AGCTTTCGCG CGGCTGGATG
GGCGTACCAT TAGTTATTGG TACTTCGGCC TTGGGCGTGA TTTCGCTGCA AAGTTACACC
GTTGGCGCAT TCGATGAGAC AGATCTTGAT CTGTTGACCC GGATTGCTAA TTCGATGGCA
GTGGCGCTGG AAAATGCCCT GCTCTCGCAT CGTCAGGCTG AATTAACTGG CTCGTTGGAA
CGTCAAATCG AGCTGCGTAA CGCCGATCTA TTTGTGATCA GTGATATTGC GGCGATGTTG
ACTCAGCAAT TGCCATTGAC TGAGATGTTT GGCCTAGCGC TCGATTTTAT GCTGGAGTTG
ATGCATCTCA ACGCTGGCGT GGTGTGTTCG CTGAGCGGGA GCGAGCTTCA GCCAATTGTT
CAGCGTGGCA TGCAACATGG CAGCCTGCCT GAATCGTTTG CGCCGCTTGA TTCAATTTTT
GGGGTTGCGA TTAGTAGCAA TTCGGCGATT GTCGATAATC ATACTCACTC CAAATATTGG
CAAATGGCAC AACTCAGCCA TCTCGGCACA ACCCTCGCCA TCCCTTTGCG CCGCCATGAA
ACCGTGATCG GTGTCTTGAT GGTTGGCCAA ACTGAGCAAC GCGCCTTCAA AACTGAAGAA
GTTGAGTTGT TGCAAGTGGT CAGCAATCAA TTGGCTTTGG CCTTGGAGCA TGGCAATATT
TTGGCCCAAC AACGTCGCCA AATTGCTGAA CTAGAGGCCT TGAGTGCGAT CAGTACGGCA
ACCGTTCGAG CGCTCAATCT CTCAACCTTG TTGCATCAAT TAAATGATGC AATTCGCTCG
TTCTTGCCAG TCGATGTGTT TTATATGGCG ATTTACGACC CTGAGCGCCA ATTATTGACT
GATAGCATTG CGATTGAGGA TGAGAATGAG GCGACTTACC TGAGCGAAGA GGCGATGCCG
CGCAAAGGTT CGTTTACCGA TTGGGTGCTG AGCAAATGTG AACCATTGTT TTTGCGCCAT
GTTTCACGCG ATATTCGCCA TTACCCAACC ATTATTCGGC GCACGATCAA AGGTTCGCCC
TCGGAAAGTT GGCTGGGCGT GCCAATGCTC GATGCCAATC GCCGTCCTTT GGGCGTGATC
GCCATTCAAA ATTATCGCCC CTATGCCTTC AGCGATCGCG ATCGCTTTTT TATGCAATCG
GTCGCCAGCC AAGTTAGCTT ACATGTGTTG AATGTGCAGT TGTATCAGCA GCGCGAACGC
CAATTGGCTG AACTGAATGC CTTGCAACGG ATCAGTAGTT TGCTCGGCTC AACCCTCGAA
ATTGAGGCCA TGCTGCGGGC AATCGATGCT GTGCTCACTG AATTTTTGCA TATTGATGGC
TTTTTTGTGA TGCTCAATCA TCCCCAAAGC CACATGGTCG AGGCGGTCTA TGCCTTGAAT
CGCGAGGGCG AGGGTGATTT TCGTTGGATG ATCGGCCTGA TTCCGCCCGA GCACACACCA
ACGTGGGAAG TGTTGCACAC TCGCAAGCCG TTGCGCTTTG GTGATATTTC ACAAGAAACC
TCGACTGAGG TTGCCCCCGA AGATAACGAA GTGCGTATTT CGTCGGATCG AACGAAGGCT
TGGCTGGGCG TGCCGCTCAA CGATCAAACA ACCAATGTGA TTGGGCTGAT TGCGATTCAA
AGTTTTCAAG CCAATGTGTT TAGCAATCGC GATGAGCAAT TTATGGCGCA GGTCGGCCAA
CAACTGACTT TAGCAATTCA AAATGCGCGG CTGTTTGCGC AACGCGAACG TCAATTGGCC
GAACTCAATA CCTTGAAATT GGTTGGCGAA TTGCTTAATC GCACCATGGA TATCCACGAA
ATGTTCCGTG GCTTGAATCC ATTATTGACT TCGTTTCTCA AAATCGATGG CTTTTATATC
TTATTAAACA ACCCCCAAAC GTATGTGATC GAAGATTTAT GCGTGGTTGA GCGTGGCGAG
TTGCTCGATT ATGGCTCGAT GATCGGTACA TCGCTGCCGC TCAATACACC GACAGCCTGG
ATTTTGCGCA ATGGTAAAGC GCTGCGTTTC AACAATACAA TTACCGATAT TCCCAAACTG
TATCCCGAGT TGAAAACGGT GCAGGTCAAT GATGAAGTCG CCTTGTCGTG GCTTGGCACA
CCCTTGATCA ACCACCGTGG CGAGGTTTTG GGCGCGATCA CCACGCAATC GATGAATGCT
AGCCATTTCA GCGAGAGCGA TGAGCAATTT ATGCTGCAAG TGGCGCATCA ATTGGGCTTG
GCAATTCAAA ATGCTCGTTC ATTTGCCCAA CGCGAGCGCC AGTTGGCTGA GCTTGATGCC
CAACAAGGCA TCACCCAATT GGTCACTTCG ACCCTTGATT TATATGAAAT GCTACGCTCG
ATGGATTTGG TGTTGCGTAG TTTTCTGAAT GCCGATGCCT TCCAAGTGGT GATTGGCAAC
GCTGATCGGG TGGAAACGGC GGTAGTTTTA GAAGAAGGCA AGGAAGTTGA AACGGCGGTG
ATTGGCCATC CTTTGCCCGA AGGCTCACTG ACGCGCTGGA CCTATCTGCA TGTCAAGCCG
CTGCGCATGA ACGATATTTA TCGCGATTGG GCGCTCTATC CCGATTTGCA GGAGCCGCCG
GTTCCGACCA ACTCTGGGTT TATGCACTCG TGGCTGAGCG TCCCGCTGAT CGCCTCGGAT
CAGCCGTTGG GTGTGCTAGC AGTGCGGGCA ACGCGACCAG CGGCCTTTGG CCCAAGTGAT
GAGCAATTTT TGTTTAATGT CGGTCGCCAG CTGGCGCTGA GTGTGCGCAA TGCCCGCTTG
TATGCGGCTG AACAAACTGC CCACCGCACT GCCGAAACTA TGCGCGAAAT TGCCCGCGTG
CTCAACACCA CCTTCAATCC CGATGAAGTG CTCGATTTGA TTTTGCGTGA ATTGCGCAAG
GTAATCACCT TTGACTCAAC CTCGGTCATG CTCCCATCGA ATAATTTGCT GCGGATTGTT
GCGCGGCAAG CCCAAGATGA GCAGTTGGCG GTCGAATGGC GCGAATTGAC CTTCCCGCTT
GATCAGACCA GTGGTGCGGG ACGGGTGATG TTGAGCGGTC AGCCGTTGGT GGTTCCCGAT
ACCGTCAGCG ATCCGCAATG GACGCGCTCG CCGATGCCAA GTGTGGTGCG CTCGTGGATT
GGCGTGCCGT TGATCAGCAA GGGCGTGGTA CTCGGCGTGC TGAATATTAA TTCGTTACAA
CCCAACGCCT TTACCCAAAG TGATACTGAT TTGGCGATGA CCTTCGCCAA CCAAGCAGCA
ACAGCGCTTG AGCATGCGCG GCTCTACCAA GAATCAGTTA CGCGGGTTGA GCAAGAACTA
GAAATTGCCC GCCAAATTCA GAGCAACTTG TTTCCACGTA GCTTGCCTGT GGCCCAAGGC
GTGGAGTTGG CGGCCTTGTG TTTGCCAGCC CGCGAAACTG GCGGCGATTT CTACGAGGTA
ACCGAGTTGC GTGATGGTCG CTGGGCTTTG ATGGTTGGCG ATGCTTCGGG CAAGAGCATT
CCTGGGGCAA TGTTGATGGC GGTGGCACGT TCGATTGTGC GCTCGGAAGC ATGGGATCAC
GAGATACCGC AAATTGTGAT GCAGGAAACC AATCGCTGGG TAACCATGGA TATTCCACGA
CATACTTTTG TGGCCTTAGC CTATGCAACC TTTGATACGC TTGATTATAG TTTGGCCTTG
GCGAATGCTG GCCAACTCGA CCCGATTATT CGCCGCGCTA ATGGCGATTT GGAATATGCG
ACCGCGCCAG GCCCACACTT TCCGCTGGGC ATTATGGCCA ACACACCCTA TGAAACGGCC
AGCTATCAGC TTGAACCCAA CGATATGGTG CTGTTCTACA CCGATGGTGT GGTTGAATCG
AAAAATACCA GCGGCGAGAT GTGGGGCTTC GATCGCTTCG AGACCTTGCT GCGCGAACAC
GATCATAGCC TGACCAGCGC TGAATGGGTG AATTTAGTGA TCGACGAAAT TAATCAATTT
ATCGGCGATC ACCCGCAACA CGACGATATT ACCCTGGTGG CGCTCAAAGT TGCTGGCGCT
TAA
 
Protein sequence
MTQRTRKPTL KNRVEALEAE RHQTQRALDA LYRIGLACRG QQDSFELLRT IYQELQTVWN 
FDACFIALSD QYDDSAYRIA MLADEGVIEF SEHDPIGPLT GYLIRQRQPL LFRDLAIERE
TIGLPRLLQF GSDKLSRGWM GVPLVIGTSA LGVISLQSYT VGAFDETDLD LLTRIANSMA
VALENALLSH RQAELTGSLE RQIELRNADL FVISDIAAML TQQLPLTEMF GLALDFMLEL
MHLNAGVVCS LSGSELQPIV QRGMQHGSLP ESFAPLDSIF GVAISSNSAI VDNHTHSKYW
QMAQLSHLGT TLAIPLRRHE TVIGVLMVGQ TEQRAFKTEE VELLQVVSNQ LALALEHGNI
LAQQRRQIAE LEALSAISTA TVRALNLSTL LHQLNDAIRS FLPVDVFYMA IYDPERQLLT
DSIAIEDENE ATYLSEEAMP RKGSFTDWVL SKCEPLFLRH VSRDIRHYPT IIRRTIKGSP
SESWLGVPML DANRRPLGVI AIQNYRPYAF SDRDRFFMQS VASQVSLHVL NVQLYQQRER
QLAELNALQR ISSLLGSTLE IEAMLRAIDA VLTEFLHIDG FFVMLNHPQS HMVEAVYALN
REGEGDFRWM IGLIPPEHTP TWEVLHTRKP LRFGDISQET STEVAPEDNE VRISSDRTKA
WLGVPLNDQT TNVIGLIAIQ SFQANVFSNR DEQFMAQVGQ QLTLAIQNAR LFAQRERQLA
ELNTLKLVGE LLNRTMDIHE MFRGLNPLLT SFLKIDGFYI LLNNPQTYVI EDLCVVERGE
LLDYGSMIGT SLPLNTPTAW ILRNGKALRF NNTITDIPKL YPELKTVQVN DEVALSWLGT
PLINHRGEVL GAITTQSMNA SHFSESDEQF MLQVAHQLGL AIQNARSFAQ RERQLAELDA
QQGITQLVTS TLDLYEMLRS MDLVLRSFLN ADAFQVVIGN ADRVETAVVL EEGKEVETAV
IGHPLPEGSL TRWTYLHVKP LRMNDIYRDW ALYPDLQEPP VPTNSGFMHS WLSVPLIASD
QPLGVLAVRA TRPAAFGPSD EQFLFNVGRQ LALSVRNARL YAAEQTAHRT AETMREIARV
LNTTFNPDEV LDLILRELRK VITFDSTSVM LPSNNLLRIV ARQAQDEQLA VEWRELTFPL
DQTSGAGRVM LSGQPLVVPD TVSDPQWTRS PMPSVVRSWI GVPLISKGVV LGVLNINSLQ
PNAFTQSDTD LAMTFANQAA TALEHARLYQ ESVTRVEQEL EIARQIQSNL FPRSLPVAQG
VELAALCLPA RETGGDFYEV TELRDGRWAL MVGDASGKSI PGAMLMAVAR SIVRSEAWDH
EIPQIVMQET NRWVTMDIPR HTFVALAYAT FDTLDYSLAL ANAGQLDPII RRANGDLEYA
TAPGPHFPLG IMANTPYETA SYQLEPNDMV LFYTDGVVES KNTSGEMWGF DRFETLLREH
DHSLTSAEWV NLVIDEINQF IGDHPQHDDI TLVALKVAGA