Gene Information |
Locus tag | Haur_4768 |
Symbol | |
ID | 5736612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6080389 |
End bp | 6084831 |
Gene Length | 4443 bp |
Protein Length | 1480 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281933 |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_001547527 |
Protein GI | 159901280 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
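The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 6080389
end_bp = 6084831
gene_length_bp = 4443
protein_length_aa = 1480


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which suggests the listed Gene Length and Protein Length are consistent with the listed coordinates.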
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAC GTACTCGTAA ACCCACGCTG AAAAATCGGG TGGAAGCCCT CGAGGCTGAA CGCCATCAAA CCCAACGTGC TCTCGATGCG CTCTATCGCA TTGGGCTAGC TTGTCGTGGG CAGCAAGATT CGTTCGAGTT GTTGCGCACC ATTTATCAAG AATTGCAGAC AGTTTGGAAT TTTGATGCCT GTTTTATTGC CCTCTCCGAC CAATACGACG ATAGCGCGTA TCGTATCGCG ATGTTGGCCG ATGAGGGCGT GATTGAATTT AGTGAGCATG ATCCAATTGG CCCATTGACT GGCTATCTGA TTCGCCAACG TCAGCCCTTG CTCTTTCGTG ATTTGGCGAT TGAACGTGAG ACGATTGGCT TACCGCGCCT TTTGCAATTT GGCAGCGATA AGCTTTCGCG CGGCTGGATG GGCGTACCAT TAGTTATTGG TACTTCGGCC TTGGGCGTGA TTTCGCTGCA AAGTTACACC GTTGGCGCAT TCGATGAGAC AGATCTTGAT CTGTTGACCC GGATTGCTAA TTCGATGGCA GTGGCGCTGG AAAATGCCCT GCTCTCGCAT CGTCAGGCTG AATTAACTGG CTCGTTGGAA CGTCAAATCG AGCTGCGTAA CGCCGATCTA TTTGTGATCA GTGATATTGC GGCGATGTTG ACTCAGCAAT TGCCATTGAC TGAGATGTTT GGCCTAGCGC TCGATTTTAT GCTGGAGTTG ATGCATCTCA ACGCTGGCGT GGTGTGTTCG CTGAGCGGGA GCGAGCTTCA GCCAATTGTT CAGCGTGGCA TGCAACATGG CAGCCTGCCT GAATCGTTTG CGCCGCTTGA TTCAATTTTT GGGGTTGCGA TTAGTAGCAA TTCGGCGATT GTCGATAATC ATACTCACTC CAAATATTGG CAAATGGCAC AACTCAGCCA TCTCGGCACA ACCCTCGCCA TCCCTTTGCG CCGCCATGAA ACCGTGATCG GTGTCTTGAT GGTTGGCCAA ACTGAGCAAC GCGCCTTCAA AACTGAAGAA GTTGAGTTGT TGCAAGTGGT CAGCAATCAA TTGGCTTTGG CCTTGGAGCA TGGCAATATT TTGGCCCAAC AACGTCGCCA AATTGCTGAA CTAGAGGCCT TGAGTGCGAT CAGTACGGCA ACCGTTCGAG CGCTCAATCT CTCAACCTTG TTGCATCAAT TAAATGATGC AATTCGCTCG TTCTTGCCAG TCGATGTGTT TTATATGGCG ATTTACGACC CTGAGCGCCA ATTATTGACT GATAGCATTG CGATTGAGGA TGAGAATGAG GCGACTTACC TGAGCGAAGA GGCGATGCCG CGCAAAGGTT CGTTTACCGA TTGGGTGCTG AGCAAATGTG AACCATTGTT TTTGCGCCAT GTTTCACGCG ATATTCGCCA TTACCCAACC ATTATTCGGC GCACGATCAA AGGTTCGCCC TCGGAAAGTT GGCTGGGCGT GCCAATGCTC GATGCCAATC GCCGTCCTTT GGGCGTGATC GCCATTCAAA ATTATCGCCC CTATGCCTTC AGCGATCGCG ATCGCTTTTT TATGCAATCG GTCGCCAGCC AAGTTAGCTT ACATGTGTTG AATGTGCAGT TGTATCAGCA GCGCGAACGC CAATTGGCTG AACTGAATGC CTTGCAACGG ATCAGTAGTT TGCTCGGCTC AACCCTCGAA ATTGAGGCCA TGCTGCGGGC AATCGATGCT GTGCTCACTG AATTTTTGCA TATTGATGGC TTTTTTGTGA TGCTCAATCA TCCCCAAAGC CACATGGTCG AGGCGGTCTA TGCCTTGAAT 
CGCGAGGGCG AGGGTGATTT TCGTTGGATG ATCGGCCTGA TTCCGCCCGA GCACACACCA ACGTGGGAAG TGTTGCACAC TCGCAAGCCG TTGCGCTTTG GTGATATTTC ACAAGAAACC TCGACTGAGG TTGCCCCCGA AGATAACGAA GTGCGTATTT CGTCGGATCG AACGAAGGCT TGGCTGGGCG TGCCGCTCAA CGATCAAACA ACCAATGTGA TTGGGCTGAT TGCGATTCAA AGTTTTCAAG CCAATGTGTT TAGCAATCGC GATGAGCAAT TTATGGCGCA GGTCGGCCAA CAACTGACTT TAGCAATTCA AAATGCGCGG CTGTTTGCGC AACGCGAACG TCAATTGGCC GAACTCAATA CCTTGAAATT GGTTGGCGAA TTGCTTAATC GCACCATGGA TATCCACGAA ATGTTCCGTG GCTTGAATCC ATTATTGACT TCGTTTCTCA AAATCGATGG CTTTTATATC TTATTAAACA ACCCCCAAAC GTATGTGATC GAAGATTTAT GCGTGGTTGA GCGTGGCGAG TTGCTCGATT ATGGCTCGAT GATCGGTACA TCGCTGCCGC TCAATACACC GACAGCCTGG ATTTTGCGCA ATGGTAAAGC GCTGCGTTTC AACAATACAA TTACCGATAT TCCCAAACTG TATCCCGAGT TGAAAACGGT GCAGGTCAAT GATGAAGTCG CCTTGTCGTG GCTTGGCACA CCCTTGATCA ACCACCGTGG CGAGGTTTTG GGCGCGATCA CCACGCAATC GATGAATGCT AGCCATTTCA GCGAGAGCGA TGAGCAATTT ATGCTGCAAG TGGCGCATCA ATTGGGCTTG GCAATTCAAA ATGCTCGTTC ATTTGCCCAA CGCGAGCGCC AGTTGGCTGA GCTTGATGCC CAACAAGGCA TCACCCAATT GGTCACTTCG ACCCTTGATT TATATGAAAT GCTACGCTCG ATGGATTTGG TGTTGCGTAG TTTTCTGAAT GCCGATGCCT TCCAAGTGGT GATTGGCAAC GCTGATCGGG TGGAAACGGC GGTAGTTTTA GAAGAAGGCA AGGAAGTTGA AACGGCGGTG ATTGGCCATC CTTTGCCCGA AGGCTCACTG ACGCGCTGGA CCTATCTGCA TGTCAAGCCG CTGCGCATGA ACGATATTTA TCGCGATTGG GCGCTCTATC CCGATTTGCA GGAGCCGCCG GTTCCGACCA ACTCTGGGTT TATGCACTCG TGGCTGAGCG TCCCGCTGAT CGCCTCGGAT CAGCCGTTGG GTGTGCTAGC AGTGCGGGCA ACGCGACCAG CGGCCTTTGG CCCAAGTGAT GAGCAATTTT TGTTTAATGT CGGTCGCCAG CTGGCGCTGA GTGTGCGCAA TGCCCGCTTG TATGCGGCTG AACAAACTGC CCACCGCACT GCCGAAACTA TGCGCGAAAT TGCCCGCGTG CTCAACACCA CCTTCAATCC CGATGAAGTG CTCGATTTGA TTTTGCGTGA ATTGCGCAAG GTAATCACCT TTGACTCAAC CTCGGTCATG CTCCCATCGA ATAATTTGCT GCGGATTGTT GCGCGGCAAG CCCAAGATGA GCAGTTGGCG GTCGAATGGC GCGAATTGAC CTTCCCGCTT GATCAGACCA GTGGTGCGGG ACGGGTGATG TTGAGCGGTC AGCCGTTGGT GGTTCCCGAT ACCGTCAGCG ATCCGCAATG GACGCGCTCG CCGATGCCAA GTGTGGTGCG CTCGTGGATT GGCGTGCCGT TGATCAGCAA GGGCGTGGTA CTCGGCGTGC TGAATATTAA TTCGTTACAA CCCAACGCCT 
TTACCCAAAG TGATACTGAT TTGGCGATGA CCTTCGCCAA CCAAGCAGCA ACAGCGCTTG AGCATGCGCG GCTCTACCAA GAATCAGTTA CGCGGGTTGA GCAAGAACTA GAAATTGCCC GCCAAATTCA GAGCAACTTG TTTCCACGTA GCTTGCCTGT GGCCCAAGGC GTGGAGTTGG CGGCCTTGTG TTTGCCAGCC CGCGAAACTG GCGGCGATTT CTACGAGGTA ACCGAGTTGC GTGATGGTCG CTGGGCTTTG ATGGTTGGCG ATGCTTCGGG CAAGAGCATT CCTGGGGCAA TGTTGATGGC GGTGGCACGT TCGATTGTGC GCTCGGAAGC ATGGGATCAC GAGATACCGC AAATTGTGAT GCAGGAAACC AATCGCTGGG TAACCATGGA TATTCCACGA CATACTTTTG TGGCCTTAGC CTATGCAACC TTTGATACGC TTGATTATAG TTTGGCCTTG GCGAATGCTG GCCAACTCGA CCCGATTATT CGCCGCGCTA ATGGCGATTT GGAATATGCG ACCGCGCCAG GCCCACACTT TCCGCTGGGC ATTATGGCCA ACACACCCTA TGAAACGGCC AGCTATCAGC TTGAACCCAA CGATATGGTG CTGTTCTACA CCGATGGTGT GGTTGAATCG AAAAATACCA GCGGCGAGAT GTGGGGCTTC GATCGCTTCG AGACCTTGCT GCGCGAACAC GATCATAGCC TGACCAGCGC TGAATGGGTG AATTTAGTGA TCGACGAAAT TAATCAATTT ATCGGCGATC ACCCGCAACA CGACGATATT ACCCTGGTGG CGCTCAAAGT TGCTGGCGCT TAA
|
Protein sequence | MTQRTRKPTL KNRVEALEAE RHQTQRALDA LYRIGLACRG QQDSFELLRT IYQELQTVWN FDACFIALSD QYDDSAYRIA MLADEGVIEF SEHDPIGPLT GYLIRQRQPL LFRDLAIERE TIGLPRLLQF GSDKLSRGWM GVPLVIGTSA LGVISLQSYT VGAFDETDLD LLTRIANSMA VALENALLSH RQAELTGSLE RQIELRNADL FVISDIAAML TQQLPLTEMF GLALDFMLEL MHLNAGVVCS LSGSELQPIV QRGMQHGSLP ESFAPLDSIF GVAISSNSAI VDNHTHSKYW QMAQLSHLGT TLAIPLRRHE TVIGVLMVGQ TEQRAFKTEE VELLQVVSNQ LALALEHGNI LAQQRRQIAE LEALSAISTA TVRALNLSTL LHQLNDAIRS FLPVDVFYMA IYDPERQLLT DSIAIEDENE ATYLSEEAMP RKGSFTDWVL SKCEPLFLRH VSRDIRHYPT IIRRTIKGSP SESWLGVPML DANRRPLGVI AIQNYRPYAF SDRDRFFMQS VASQVSLHVL NVQLYQQRER QLAELNALQR ISSLLGSTLE IEAMLRAIDA VLTEFLHIDG FFVMLNHPQS HMVEAVYALN REGEGDFRWM IGLIPPEHTP TWEVLHTRKP LRFGDISQET STEVAPEDNE VRISSDRTKA WLGVPLNDQT TNVIGLIAIQ SFQANVFSNR DEQFMAQVGQ QLTLAIQNAR LFAQRERQLA ELNTLKLVGE LLNRTMDIHE MFRGLNPLLT SFLKIDGFYI LLNNPQTYVI EDLCVVERGE LLDYGSMIGT SLPLNTPTAW ILRNGKALRF NNTITDIPKL YPELKTVQVN DEVALSWLGT PLINHRGEVL GAITTQSMNA SHFSESDEQF MLQVAHQLGL AIQNARSFAQ RERQLAELDA QQGITQLVTS TLDLYEMLRS MDLVLRSFLN ADAFQVVIGN ADRVETAVVL EEGKEVETAV IGHPLPEGSL TRWTYLHVKP LRMNDIYRDW ALYPDLQEPP VPTNSGFMHS WLSVPLIASD QPLGVLAVRA TRPAAFGPSD EQFLFNVGRQ LALSVRNARL YAAEQTAHRT AETMREIARV LNTTFNPDEV LDLILRELRK VITFDSTSVM LPSNNLLRIV ARQAQDEQLA VEWRELTFPL DQTSGAGRVM LSGQPLVVPD TVSDPQWTRS PMPSVVRSWI GVPLISKGVV LGVLNINSLQ PNAFTQSDTD LAMTFANQAA TALEHARLYQ ESVTRVEQEL EIARQIQSNL FPRSLPVAQG VELAALCLPA RETGGDFYEV TELRDGRWAL MVGDASGKSI PGAMLMAVAR SIVRSEAWDH EIPQIVMQET NRWVTMDIPR HTFVALAYAT FDTLDYSLAL ANAGQLDPII RRANGDLEYA TAPGPHFPLG IMANTPYETA SYQLEPNDMV LFYTDGVVES KNTSGEMWGF DRFETLLREH DHSLTSAEWV NLVIDEINQF IGDHPQHDDI TLVALKVAGA
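The gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), so the protein sequence can be reproduced by translating it directly. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard table does and differs only in the permitted start codons; the helper name `translate` is hypothetical, and alternative start codons are not handled (not needed here, since this gene starts with ATG).

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first because the record prints the sequence
    in space-separated blocks of 10 nucleotides.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        # Codons containing ambiguous bases are reported as X.
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACCCAAC GTACTCGTAA ACCCACGCTG"
print(translate(first_blocks))
```

Applied to the first three blocks, this yields the first ten residues of the listed protein sequence; passing the full gene sequence would be the natural way to check the whole record.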