Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1421 |
Symbol | |
ID | 5733329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1639891 |
End bp | 1642878 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278559 |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_001544193 |
Protein GI | 159897946 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGCCA GTTTTGATCG TGGAACCCGA CTCTCGAAAC ATCGCATGGT GGCCCTTGGT TGGGCTACCA TGCTCTTTTT GTTGGCAAGT GCCAGCATTA TTTGGCAAGG CTTGCCAATC TTCAACTCGG CTACCTTGCC AGTAGCTGGC GCGTTATTGC TGTTGCTCGA TGGCTTGGCA GTGCGCGATC GTTCGAACCA ACCGCTTAGC CTCTCCGGCA TTGTGCTCTT GGCCGTGGCG TTGATCAGCA CGCCGAGCAC AACTTTGGTT TTAGCCGCTG CCTCGGGTTT ATGCATTCGG CTGAGTCGTT TGGCCCGCTC ACGCTACGAA GAATGGGGCC GCCGCGCCCT TGAGGCTGGC GCACGTACCT TGGGCATCGC GCTCGTCCTG CCATTTATCA GCGATTTTGG GCTGTGGTCA AAACTGATCA TGCTCATCAT TAGTTATATC GTGGTGGTTC AAGGCGCACG CTTGATCTTT GCTTTGCTCT GGGATGGCAA GCAAATCACC ATGGGCACGT GGCAGGTCAG CGCTCCGAAT ATTTTCTCAA TTGAAATTTT GCCCTTGCCG CTGGCGGCAA TTGGTGCCCA AATGGCCGAA GATTTTCCAT TTAGTTTGGT GGCAATCTCA GCAGCAGGCT TGGTTGGCAG CGCATGGATG GTACAACGTG CCAGCCGATC ATTAGGTTTG CAGCGCCGCA CCGTGGCTGA ACTTGGCCAA ATTAATGCGA TCAGCCGCGC GATCATTCGC GCCGAATTAG ACGTTGATTC GCTGTGTCAG TTGGTCTATG GCGAGACCAG CAAGGTTGTC GATACCAGCA ATTTTCGCTT GGGCTTATTT GAAGGCCGTT TCTTTGAACT CAAAGTGCGG GTGCAAGATG GCCATCATGA GCCACCATTG CGCGTCGAAC TCCCCAATGA TCGTGGCATC GTCAGTTGGA TTCGGCGCAC AGGTCGCTCA TTGTTGGTCG AAGATTTCGA CAGCGAAATG GATCGCTTGC CAGCCCAGCC AACCTATCAA GCCGAATATC CGCCACGCTC AGGGGTTTAT ATTCCCTTGA TGACTGGCGA TGAGGTTTTA GGCACAATTT CGATTCAAAG CAGCGAACCA CGTGCTTTTG ATACCGACGA TTTACGCTTG CTCTCGCTGA TTGCCGACCA AGCTGCGGTG GCGATTGACA AGGCACGAGC CTACTCAGCA GCCCGTCGTC GCGCTGCCCA ACTAGCCACT ATCGGCGAAG TTAGCCGCCA AGTTACAGCG ATTCTCGATT TAGATCGTTT GTTGCCTTCG GTAGTGCATC GCATTCGGGT CAGCTTTGGT TATTCGCAAG TGCATCTTTT TACCTTTGAT GAATTGCATC AACAGCTTTT CTTCCGTGCC AGCACCGCCA GCGATAGCCC ATTTTGGCAA CGCCAAGGCA AACGCTTACC GTTAGGCCTT GGTATCGTTG GCCATGTCGC CGTCACTGGC GAACCAATGC TGGTTAATGA TGTGCGCGAA GAACCACGTT TCTTGCCCGA TCAACATGGC ATCGCCGCCG AACTAGCCGT GCCCATGCGT GTTGGACAGC AGTTGCTCGG CGTGTTGGAT GTGCAAAGCG AGAGCTACGG CGCATTTGAT GAAAATGATT TCTTCGTGGT GCAAACCCTT GCCGATCAAA TTGCGATCGC GATTGATAGC GCCTCGGTCT TCCAATCGCA ACAAGAGGAA GCTTGGGTGC TGAACGCCTT GTTGCAATCA GCCGAAAATT TTGCCTGGGT CAGCGAGATC AGCGAAATGC TCTATCTCAG CGTGCGCCTG CCAGCCTTGT TGGTTGGTTG CGAACGCGCC TTATGTCTCC TTTGGCAGCG TGAAAGCAAT CGCTGGATTT TGGCTGAAGG CTGGGGCTTG ACCAACGAAC AACGCCAATC AATTGGTGCA AGCGCTACCG ATGAGCAAGT GCCATGGCTC GAACGGATGC GTTCTGAAGG CGAATCGTTT GCTGCTGAAT TGGTCGATCT CGAACAGCTT TCAAGCGCAG GCTTAGTGCC CTACAGCAGC TATGGGGCAG TGCTGGCCCA ACCGCTCAAC TCACGCGGAG CCACGCTGGG GGTGCTGTTG CTTGAGCAAT GCGGCCACGA CGAAACTTGG CTGCCACGCC AAATTACTAT CGCCGCTGGA ATTGCTGGCC AAGCTGCGGC GGCGATTGAA AGCGCCTTGC TGGCGCAAAT CGAAGCTGCC CGCCAACATA TCGAACAAGA AATTAGCGTG GCCCGCGAAA TTCAAATGAG CTTGTTGCCA TCGCGCTTGC CGCAACTTGC GGGCTGGGAT AGCGGCGCAC ATTGGAATTT GGCCCGCCAA GTTGGCGGCG ATTTCTACGA TTTTTGGAGT TTTCGCAGCG GGCCTTCGGC AGGTGAGATG GGCTTTGTGA TTGCCGATGT CTCGGATAAG GGCGTGCCTG CGGCCTTGTT TATGGCACTT TCGCGCTCGT TGGTGCGTGG TGCGGCGCTC GATGGCTCAC CACCATCACA GGCGATCGAA CGTGCCAACC GCTGGATTAT GCGCGATAGC CAATCGTATA TGTTCGTAAC GCTTTTCTAC GGAATTATTA ATCCAGTGAC TGGGCGTTTA CGCTACACTT GTGCTGGGCA TAATCCGCCA TTGCTGTATC GCGCCGCTAC AGGCCAGATC GAGCAATTGC GCACACCTGG AATTGCCTTG GGCGTAATCG ATGATGCAGT TTTAGGCGAA GCTGAAACGA TTATTGAATT AGGCGATGTT TTGGTCTGTT ATACCGATGG CGTAACCGAG GCAGTTGATA GTACAATGGA TGAGTGGGGC GTGCCACGTT TGATGGAGAC AATTCATCAG ACCGCCCATT GCGATGCAGC GACTATGTTG CATACAATTA GTAGTCGCCT TGCGGCGCAT ACTGGCGATT TACCAGCCTT CGATGACCTT ACTTTAGTGG TGATTAAACG GCTGGCCGAT GCCAATCAAC CTGTTTGA
|
Protein sequence | MLASFDRGTR LSKHRMVALG WATMLFLLAS ASIIWQGLPI FNSATLPVAG ALLLLLDGLA VRDRSNQPLS LSGIVLLAVA LISTPSTTLV LAAASGLCIR LSRLARSRYE EWGRRALEAG ARTLGIALVL PFISDFGLWS KLIMLIISYI VVVQGARLIF ALLWDGKQIT MGTWQVSAPN IFSIEILPLP LAAIGAQMAE DFPFSLVAIS AAGLVGSAWM VQRASRSLGL QRRTVAELGQ INAISRAIIR AELDVDSLCQ LVYGETSKVV DTSNFRLGLF EGRFFELKVR VQDGHHEPPL RVELPNDRGI VSWIRRTGRS LLVEDFDSEM DRLPAQPTYQ AEYPPRSGVY IPLMTGDEVL GTISIQSSEP RAFDTDDLRL LSLIADQAAV AIDKARAYSA ARRRAAQLAT IGEVSRQVTA ILDLDRLLPS VVHRIRVSFG YSQVHLFTFD ELHQQLFFRA STASDSPFWQ RQGKRLPLGL GIVGHVAVTG EPMLVNDVRE EPRFLPDQHG IAAELAVPMR VGQQLLGVLD VQSESYGAFD ENDFFVVQTL ADQIAIAIDS ASVFQSQQEE AWVLNALLQS AENFAWVSEI SEMLYLSVRL PALLVGCERA LCLLWQRESN RWILAEGWGL TNEQRQSIGA SATDEQVPWL ERMRSEGESF AAELVDLEQL SSAGLVPYSS YGAVLAQPLN SRGATLGVLL LEQCGHDETW LPRQITIAAG IAGQAAAAIE SALLAQIEAA RQHIEQEISV AREIQMSLLP SRLPQLAGWD SGAHWNLARQ VGGDFYDFWS FRSGPSAGEM GFVIADVSDK GVPAALFMAL SRSLVRGAAL DGSPPSQAIE RANRWIMRDS QSYMFVTLFY GIINPVTGRL RYTCAGHNPP LLYRAATGQI EQLRTPGIAL GVIDDAVLGE AETIIELGDV LVCYTDGVTE AVDSTMDEWG VPRLMETIHQ TAHCDAATML HTISSRLAAH TGDLPAFDDL TLVVIKRLAD ANQPV
|
| |