Gene Information |
Locus tag | Haur_2154 |
Symbol | |
ID | 5734027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2712626 |
End bp | 2715553 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279295 |
Product | two component transcriptional regulator |
Protein accession | YP_001544922 |
Protein GI | 159898675 |
COG category | [K] Transcription; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain; [COG3903] Predicted ATPase |
TIGRFAM ID | |
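
The coordinate and length fields in the table above are related by simple arithmetic, and the short Python sketch below checks that they agree. It assumes 1-based inclusive coordinates and an unspliced CDS ending in one stop codon; both are assumptions that fit the listed values, not statements from the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2712626
END_BP = 2715553
GENE_LENGTH_BP = 2928
PROTEIN_LENGTH_AA = 975


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2928
    print(expected_protein_length(GENE_LENGTH_BP))  # 975
```

Under these assumptions the computed span matches the listed gene length, and the codon count less the stop codon matches the listed protein length.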
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000789084 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAC AGCCTACAAT CTTAGTGATT CAACCAGATC GAGCCTTGCA AGCCTTAATT GTGCGGGTCT TGAAAACCGC TAGTTTTCAG GTGTTACGCA GCGATAGTAT TCAGTCAGCC CAGCTCCTCG TTGAGCAAGA GGCGCTTGAT TTGGTGGTGA TCGATCAGTG GATTCCCGAC CATGATGCGT TGGAGTTTTG TCGTAATCTT CGTGAGCATT CAAATTTGCC GTTGCTAATG GTGGGAGCCA GCAGCGATGC CTATATGCGA GGCATTGCGC TCGATCAAGG GATTGATGAT TATGTAATTA CGCCGTTTCA TGAGAGCGAA TTTTTGGCCC GCGTTCGAGC CTTGTTGCGG CGCAGCCAAC AAGCGACTCA GCAACAACAG CCGCAATATA CCGAATGTGG CGCGTTGCTG ATCAACTGGC AAGACCAGCA AGTACGCAAA TATGGCGAAT TAGTTCATCT TACCAAAACT GAATGGGCCT TGCTCAAATT ATTTATTCAA TATCATGGCC AAGTTTTAAC CCATCGTATG TTGCTACAAC AGGTTTGGGG AAAAAGCTAT AGCGAAGATC GGGCCTATCT GCATGCCTAT ATTCGGCGTT TACGCGCCAA ACTCGAAGAC GATCCAACCA ATCCGCAATT AATTCACTCA GAATCAGGGA TTGGCTATCG TTTTATGCGG ATTGAAGCGC CTACTCAAGC GCCGCAAGCC AACCCGACCA GCCTTGCCCA TTTGCGCTTA CCCTACCCGA TCGCCGCGCT GATTGGCCGC CAGCATGAAT TAACCGCCTT ACAAGAACTG CTGAGCAAAT CCGAAGTCCG ATTAATCACG ATCACAGGGA TGGGTGGATC GGGCAAAACC AGCATGGCCA GCTATATTGC CCAGCAATTG CATCAAACGC AACATATGCC AGTGGTATTT GTCGCGCTCG ACACCATCAA CGATCCCAAT ATGGTAGCGG CAACCTTAGC CCGCGCCGCT GGCTTACGCG ACCACGGCGA TGATCAGTCA CTCGAACGCT TGCAAGATTG GATTAGCAAT CAAACCATGC TCTTCATTTT GGATAATTTT GAGCAGGTGT TGGGTGCAGC GCCGCAAGTC AGCCAATTGC TCCAGCATTG CCCAAATTTA AAAATTGTGG TCACCAGCCG AATTGTTTTG GGAGTGTATG GTGAATATGA GTTTGTACTG CCCCCGCTCG GTTTGCCCGA CCTGCAACAA AGCCCACCAC TTGAGCAAAT TGCCGCTAGT CCGGCGATTC AACTCTTTGT GCAGCGAGCG CAAGCAGTTG ATAGCCAATT TCGCCTGACT GCTGAAAATG CTGCTAGCGT GGCCGAAATT TGTGTGCGGC TCGATGGTTT GGCCTTGGCG ATTGAGTTGG CAGCCGTTCA CAGCAAATTT TATCCGCCCA AGGTGATGTT GCAACGACTC AACCAGCGGC TCGATTTTCT CTATCATAAC AGCCCGGATC GAACACAGCG CCAACATTCG CTGCGCGGGG CAATCGATTG GAGCTACGAA CTGCTTGGCA GCTATGAACA AACGATTTTT CAAGGCCTTG GCTTGTTTGC TGGCAGTTTT ACCCGTGAGG CGGCCCAAGC GTTATGGCCC AACGACGAAC CAAGCCGGAT CGAACGGGTT TTGCAGCATT TGGTCAATGC CAGTTTGTTA CAACGCGAAA CCAGCAGCGA CGGCCTGAGT TGGTTCGCCA TGCTCGATAC CATTCGCGAA TATAGCTTGA GTAAATTGCC AAAAGGCGAG GCCCATTGGC TCCAACAACA ATTACTTGAT 
TATTATGTTG AGTTGATGCA GCAAGCCGAA CAAGCCTTTT TGGTCAGCAA CCATACTGGC TGGATCAAAC GGCTCGAACG CGAACTGCCA ACGATTCGCA GCATTCTCGC ATGGGGCATT CAGCAAGAAT ATAGCTTGGC AGTCTGGCAA TTATGCGCGA GTTTTTGGCG CTTTTGGCAT GAGCAAGGCC TGATCAGCGA AGGGCGCGAA TGGCTAGCCA AAATCCAACA CCTGCAACCA AGCACAATCC CCTTGGCAAT TCGCGATAAA GTGCGGCTTG GGGCGGGAGT TTTGGCTTTT ATCCAAGATG ATTATTCAGC AGCTAATCAG GCTTTTAGCG AAGTGTTGGT CGAGCCACGC GCCGAACATC AACCCAAGGC AATTGCCCAT GCACTAACCA ATATTGGCAT GGTCGCCTAT TGGCAAGGGC GTTATGGCGA GGCAATTCAG GCCTTGGAAG AAAGTCTGCC ATTATTAAAA ATGCTTGATG ATCGCTATGG TATGGCCAGC AGTTTGCGCC ATTTGGGCAT GAGTCAGTTG GTGCAACATG GCTCACGCAG TGCCTTGGCG CTGTTAGCCG AAAGTTTAAG CTTCTATCAA GAGCTTGGCA GCAAAAGTGG CATTGGCACG GCGATGGGGT TTTATGGTCG GGCTTTATTA ATTTATGGCG ACGATCACGA AGCTCAGCAA TGGCTCGAAC AAAGCATCGC CATGCTTGAG CCATTGGGCA ATTGGCCTGC CATGGCCCGT AGTCAAACCT TTTTGGGGCG AGTAGCCTTG GCCCAACGCC GTTATGCAGA TGCTCAACAG TTGCTCAGCC AAAGTTTAGC CACGCTCTAT CGGGTTGGTG ATCGCGAAGG CGTGGCTGCT TCAATCGAGG GTTTAGCCGT TTGGAGCGCA CTCAACCAGC AAGCTGAGCG GGCACAAGCG CTTTGGAGTG GAGCAGATTG GCTACGTGAA TTAATTGGTG CACCAATTCC ACCAGCCGAT TTTCAAGCGC TACGCCGCAT GTTGCCCCAA TCTTTCAGTT TTATGCAGCA AGCTCAAACG CCCAAATCGC TACGTCACTT GGTTGGCTGC GCCTTAGCCA GCGATTGTAG CTCGTTGGGA TGCGATGAAC ATGGATAG
|
Protein sequence | MTQQPTILVI QPDRALQALI VRVLKTASFQ VLRSDSIQSA QLLVEQEALD LVVIDQWIPD HDALEFCRNL REHSNLPLLM VGASSDAYMR GIALDQGIDD YVITPFHESE FLARVRALLR RSQQATQQQQ PQYTECGALL INWQDQQVRK YGELVHLTKT EWALLKLFIQ YHGQVLTHRM LLQQVWGKSY SEDRAYLHAY IRRLRAKLED DPTNPQLIHS ESGIGYRFMR IEAPTQAPQA NPTSLAHLRL PYPIAALIGR QHELTALQEL LSKSEVRLIT ITGMGGSGKT SMASYIAQQL HQTQHMPVVF VALDTINDPN MVAATLARAA GLRDHGDDQS LERLQDWISN QTMLFILDNF EQVLGAAPQV SQLLQHCPNL KIVVTSRIVL GVYGEYEFVL PPLGLPDLQQ SPPLEQIAAS PAIQLFVQRA QAVDSQFRLT AENAASVAEI CVRLDGLALA IELAAVHSKF YPPKVMLQRL NQRLDFLYHN SPDRTQRQHS LRGAIDWSYE LLGSYEQTIF QGLGLFAGSF TREAAQALWP NDEPSRIERV LQHLVNASLL QRETSSDGLS WFAMLDTIRE YSLSKLPKGE AHWLQQQLLD YYVELMQQAE QAFLVSNHTG WIKRLERELP TIRSILAWGI QQEYSLAVWQ LCASFWRFWH EQGLISEGRE WLAKIQHLQP STIPLAIRDK VRLGAGVLAF IQDDYSAANQ AFSEVLVEPR AEHQPKAIAH ALTNIGMVAY WQGRYGEAIQ ALEESLPLLK MLDDRYGMAS SLRHLGMSQL VQHGSRSALA LLAESLSFYQ ELGSKSGIGT AMGFYGRALL IYGDDHEAQQ WLEQSIAMLE PLGNWPAMAR SQTFLGRVAL AQRRYADAQQ LLSQSLATLY RVGDREGVAA SIEGLAVWSA LNQQAERAQA LWSGADWLRE LIGAPIPPAD FQALRRMLPQ SFSFMQQAQT PKSLRHLVGC ALASDCSSLG CDEHG
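
The sequences above are printed in blank-separated blocks of ten letters. The Python sketch below shows one way to strip that grouping, estimate GC content, and translate the gene sequence. It is not the database's own method, and the helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in which start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-letter block spacing and any line breaks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGACGCAAC AGCCTACAAT CTTAGTGATT"
    print(translate(first_blocks))  # MTQQPTILVI
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and GC content fields listed in this record.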