Gene Haur_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2154 
Symbol 
ID5734027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2712626 
End bp2715553 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content51% 
IMG OID641279295 
Producttwo component transcriptional regulator 
Protein accessionYP_001544922 
Protein GI159898675 
COG category[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000789084 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAC AGCCTACAAT CTTAGTGATT CAACCAGATC GAGCCTTGCA AGCCTTAATT 
GTGCGGGTCT TGAAAACCGC TAGTTTTCAG GTGTTACGCA GCGATAGTAT TCAGTCAGCC
CAGCTCCTCG TTGAGCAAGA GGCGCTTGAT TTGGTGGTGA TCGATCAGTG GATTCCCGAC
CATGATGCGT TGGAGTTTTG TCGTAATCTT CGTGAGCATT CAAATTTGCC GTTGCTAATG
GTGGGAGCCA GCAGCGATGC CTATATGCGA GGCATTGCGC TCGATCAAGG GATTGATGAT
TATGTAATTA CGCCGTTTCA TGAGAGCGAA TTTTTGGCCC GCGTTCGAGC CTTGTTGCGG
CGCAGCCAAC AAGCGACTCA GCAACAACAG CCGCAATATA CCGAATGTGG CGCGTTGCTG
ATCAACTGGC AAGACCAGCA AGTACGCAAA TATGGCGAAT TAGTTCATCT TACCAAAACT
GAATGGGCCT TGCTCAAATT ATTTATTCAA TATCATGGCC AAGTTTTAAC CCATCGTATG
TTGCTACAAC AGGTTTGGGG AAAAAGCTAT AGCGAAGATC GGGCCTATCT GCATGCCTAT
ATTCGGCGTT TACGCGCCAA ACTCGAAGAC GATCCAACCA ATCCGCAATT AATTCACTCA
GAATCAGGGA TTGGCTATCG TTTTATGCGG ATTGAAGCGC CTACTCAAGC GCCGCAAGCC
AACCCGACCA GCCTTGCCCA TTTGCGCTTA CCCTACCCGA TCGCCGCGCT GATTGGCCGC
CAGCATGAAT TAACCGCCTT ACAAGAACTG CTGAGCAAAT CCGAAGTCCG ATTAATCACG
ATCACAGGGA TGGGTGGATC GGGCAAAACC AGCATGGCCA GCTATATTGC CCAGCAATTG
CATCAAACGC AACATATGCC AGTGGTATTT GTCGCGCTCG ACACCATCAA CGATCCCAAT
ATGGTAGCGG CAACCTTAGC CCGCGCCGCT GGCTTACGCG ACCACGGCGA TGATCAGTCA
CTCGAACGCT TGCAAGATTG GATTAGCAAT CAAACCATGC TCTTCATTTT GGATAATTTT
GAGCAGGTGT TGGGTGCAGC GCCGCAAGTC AGCCAATTGC TCCAGCATTG CCCAAATTTA
AAAATTGTGG TCACCAGCCG AATTGTTTTG GGAGTGTATG GTGAATATGA GTTTGTACTG
CCCCCGCTCG GTTTGCCCGA CCTGCAACAA AGCCCACCAC TTGAGCAAAT TGCCGCTAGT
CCGGCGATTC AACTCTTTGT GCAGCGAGCG CAAGCAGTTG ATAGCCAATT TCGCCTGACT
GCTGAAAATG CTGCTAGCGT GGCCGAAATT TGTGTGCGGC TCGATGGTTT GGCCTTGGCG
ATTGAGTTGG CAGCCGTTCA CAGCAAATTT TATCCGCCCA AGGTGATGTT GCAACGACTC
AACCAGCGGC TCGATTTTCT CTATCATAAC AGCCCGGATC GAACACAGCG CCAACATTCG
CTGCGCGGGG CAATCGATTG GAGCTACGAA CTGCTTGGCA GCTATGAACA AACGATTTTT
CAAGGCCTTG GCTTGTTTGC TGGCAGTTTT ACCCGTGAGG CGGCCCAAGC GTTATGGCCC
AACGACGAAC CAAGCCGGAT CGAACGGGTT TTGCAGCATT TGGTCAATGC CAGTTTGTTA
CAACGCGAAA CCAGCAGCGA CGGCCTGAGT TGGTTCGCCA TGCTCGATAC CATTCGCGAA
TATAGCTTGA GTAAATTGCC AAAAGGCGAG GCCCATTGGC TCCAACAACA ATTACTTGAT
TATTATGTTG AGTTGATGCA GCAAGCCGAA CAAGCCTTTT TGGTCAGCAA CCATACTGGC
TGGATCAAAC GGCTCGAACG CGAACTGCCA ACGATTCGCA GCATTCTCGC ATGGGGCATT
CAGCAAGAAT ATAGCTTGGC AGTCTGGCAA TTATGCGCGA GTTTTTGGCG CTTTTGGCAT
GAGCAAGGCC TGATCAGCGA AGGGCGCGAA TGGCTAGCCA AAATCCAACA CCTGCAACCA
AGCACAATCC CCTTGGCAAT TCGCGATAAA GTGCGGCTTG GGGCGGGAGT TTTGGCTTTT
ATCCAAGATG ATTATTCAGC AGCTAATCAG GCTTTTAGCG AAGTGTTGGT CGAGCCACGC
GCCGAACATC AACCCAAGGC AATTGCCCAT GCACTAACCA ATATTGGCAT GGTCGCCTAT
TGGCAAGGGC GTTATGGCGA GGCAATTCAG GCCTTGGAAG AAAGTCTGCC ATTATTAAAA
ATGCTTGATG ATCGCTATGG TATGGCCAGC AGTTTGCGCC ATTTGGGCAT GAGTCAGTTG
GTGCAACATG GCTCACGCAG TGCCTTGGCG CTGTTAGCCG AAAGTTTAAG CTTCTATCAA
GAGCTTGGCA GCAAAAGTGG CATTGGCACG GCGATGGGGT TTTATGGTCG GGCTTTATTA
ATTTATGGCG ACGATCACGA AGCTCAGCAA TGGCTCGAAC AAAGCATCGC CATGCTTGAG
CCATTGGGCA ATTGGCCTGC CATGGCCCGT AGTCAAACCT TTTTGGGGCG AGTAGCCTTG
GCCCAACGCC GTTATGCAGA TGCTCAACAG TTGCTCAGCC AAAGTTTAGC CACGCTCTAT
CGGGTTGGTG ATCGCGAAGG CGTGGCTGCT TCAATCGAGG GTTTAGCCGT TTGGAGCGCA
CTCAACCAGC AAGCTGAGCG GGCACAAGCG CTTTGGAGTG GAGCAGATTG GCTACGTGAA
TTAATTGGTG CACCAATTCC ACCAGCCGAT TTTCAAGCGC TACGCCGCAT GTTGCCCCAA
TCTTTCAGTT TTATGCAGCA AGCTCAAACG CCCAAATCGC TACGTCACTT GGTTGGCTGC
GCCTTAGCCA GCGATTGTAG CTCGTTGGGA TGCGATGAAC ATGGATAG
 
Protein sequence
MTQQPTILVI QPDRALQALI VRVLKTASFQ VLRSDSIQSA QLLVEQEALD LVVIDQWIPD 
HDALEFCRNL REHSNLPLLM VGASSDAYMR GIALDQGIDD YVITPFHESE FLARVRALLR
RSQQATQQQQ PQYTECGALL INWQDQQVRK YGELVHLTKT EWALLKLFIQ YHGQVLTHRM
LLQQVWGKSY SEDRAYLHAY IRRLRAKLED DPTNPQLIHS ESGIGYRFMR IEAPTQAPQA
NPTSLAHLRL PYPIAALIGR QHELTALQEL LSKSEVRLIT ITGMGGSGKT SMASYIAQQL
HQTQHMPVVF VALDTINDPN MVAATLARAA GLRDHGDDQS LERLQDWISN QTMLFILDNF
EQVLGAAPQV SQLLQHCPNL KIVVTSRIVL GVYGEYEFVL PPLGLPDLQQ SPPLEQIAAS
PAIQLFVQRA QAVDSQFRLT AENAASVAEI CVRLDGLALA IELAAVHSKF YPPKVMLQRL
NQRLDFLYHN SPDRTQRQHS LRGAIDWSYE LLGSYEQTIF QGLGLFAGSF TREAAQALWP
NDEPSRIERV LQHLVNASLL QRETSSDGLS WFAMLDTIRE YSLSKLPKGE AHWLQQQLLD
YYVELMQQAE QAFLVSNHTG WIKRLERELP TIRSILAWGI QQEYSLAVWQ LCASFWRFWH
EQGLISEGRE WLAKIQHLQP STIPLAIRDK VRLGAGVLAF IQDDYSAANQ AFSEVLVEPR
AEHQPKAIAH ALTNIGMVAY WQGRYGEAIQ ALEESLPLLK MLDDRYGMAS SLRHLGMSQL
VQHGSRSALA LLAESLSFYQ ELGSKSGIGT AMGFYGRALL IYGDDHEAQQ WLEQSIAMLE
PLGNWPAMAR SQTFLGRVAL AQRRYADAQQ LLSQSLATLY RVGDREGVAA SIEGLAVWSA
LNQQAERAQA LWSGADWLRE LIGAPIPPAD FQALRRMLPQ SFSFMQQAQT PKSLRHLVGC
ALASDCSSLG CDEHG