Gene Haur_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1473 
Symbol 
ID5733358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1718640 
End bp1722782 
Gene Length4143 bp 
Protein Length1380 aa 
Translation table11 
GC content45% 
IMG OID641278611 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001544245 
Protein GI159897998 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTGTT ATGCCAATGA ATCGCTGCTG GAACTCCAAT TAAAGCAACA AGTCAGCCTG 
CTTGAAACCG AACTTCAGCA TGTTCGCAGC CATACTGCCA AACTCCAAGC CTTGATTGAC
CATACTGAGA GCCTGATTTG GTCGATTGAT AGTAGTCAGC ATTTAATTAT GAGCAACCGC
ACCTTTCATG ATTATTTGCA TCAATGGTAC GACCAACATA TTGAGCTGGG CCAGCCAATG
AGCATGTTGC ATCTGCCTAA ACCAATTGCT AAGCCGTGGA ATGAAGCCTA TCGCCGCGCG
TTGCAAGGTG AGGCGTTTGC TCAAGAATTC GTGCATCACT ATGCTGGGGC CACTCGTTAC
TATGAATTTC ACTTTGCGCC AATTCTCGAT GAACAGCGAG TGGTGCAAGG GGTTAATATT
TCGGGCCGCG ATATTAGCAG CCGCCGTTTG GCCGAAGTGA TGTTGCGCAG CAGCGAGGCT
CGCTATCGGG CAATTAACGA GGCCGCTCCG TATGGGGTCT TTTTCTGCGA TACCCAAGGC
CAATGTGTCT ATGCCAACAC TGCGCTCTTG CATATGCTTG AAGTTAAGCT CGAAGATGTA
CTGGGTGATG CTTGGATGGA ATTTTTACAC CCTAAAGATG CCTCAGATAG CGAGAGTTTT
TGGCACAATT TGGCGCTTTC AGTCAATCAA CAGCCGATTC CTAGCGTTTT GTTTGATCAA
ACGGTGCGTT TGCATACCAA GCTGAATCAA TTAATTTGGC TGCGGTTGCG CATTTCGCCA
GTAATTGAAG CTGGCGTGCT CTGTGGCTTT GTTGGCATCA CTGATGATCT GAGTGCCTTG
AAAAAAGCTG AAACGGCGGC CCAAGCCAAG CAGCATTTTA TTCAAAAAAT TGCCGATACC
TTGCCATCGC AACTGTTTAT TTACGATCGT GCTTCGGCCT CGTTGATCTA TACCAACGAA
GTGAGCCGCC AATGGTTCAA AAGCTTCAAC ATTGATCCTA GTGATATGAA CACCATTCGG
CCCCTATTCC ACCCTGACGA TCATGCTATC ATCCGCAAAG TCCTACAAGG ACGTACAGTT
ACTAGCAGCG ATCCCCAATT AATTGTTGAT TTTGAATGTC GCTTACGCTC ACCAGAAGGC
ATCTATCGCT ATTTTGCTCT ACGTATGACC CCTTTTGTAA TCGATGCTGA TGGTACAGTT
TCACAATTAT TGGGCGTGGC AAGCGATATT ACTGAACGTA AATTACAAGA GCAGCAAATT
CGGCAATTGA ACGAGCAGCT TGAACAACGT GTTATCGAAC GAACCCAAGA GTTAGCTCAA
TCGTATCGGT TTCAGCGCAC TATGATTCAA CATGCACCTT CAATTATTAT TTCGCTTGAT
GCTGGAGGCG TGGTGCGCGG CTTTAATCGC GCGGCTGAAT TTGAGTTTGG CTATCAAGAA
ACTGAATTAT TAGGCCGTCC CTTCCCGACT AAATTATTTG ATCGCTCCGA TTTGAGGTAT
CGCTGGGAGA CCGAGCAACA TCGTAATCGT GGCTTGTTCT ATTCCGATTT GGATATTTTG
CTGGCCGAGG CGCGGCGTGG TGTGGCTGAG CCTCACGAAT GGCAAGCTAT CCACCGTTCT
GGCTCACGCT TTCCGCTTGA ATTAACCATC ACGCCACTGT TGCATGCCGA TGTGCTCGAA
GGCTTTTTGC TGATTGGCAA TAATATCGCT GCCCGCAAAC GCACCGAAGA AGAATTTCAT
TTGCTCTATC GTACAACCCG CTCGGTCAGT GAGGCCGCTG ATTTTAATAG TGCGCTTGAA
GTGGTGTTGC GCAACATCTG TGCAGCAATT GGCTGGGATT TGAGTGTAGC CTGGGTTCCA
GATGCCAACC AAGATTTCTT GGCGCTTGCC CCGATTCGCT GGAGCAGCAA CGAACGTTTT
CAACATTTTT ATACGTATCT TCAAAGCTTG GAATTACCGC AAGGTGCTGG TTTGGCTGGG
CGGGTCTGGC AAACCAAACA CTCAGCCAAT TATGCGATTG AGCACGATCA TTGGACTGAA
GGCAATCTTA ATCAACGTGG CTACGAATTA GCCCAGCAAG TTGGGCTTCA ATCGGCGGTT
GCTGTGCCAA TTTTGGCTAA TGATCAAGTT GTGGCAATTT TGGAATTTTT TCGCAGTAGT
CGTGGGGTTG ATCACGAACG CACCACTAAT TTGATTTCAG TGATTGCCAC CCAATTGGGC
ACGTTATTTC AACGCAAGCA TGCCGAAATG CAACTGCGCC AGAGCGAAGC CAAAAATCGA
GCGTTGCTAG CAGCCATGCC CGATTTGATG ATTCGATTTA CGCTCAAGGG CGAGATTTTA
GATTATCACA CCAACGATCC CTCGGATCTC TTTTTGCCCC AAGATGCCAT GATTGGGGCG
AATGCCCATA ATCATCAGCC GCAACCGCAA ATTCTGAATA TTATGAGCGC TACTCAACGG
GCGATTGAGA CCAATAGCAC CCAAAATGTT GAGTATGAAC TGACCTTGCC CAAGGGGAAT
ACCGTATTTG AGGCGCGGAT TGCGCCAAGT GCTAATGATG AGGTCGTGAT GGTGATTCGC
AATATTACCG AGCGCAAGCG GATTGAACAA ACCTTGCAGC AACAAACCGA TGAATTGAGC
ATTGCCAATG CTGAACTAGC CAAAGCAGCA CGGCTTAAAG ATGAATTTTT GGCTAGTATG
AGCCATGAAC TGCGTACACC CTTAACTGGA ATCTTAGCCT TTACTGAAGC CTTGCGCTAC
GACCAATATG GAGCATTAAA CCAAGCTCAA GCACAAGCAT TGCAACAAAT TGATGAAAAT
AGCCGCCATT TACTCGATTT GATCAACGAT ATTCTTGATC TTTCCAAAAT TGAAGCTGGC
AAACTTACAA TTAATCATCA ATCAATGCTG ATTGATGAAA TTTGCCAAGC CAGCATTCGC
ATGGTGCAAA AATTAGCTCA GAATAAACAA CTTGAATTAT TGTATGAACC ATGTGCCGCC
GATGCGATGC TTTGCGCCGA TTCACGTCGT TTGAAACAAA TGTTGGTCAA TTTATTAAGC
AATGCGGTGA AGTTTACACC TGCTGGTGGC CGAATTGGTT TATCAGTCAA GTTGGATAGC
CAATTCCATC AGGTTGAATT AACCGTTTGG GATACTGGGA TTGGCATCAA TACTCAGGAT
ATCCCAAAAT TATTTCGGCC ATTTTCACAG CTTGATAGCA AACTTTCACG CCAATATGCA
GGAACTGGCT TAGGTTTGGC CTTAGTGTAC CATATGGCCA ATTTGCATGG TGGACGAGTT
GAATTACAAA GTGAAGTTGG GGTTGGTAGT CAGTTTCGCC TAGTGCTGCC ATGGTATGGC
AATGCTGAAA TTGTTGAGCC AACCGAGCAA CCGATGCTAC TGGCGGTGAC CGAAGATCGC
ACCCTGAATA TGCAATTGGC GGCCTATGCC GAACAATTGG GTTTAGCGCT ACGTTTTTGC
AAACCATATT TCGATGTACA AGCTTACTTA CAACAACCCA ATCAAACGCT CATTTATGAT
TTACGCCAAA CAAGCTTATC ACAACAGCTT TTTGAGCAAC TTCGCCATCA AACTGCCGCT
CATCCAGTCA TTTTATTCTG TGATCAAGAT TTCAAGCTTG ATCTGACGAT TCCTGCGCAT
TGGCATTGCA TGTATCAGCC GTTAAATCAG GCTCGTTTGT TGAATGCATT GCAATATATC
GATTCGCGCT ATCAGCTGCC GCAAAAATTG CCGATTAATC AAGCCAAGCA GATTCTGTTT
GCTGCCGATA ATTTGGCTAA TAGCTTATTG ATTCGCGATT TCTTGAGTGA ATTTGGCTGG
AATGTCAACT TAGTCTACAA CCAACACGAT ATCTATGAAG CAATTGCCAA TCAGCCAATT
GATTTATTGA TGCTCGATTT ACAATTAGCT GGCGGTGATG CACTCCAGAT GGTTAATACA
ATTCGACGAC ATAAGCGTTA TTATGATCTG CCGATTATAG CTTTAAGCGC TTTAGCAATT
CTTGATCAAC CACAGCTTGC CGAGCAAGCC GATACAGTTT TATATAAACC ACTTAATTTA
GTTGAACTTG AACAATTAAT TAATACCTTG TGCAACCAAC AAAGGAGTCT TGATCGTGAA
TAA
 
Protein sequence
MTCYANESLL ELQLKQQVSL LETELQHVRS HTAKLQALID HTESLIWSID SSQHLIMSNR 
TFHDYLHQWY DQHIELGQPM SMLHLPKPIA KPWNEAYRRA LQGEAFAQEF VHHYAGATRY
YEFHFAPILD EQRVVQGVNI SGRDISSRRL AEVMLRSSEA RYRAINEAAP YGVFFCDTQG
QCVYANTALL HMLEVKLEDV LGDAWMEFLH PKDASDSESF WHNLALSVNQ QPIPSVLFDQ
TVRLHTKLNQ LIWLRLRISP VIEAGVLCGF VGITDDLSAL KKAETAAQAK QHFIQKIADT
LPSQLFIYDR ASASLIYTNE VSRQWFKSFN IDPSDMNTIR PLFHPDDHAI IRKVLQGRTV
TSSDPQLIVD FECRLRSPEG IYRYFALRMT PFVIDADGTV SQLLGVASDI TERKLQEQQI
RQLNEQLEQR VIERTQELAQ SYRFQRTMIQ HAPSIIISLD AGGVVRGFNR AAEFEFGYQE
TELLGRPFPT KLFDRSDLRY RWETEQHRNR GLFYSDLDIL LAEARRGVAE PHEWQAIHRS
GSRFPLELTI TPLLHADVLE GFLLIGNNIA ARKRTEEEFH LLYRTTRSVS EAADFNSALE
VVLRNICAAI GWDLSVAWVP DANQDFLALA PIRWSSNERF QHFYTYLQSL ELPQGAGLAG
RVWQTKHSAN YAIEHDHWTE GNLNQRGYEL AQQVGLQSAV AVPILANDQV VAILEFFRSS
RGVDHERTTN LISVIATQLG TLFQRKHAEM QLRQSEAKNR ALLAAMPDLM IRFTLKGEIL
DYHTNDPSDL FLPQDAMIGA NAHNHQPQPQ ILNIMSATQR AIETNSTQNV EYELTLPKGN
TVFEARIAPS ANDEVVMVIR NITERKRIEQ TLQQQTDELS IANAELAKAA RLKDEFLASM
SHELRTPLTG ILAFTEALRY DQYGALNQAQ AQALQQIDEN SRHLLDLIND ILDLSKIEAG
KLTINHQSML IDEICQASIR MVQKLAQNKQ LELLYEPCAA DAMLCADSRR LKQMLVNLLS
NAVKFTPAGG RIGLSVKLDS QFHQVELTVW DTGIGINTQD IPKLFRPFSQ LDSKLSRQYA
GTGLGLALVY HMANLHGGRV ELQSEVGVGS QFRLVLPWYG NAEIVEPTEQ PMLLAVTEDR
TLNMQLAAYA EQLGLALRFC KPYFDVQAYL QQPNQTLIYD LRQTSLSQQL FEQLRHQTAA
HPVILFCDQD FKLDLTIPAH WHCMYQPLNQ ARLLNALQYI DSRYQLPQKL PINQAKQILF
AADNLANSLL IRDFLSEFGW NVNLVYNQHD IYEAIANQPI DLLMLDLQLA GGDALQMVNT
IRRHKRYYDL PIIALSALAI LDQPQLAEQA DTVLYKPLNL VELEQLINTL CNQQRSLDRE