Gene Haur_4054 details

Gene Information

Locus tag: Haur_4054
Symbol:
ID: 5735912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5171495
End bp: 5174866
Gene Length: 3372 bp
Protein Length: 1123 aa
Translation table: 11
GC content: 35%
IMG OID: 641281205
Product: putative signal transduction histidine kinase
Protein accession: YP_001546814
Protein GI: 159900567
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
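
The length fields in this record follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to end in a stop codon that is not counted in the protein. The record does not state either convention, so both are assumptions here. A minimal Python sketch of that consistency check, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one stop codon
    that is not part of the protein."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(5171495, 5174866)
print(length_bp, protein_length(length_bp))
```

With the values above this gives 3372 bp and 1123 aa, which matches the Gene Length and Protein Length fields.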


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATACA TAATGCGTGT AATTAGCATC TTTTTTGGAT GTTTTGTTGT CATCTCAATG 
GCAATTTCCG TTCGAACTAG CTTAAACACG ATTGATGATC CAGCGCTTGA ATTAATTACC
TATTGGTCTG CAATAAACAA TTGTAATGAA ATTCATGCTA TAACCCCAGC TGAATGGATG
CCCATTGCCC AAGAAATTGT TGCACCAGGC GATTGTATTC TCAGTGTTTC AGGTTATGGT
TATAATTCGC CAGAAATGAT TGATCATCTG AAGAAGAAAC TAGAGCTAGA TTATGCTAAT
CGCTTTGTAC CGGTAGAAAT TAGACGCGAT CAATCTCAGT TTACCGTTTA TCTACCAATC
ACAAAAATTA GTATTCATCA TATTTTACAA ATCTATTTAG CAATCATCCT TACCGCAGCA
TTACTATGGA TTCTAGGTAT TATTGTATTA ATTTCAAACC CAGATAAGGA AAATAATATT
GTATTTGGTT ATATTCTTTG GTTTCTTGCT TTAGCTATTA CAAGTATTCG GCATAGAGTA
CCAGATTTTG GCGAATATCT TACCCTTGTT GCAACTGTAG TGCCATTAGC ACTACTTGGT
GGAAGTTTCT TTCATTTAGG GTATATCTTT CCAGTAAAGA TTGAGCGTTA TCATGGGTTT
AAATATGTGC TGTATATCCC TTCGCTCTTT AGTATTGCTT TATATACCTA TGTTATTATT
AATATTGCCT CAGGTAAAAG TAATATTCCT ACAATTTCCA ATCAAGCCAA CTATATCATT
GCGACCATAT TTATCATCGG ATTTAGCTTT ACATTAATTC GTTGGGCATA TCTCTGGATC
ACCTATCGCC ATCAAACTAA ATCGAAAATT ATTTTGCAAG CTAAAATTGC GTTTACCACT
TGGTTAATAG GTGGATTATT CAACGTTATA TTGATGATTG TGTATCAGGA ATTTAATTTA
AAATTACCAA TAATCGGGAA ATCAATATTT TTTGGCTTAT TTCTATTAGT CATTCCGGCA
GCTGGGACTG CATTTACGCT ATTACGTTAT GAATCAGTTC ATGCAAAACG TTCATTTAGC
TTAGATTTAT TAATGATAAT TTTGATTAGT GCGGTTATTG TGGATATTTC AATCCTTATT
AATAGTTATA TTGAAATTAA TGGAATTATA TTTATCAATA TCTTCTTTGT TGCAGTAATT
ACCTCAATTT TCTGGTATAT CGACAACCCA TTTCGTAGAA TATTTGAAAA ATATTTCCTG
CGCCATCAAA ATGATGCTGA AATATTGTTT GGCTTGTTAA ACCATCTTGA TATCACTAAA
GATATTCATT CAGCAATCTA CTCATCAATT GATTATTTGA CTAAGCAATT AGAGATAGAA
TCTTTGCGTT TTGTGATCGA AAAGAATATT ATCAAAACAT CTGAACATTT TTATATTATG
GAAAACGAGC CAATTGATTT ATTTTTTGAG CACAATTCTG AAGCAGGATC GTTTTCAAAC
TCGCTCAGTA AGTTCTATAA ACATCGAGAA ATTGTGTATG ACAATAACAA CAATCAAATT
GGATACTTAT ACCTTGGGCC AAAAATTACC AATGAAGATT TTGATACCAA AGACTATGAG
CTTATTCGAT TAATCACTCA ATATTTGAGT TGGTTTGTGC TGGCGAAAAA TCAGTTAATC
TTAATCAACC AAATTATCCA AAGGATTATC AAGGCGCGTG ATAGTATTTT TGGTGATATC
AATCATGTTA TTCACGACGA TATTTTAGGC AAACTAAACT CGGTTACGCT TGGCATTGAT
ATGATCTGTG AGTTTGATCA GATTACACCT GATACAAAAG CTCGACTTGT TCAATACAAA
GCTTCGACCG ACCAAACAGT AGAAATCCTG AAACGCTTTA TCATTCAAAA ACAAATTGCA
ATTCCAGGTG TAAAAAAGAA TTTCTTACCA GAAATTCATC GGCTAATTCG TGAGCTTATT
CAACATCAAC AAATCGAACT TCATTGGGAA ATGCCACCAA GAGATGATTT GAATCTTTGG
AATAATCTAT CAATTGACAA AAAACGTGAT ATTTTTCGGA TTATTCATTC AGCAGTTGCA
AATACCCTTG CTTATGCTCA AGCAAGAAAT ATTAATATTA CATTTAGTAA AAATAATGCT
ATGTTATCCT TGTCAATCAT TGATGATGGT GTTGGTTTTG AGATTGATAA AGATATTAAA
CAAAATAGCA CTGGCTTGAT GACGATGTAT GAGCGCACAA AAAATATTGA TGGTATCATC
GAAATACGTT CGATTATCGA TCAAGGTACA ACAATCGAGT TATCAATTCC AATGGAAATA
GCTATTCCGA CTCTGATCCA TGAACAATCA ACTGAGCAAG CAACTCCATT AATTAAAAAG
CCCAAGCTAG AAATGCTCCA TTCAGATATT CAAGTCAAAG AACAATCCAA GAAATATAAT
TTAGCATTAA TCTGTATGGC ATTTATGCTT GGCATTGGCA TAACATTTAT TTTAAAAACC
GATTTATTAT CGCTAGAAAA AGTAGCCTCA CAACCAAAAG TCTGGTTTGA TGTTGATGTT
CGTTTAGTTA ATCAACAAGC TGAGAATGGT GAATTTTGGC GAACTCGTTT ATTTGATGGT
AGTCGTTTTA TCCAAAATTA TGATACTATC GCCGTCGGGA CAAGAGTAAC AATGACATTT
AGTTTAAGAA ACCATAAGGC TAAACCAGCT TTACTCCGAA ACTTAGTCGC AGGGGCACGT
GGTCCTAACG TCTTAGAACA AGGCTGGAGT GCCTCCACAA TGGATTTTCC TTCAGTTCAG
AATATTCTGA TAGAACCATA CAAAACCTAT ACATTTACGG CTAGCCGAAT TTATGATCAG
CCAGGTAATT ATTTTCTAGA GCCAATGTTT GAAGATGAGG CTGGTGATTG GCGAGCTATT
GCTGACTTTA CAAGAATTAC GTTTTTTGTG GCCGATTTAA CTAATCCAAT TGTTAGTGAA
GTCGTTGAGG TTGCTGCTGA TCGTGATTGG AAATATACGC CAATCTATGT TCAACCTGGC
GATACAATCG AATTTATAGC TCAAAAAGGT TCTTGGACAA CTGATATGAA TAGTCTACCA
TTCGTTAATG CTGATGGGTA TCAAAACCAG CATTACGATT GGACAACACT GCCAGCTGCA
AATTATGGGC AATTAATTGG CTCAATTGGC GATTGGAAGT TTGCAATTGG CAAAGAATCG
AGCATTCAAG CCCCGAACTA TCAGGGAATC TTACGACTTG GAATTAATGA TGCGCATTGT
GCCGATGTCT GTTTGAGCGA TAATCGTGGC TCGATGATGG TTGCTATTGT GGTCAGACGC
GCTAAAAAAT AA
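
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before the sequence is measured or compared with the GC content field. The following is a minimal Python sketch of that clean-up step. The helper names are illustrative and not part of any database tool, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as printed (six blocks of 10 bases).
first_row = "ATGAAATACA TAATGCGTGT AATTAGCATC TTTTTTGGAT GTTTTGTTGT CATCTCAATG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full sequence text through `clean_sequence` first would let its length be checked against the Gene Length field and its GC percentage against the GC content field.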
 
Protein sequence
MKYIMRVISI FFGCFVVISM AISVRTSLNT IDDPALELIT YWSAINNCNE IHAITPAEWM 
PIAQEIVAPG DCILSVSGYG YNSPEMIDHL KKKLELDYAN RFVPVEIRRD QSQFTVYLPI
TKISIHHILQ IYLAIILTAA LLWILGIIVL ISNPDKENNI VFGYILWFLA LAITSIRHRV
PDFGEYLTLV ATVVPLALLG GSFFHLGYIF PVKIERYHGF KYVLYIPSLF SIALYTYVII
NIASGKSNIP TISNQANYII ATIFIIGFSF TLIRWAYLWI TYRHQTKSKI ILQAKIAFTT
WLIGGLFNVI LMIVYQEFNL KLPIIGKSIF FGLFLLVIPA AGTAFTLLRY ESVHAKRSFS
LDLLMIILIS AVIVDISILI NSYIEINGII FINIFFVAVI TSIFWYIDNP FRRIFEKYFL
RHQNDAEILF GLLNHLDITK DIHSAIYSSI DYLTKQLEIE SLRFVIEKNI IKTSEHFYIM
ENEPIDLFFE HNSEAGSFSN SLSKFYKHRE IVYDNNNNQI GYLYLGPKIT NEDFDTKDYE
LIRLITQYLS WFVLAKNQLI LINQIIQRII KARDSIFGDI NHVIHDDILG KLNSVTLGID
MICEFDQITP DTKARLVQYK ASTDQTVEIL KRFIIQKQIA IPGVKKNFLP EIHRLIRELI
QHQQIELHWE MPPRDDLNLW NNLSIDKKRD IFRIIHSAVA NTLAYAQARN INITFSKNNA
MLSLSIIDDG VGFEIDKDIK QNSTGLMTMY ERTKNIDGII EIRSIIDQGT TIELSIPMEI
AIPTLIHEQS TEQATPLIKK PKLEMLHSDI QVKEQSKKYN LALICMAFML GIGITFILKT
DLLSLEKVAS QPKVWFDVDV RLVNQQAENG EFWRTRLFDG SRFIQNYDTI AVGTRVTMTF
SLRNHKAKPA LLRNLVAGAR GPNVLEQGWS ASTMDFPSVQ NILIEPYKTY TFTASRIYDQ
PGNYFLEPMF EDEAGDWRAI ADFTRITFFV ADLTNPIVSE VVEVAADRDW KYTPIYVQPG
DTIEFIAQKG SWTTDMNSLP FVNADGYQNQ HYDWTTLPAA NYGQLIGSIG DWKFAIGKES
SIQAPNYQGI LRLGINDAHC ADVCLSDNRG SMMVAIVVRR AKK