Gene Haur_4477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4477 
Symbol 
ID5736328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5728801 
End bp5730546 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content50% 
IMG OID641281640 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001547237 
Protein GI159900990 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00493244 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACC AACCCGCAGC TTCAGATTCA GCCGCATCAA CCAATCTTCA GACAGCCCTC 
AACGCTGAGG TGGCGTTTGG TCTTCGCCCA ACCGTAACAG GATTGGGTTT TTTATATTTG
TTATTTAGTA TTGCCCATGC GTTGGTTTTG CCCAGCCCAA TTAAACTACC AATGGTCATC
GTAGCGCTCA GTAGTGTCAT TTTTTTTGGA TTTTGGTGGT GGCGCTTGCA AAGCTGGCGG
CCAGCCCCCG AGCTGACCCA TCCGCTGGCC ACGCTCTTTA TTGTGGTTGG CGGCTTCAAT
AGCGTTTTGC ATATCTGGCT GAGCGGCGAG ATTCACCAAA GCACCAATAT TGCCTTTATT
TTAATTGGCA CTGGCTGTTT GTTGCTTTCT TGGAATTGGT TCATCGTAGC GAGTGGGGCT
ATTTTGCTGG CGTGGATTGC GGCGATCATT TCATTGCCAA CCTCACCGTT GACCATGCAT
TTTATTTTTA TGGTGGTTAG TGCCACGATT GCCGCTGCCA CAATTCAAGG CATTCGTTTG
CGCACGGTCA AGGGCTTGAT CAAATTGCGC TTGCAAGAAA GCACCTACAA ACAAGAGCTA
CAAGAGGCCT TAATCCAAAT CAAAATGAGT GAAGAGCGTT TTCGCGCCTT GGCCGAAGCA
ACCTCCGAAG GGGTGGTATT GCAAGATGAA GGCGTGGTGA TGGATGCCAA CGAACGCTTT
GGCGAGATGT TTGGCTATCA TCGTGATGAA ATTCTCGGCC ACTCGTTACG CGAATTTGTC
GAGCCACAAT CGCTGCAACG GGCAATGCAA AAATATAAAG ATGGTGCGCC CTACGAAGTT
ACCGCACTGC GCAAAGATGG CAGCACCTTT ATCGCCTTGG TGTTGGGCAC CAATTTGCCC
TATAGCAATC GGGTGGTGCG GGTTGCGGCG GTGCGCGATA TTACTGAGCA GCGCCATTTT
GAAAATTTAT TGCTGACTGC CAAAGATGAT GCTGAGGCCG CCAACCGCGC CAAAAGCACT
TTCCTTTCAA CTGTCAGCCA CGAATTACGC ACGCCACTGA ATGCGATTGT TGGCTATAGC
GAAATGATCT ACGAGGATTT GATCGATCGC AGCATGCCTG AGTTGGCCAT GGATATGACC
CGCATTCGTA GCGCTAGCGA CCGCTTGTTG AGCTTGATCG ATGGCGTTCT GACGATTACC
GATCTTGATG CGGAAGTTGT GCGTTTGGAG TATGAAACGA TCGATTTGGC GCTGGCGATT
GGCAGCATCA GCGACCAATT GCAAGCCAAA GCCCAAGATA ACAAAAATAC TGTGCAATTA
TTGGGTAGCC AAAACTGGGG TTCGATTATC AGCGATGATC ATAAATTGCG CATGATTATT
TACCATCTGC TGGATAATGC AATTAAATTC ACCCACGCAG GCTTAATTAG CATCTCGGTG
CAACGCTTGC AACACGCTGC TGGCGGTTGG CTCGAAATTG CAATTCGCGA TACGGGGATT
GGCATTGCCC ATGAGCAATT TGAACGGATT TTTGAGCCAT TTGTTCAAGC CGATTCCTCG
GCCACTCGCC AATATGAAGG TACCGGTTTG GGCTTGGCCG TGAGCATGCG GCTGGCTCGT
GCTTTGGGTG GCACGATCGA GCTTGATAGC CGCTTAGGGA TTGGCTCAAC GTTTACTCTG
CATATGCCCG AACATCCCAC CAAACCCAAT GTGCCTTCAC CTCAAATGTC ACACGCGAAC
GTATAG
 
Protein sequence
MSDQPAASDS AASTNLQTAL NAEVAFGLRP TVTGLGFLYL LFSIAHALVL PSPIKLPMVI 
VALSSVIFFG FWWWRLQSWR PAPELTHPLA TLFIVVGGFN SVLHIWLSGE IHQSTNIAFI
LIGTGCLLLS WNWFIVASGA ILLAWIAAII SLPTSPLTMH FIFMVVSATI AAATIQGIRL
RTVKGLIKLR LQESTYKQEL QEALIQIKMS EERFRALAEA TSEGVVLQDE GVVMDANERF
GEMFGYHRDE ILGHSLREFV EPQSLQRAMQ KYKDGAPYEV TALRKDGSTF IALVLGTNLP
YSNRVVRVAA VRDITEQRHF ENLLLTAKDD AEAANRAKST FLSTVSHELR TPLNAIVGYS
EMIYEDLIDR SMPELAMDMT RIRSASDRLL SLIDGVLTIT DLDAEVVRLE YETIDLALAI
GSISDQLQAK AQDNKNTVQL LGSQNWGSII SDDHKLRMII YHLLDNAIKF THAGLISISV
QRLQHAAGGW LEIAIRDTGI GIAHEQFERI FEPFVQADSS ATRQYEGTGL GLAVSMRLAR
ALGGTIELDS RLGIGSTFTL HMPEHPTKPN VPSPQMSHAN V