Gene Haur_2940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2940 
Symbol 
ID5734812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3715489 
End bp3716760 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content48% 
IMG OID641280084 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001545706 
Protein GI159899459 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATG GATGGGGCAA TTTAGCGCAA TCATGGTTCA ACGCCTATCA ACGCATCGGC 
CAACAACTTG ATCCATTGCC GTGGATTACA TGGCTGATTG CCTGCTTTAG TGTGGGAGTC
GGCGCTCTCG GCTTAACTGG CATCGATCAA ACCGCGCCTA AACTCTTAAT TTGGATCGAT
TTTGGCCTAG CGTTGTATCT GTTTTGGCTG GGATGGGTGG TCTATCGCCG CCCAAACTTG
AGCTGGCCGC GCTGGCTGAT TGTGCTCAGT ATGACTTCAA CCTGTTTGAT CGCCCTCAGT
ATTGTCGAGC TTGAAAGCGC CGCAGTCGTC TTGGTGACCG TGCCAATTTT GTTGGCAATT
ATCACGACCG GACGTTTATG GCCAATTATT AGTTATCCGG CGTTTATGCT AATTGCGACA
CTGGTTCACC CTGCTGGCTG GAGTTTGTTT AACCCAGCGG TGATTGTGTT TAATTTGTTG
TTGTTTTTGA CGATGTGGCA ATTGCGCAGC TCGGCCAGCC GCGCCACCCA AATTCAGGTG
TTGGCCAAAG CCCAGCTTGA GCAAGAACAA CAAGCGCGTC AATTTATTCG CTTTTTTCAG
CACGAATTAC GCGGCTATAG CAGTGGCTTA GAAGGTTTAG TACCAATTTT TGAGCGTTTT
TTGGCCAACG CGCCGCAACA AAATGCTGGT ATTAGCAGTG ACGAAGCCTT GAGTGCCCTC
GCCACCACAA CCCAACATCT CAAGCAATTA ACCACACAAC TATTAATTTT AACTCGCGAT
GGGGTTTTAC AACCCCAACA ATTACAAACT CTCAATTTAG CCGAATTATT ACAACAAATT
GTCAATGAGG TGCGCACTTC ACATCAGCTT GATGCTGAGC ACCTGCGTTT GAGTTTATGC
AAACAGCAAC CAGTCTTGAT TAAAGGCAAT GATTTGTTTA TTACTCTAGC CTTACGAACA
ATTATTCAAA ATTCAATCGA GGCCAACCCA AGCTCAATCG CTGGACAACA GATTAGCCTT
GAGTGCTGTA CCAACAACGA AATGGCCGTA ATTAGCATTA GCGATCAAGG CGCTGGTTAT
CCCAACGAAT TGCTGCAACG GCTCGCTACC AATCAGATTC CGGCAGTGCT AGGCTGGAGC
GGCAAACTTG GTGGCAATGG CCTAGGCTTG GCATTAATTA TGCAGGTTGC TCGTTTGCAT
CAAGGCTCGG TTCGTTTTGA AAATCAAGCT CAAGGCGGCG CACGCACCAG TTTGGCCTTG
CCGCGCATTT AG
 
Protein sequence
MKDGWGNLAQ SWFNAYQRIG QQLDPLPWIT WLIACFSVGV GALGLTGIDQ TAPKLLIWID 
FGLALYLFWL GWVVYRRPNL SWPRWLIVLS MTSTCLIALS IVELESAAVV LVTVPILLAI
ITTGRLWPII SYPAFMLIAT LVHPAGWSLF NPAVIVFNLL LFLTMWQLRS SASRATQIQV
LAKAQLEQEQ QARQFIRFFQ HELRGYSSGL EGLVPIFERF LANAPQQNAG ISSDEALSAL
ATTTQHLKQL TTQLLILTRD GVLQPQQLQT LNLAELLQQI VNEVRTSHQL DAEHLRLSLC
KQQPVLIKGN DLFITLALRT IIQNSIEANP SSIAGQQISL ECCTNNEMAV ISISDQGAGY
PNELLQRLAT NQIPAVLGWS GKLGGNGLGL ALIMQVARLH QGSVRFENQA QGGARTSLAL
PRI