Gene Haur_0725 details

Gene Information

Locus tag: Haur_0725
Symbol:
ID: 5732611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 833061
End bp: 835349
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 55%
IMG OID: 641277855
Product: CheA signal transduction histidine kinase
Protein accession: YP_001543501
Protein GI: 159897254
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases; [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
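
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the record states neither explicitly). The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(833061, 835349)
print(length)                  # 2289
print(protein_length(length))  # 762
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields above.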


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGGTT TTGATTTATC AGCATTTTTT GGCCAGTTTC GCGAAGAAAC TGAAGAGAAT 
GTGCGGGCCT TGACCACAGG CTTATTGGCC TTGGAGTCAA ACCCAGGCGA TCGCGAAGCG
ATTGATACGA TTTTTCGGGC GGCGCATACG ATCAAAGGTT CGGCGCGTAT GCTGGGTCAA
GTCGATATGG GGCGGTTGGC GCATACCATG GAAAGTTTGC TTTCGGCCTT GCGCAGTGGC
ATGCTAGCCA TGAATTCGAG CATTAACGAT GTGCTACTGG CCAGTGTTGA TGTATTGCTG
GTGTTGAATT CCCAAGTCAA CGAGCCACCG CCAACCGACC CCAACGTTGA TCGTTTGGTT
GAGCAACTGA ATGCCTTGGC TGCTGGCGAA AGCTTGCCTG CTGCGCCGAT TGTCGCGCCA
GTAACCGAAC CAGAACCAGA GCCTGAGCCA GTAGTGCTTG AGCAACCCAA GCCCGAACCA
GCGGTTGCTA AGCCAGCCGC GCCTGCCAAA CCCAAAAAAT CAGCCTCAGC CGAAGCCCCC
AAGTCGGTGA GTAGCACCCG TTCAACCGTG CGTGTGCCAA TTTCGCGCTT AGATCGTTTG
TTGAATACCG CTGGCGAGTT GGTCGTAACC CGCCAATTGC ACCTTGAGCA TGTCGCTGAT
CTTGAGGCTT TGGATAAATT GCTGACCAAA AGTGAGCGCC TGAGCCAACA ATTGAGCGAA
CGCTTGACGG GTCAACGGGT GACCTTTCAG CAACGGCGCG AGGCCAGCGA ATTAGCCAGC
CAATTGCAAA ATCTGGCCCA ATCGACCCGC AATCAGTTGC GTTTGCTAAC CGAGCGTTGG
AGCAGCCATA GCGCCGCCAG TGAGGCCTTG GTCGATGAAC TTGAGGCTGA GGTGATGGCG
ACCCGTTTGC AACCAGTCGC TGGTTTGTTT GCACCAATTC CTCGGGCCGT GCGCGAGCTG
GCTCGTTCGT TGGGCAAAGA AGTTAACTTA ATCACCGAAG GCGAAACCAC CGAGGCCGAT
CGCAAAGTGA TTGAGTTAAT GGCTGATCCG TTGGTGCATT TGGTGCGCAA CGCGCTTGAT
CATGGCATCG AAAGCCCCGA TGAGCGGGTG AAAGCCCACA AGCCTGCCGA AGCAAGCTTG
CGTTTAGAAG CTCGCTCGTT GGGCGGCACG ATTGAAATTA TTATTAGCGA CGATGGCCGT
GGCATCGATC CAGCGGTGAT TCGGGCAACT GCAATTAAAC GCGGAATTAT CGAGGCTGAT
ACAGCGGCTC GCTTGCGTGA TGAAGAAGCT TTGGAGTTGA TCTGGCAGCC TGGTTTTTCC
ACCAGCGCAA TCATCACCGA TGTTTCAGGC CGTGGCGTTG GCATGGACGT GGTACGGGCA
GCAGTGACCG AGGTTGGTGG GCGGGTCGAT GTGCATTCGG TGCTTGGCCA AGGCACGACC
TTCACGCTGA TTTTGCCAAT TACCTTGCTA ACCACCCGCG TGTTGTTGTT TGATGTGGCT
GGCACAACCT ATGCCTTGCC TTCGACTGCT TGTCTAGGTG GGCGGCGGGT TGCTGGCGGG
CAAATTCAGA CCGTCGAAGG GCGACCAACC GTGCGGGTTG ATGAGCGCAG CGTGAGCATT
GTAGCGCTTG CGCCCTTGCT TGAGCAGCGT GGCCCCTTGC CGCAACCATC GGATATTTCC
AATTTGGTGA TTTTGGGGCC AGCTAATCGC CCATTGGCCT TGTTGGTCGA TAAATTGGTC
GATGAACGTG AGGTGGTGGT TAAATCGTTG GGCGCATTGT TGCATGAACA ACGTTTGTGT
ACTGGCGCGA TTGCCCTGCC TGATGGGCGT TTGGTGTTAG TGCTCAATCC CTTGGCGATT
GCGGCGCGGG CACGTGAATG GGGCAAACCA GTTGCCTTGC CAGCGCCAAC CAAGCTCCAG
CCTGCCAAAT TATTGGTCGC GGAAGATTCA TTTACCACCC GCGAACTGCT CCGATCCATG
CTGCAATCGG CGGGCTATGT GGTTGAAACG GCGATTAACG GCCAAGATGC GCTTGACAAG
CTCAATCACA ATTCCTACGA TCTGCTGGTA AGCGATGTTG AAATGCCGTT GCTAACTGGC
TTTGAGCTAA CCCGCCGTGT GCGTGCCCAT GACCGTTTGC GCCAACTGCC AATTATCATT
ATCACCAGCT TGGCCCGCGA TAGCGATCGG CGTGAAGGCT TGTTGGCTGG TGCGCAAGCC
TATATCGTCA AAAGCCAGTT TGATCAAAGC AACTTGCTCG AAACGATTCA TCAATTACTT
GGCCGCTAA
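
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage like the one in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is a toy string, not this gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (layout whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout as the record.
demo = "ATGC GGCC\nAATT"
print(clean_sequence(demo))  # ATGCGGCCAATT
print(gc_percent(demo))      # 50.0
```

Pasting the full gene sequence into `gc_percent` would give a value to compare against the reported 55%, assuming that field refers to the gene itself.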
 
Protein sequence
MGGFDLSAFF GQFREETEEN VRALTTGLLA LESNPGDREA IDTIFRAAHT IKGSARMLGQ 
VDMGRLAHTM ESLLSALRSG MLAMNSSIND VLLASVDVLL VLNSQVNEPP PTDPNVDRLV
EQLNALAAGE SLPAAPIVAP VTEPEPEPEP VVLEQPKPEP AVAKPAAPAK PKKSASAEAP
KSVSSTRSTV RVPISRLDRL LNTAGELVVT RQLHLEHVAD LEALDKLLTK SERLSQQLSE
RLTGQRVTFQ QRREASELAS QLQNLAQSTR NQLRLLTERW SSHSAASEAL VDELEAEVMA
TRLQPVAGLF APIPRAVREL ARSLGKEVNL ITEGETTEAD RKVIELMADP LVHLVRNALD
HGIESPDERV KAHKPAEASL RLEARSLGGT IEIIISDDGR GIDPAVIRAT AIKRGIIEAD
TAARLRDEEA LELIWQPGFS TSAIITDVSG RGVGMDVVRA AVTEVGGRVD VHSVLGQGTT
FTLILPITLL TTRVLLFDVA GTTYALPSTA CLGGRRVAGG QIQTVEGRPT VRVDERSVSI
VALAPLLEQR GPLPQPSDIS NLVILGPANR PLALLVDKLV DEREVVVKSL GALLHEQRLC
TGAIALPDGR LVLVLNPLAI AARAREWGKP VALPAPTKLQ PAKLLVAEDS FTTRELLRSM
LQSAGYVVET AINGQDALDK LNHNSYDLLV SDVEMPLLTG FELTRRVRAH DRLRQLPIII
ITSLARDSDR REGLLAGAQA YIVKSQFDQS NLLETIHQLL GR
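
The protein sequence follows from the gene sequence by codon-by-codon translation. Below is a minimal Python sketch of that step, not the database's own pipeline. It builds the codon table from the standard TCAG ordering, whose amino-acid assignments translation table 11 shares with the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The `translate` function is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS until the first stop codon, ignoring layout whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGGCGGTT TTGATTTATC AGCATTTTTT"))  # MGGFDLSAFF
print(translate("GGCCGCTAA"))                         # GR
```

The two outputs match the first ten and last two residues of the protein sequence listed above.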