Gene Haur_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2122 
Symbol 
ID5734010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2665038 
End bp2666588 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content50% 
IMG OID641279263 
Producthistidine kinase internal region 
Protein accessionYP_001544890 
Protein GI159898643 
COG category[T] Signal transduction mechanisms 
COG ID[COG2972] Predicted signal transduction protein with a C-terminal ATPase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.190674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAA AGCTGCCAAA TCATGAGCCG ATGGGAGTTG TGCTGGCTAG GTGGGCGGCT 
GCCCCACGGC TACGCCTACG ATTGTCTATC CGCGACAAAA TCCTGTTGGC ATTGTTGATG
GTGGTCATCT TGATGAGCGT GCCATATATC TTTCTGATTG TCCCCGGGCT AGAATACAAG
ACCCAATATG ACGTGCTGAT TCAAAATATT ACTACCGCCA ACAGCATCAA CGGCTATATC
AAGCCATCGA TCGATGCCGA ACTGTGGGAG ATTATTGCCG GCAAGAAACC GTTTGCCCAA
GGTACGCAGT ATGCAATCTT GAACGATATC GACCATCGCA TTGAGCTAAT GATCGATAAC
TCTAGCTCTC AAAAGGGTCG CGTTAAACTT GGTATTATTC AGCACACGCT GCAAACGCTC
CGCAGGTTGA TCGATAAGGT TGGTATTCAG ATTGCCCAAG CTAAAACCTT TGCCGAAAAT
TTGGTGCTGA TGGAGGAAAT TCGTAGTATT ACCCAGTTGA TTGAGGGCAA CGTACAAGCC
TATGCCTTGT TTGAAGTTAA TCGAACCCAA CAGCAATATC AGGCAATGCA AAGCGACCTG
ACACGCTGGG CGATCGGTGG TATAGGCGTG ATCATGGCTT CGATCCTCTT CTCGATCGTT
GCTGCTTGGC GGATCTCAAA AGGTATCTAT ATTCCGATTA AGAAGCTGCA CGATGTGACT
ACTACGATTG CCCGCCAAGA TCTCGAAGGG CTGGTGATGG CTGATAACGC CGACGAGATC
ACAGAATTAG GCTTGAGCTT TAATATTATG GTTGGTAAGA TCAAAGAATT GCTTGATGCT
AAGCTCGAAG AACACGAGAA CCTCAAAAAG GCTGAGCTAC GGGTGCTCCA AGCACAAATC
AACCCCCATT TCCTCTACAA CACCCTTGAT GCAATTATCT GGATGGCCGA AGCCAAGCGC
ACAGCACAGA TTATCGATCT GGTTTCGGCG TTGTCGCGCT TCTTCCGAAT TACGCTCAGT
AAAGGCCGAG ATTGGATCAG CGTTCCTGAC GAGATCGCGC ATATCGAAAG CTATCTGGCG
ATCCAGAAAA TTCGCTACCG CGATATCCTC GATTACCAGA TCGATATTCC TGAGGACACC
CAGAGCACCG AGATGCTCAA GCTGACGCTC CAACCGTTGG TGGAGAATGC ACTGTATCAC
GGGATCAAGA ATAAGCGGAG CGGTGGAACT ATTGTGGTGC GTGGCCGCTG GCTCGATGGT
GATCGACTGC GAATCGAAGT CGAAGACAAT GGGATTGGCA TGACCCAAGA GCGGCTGCAG
CAGGTGCGGA CGTTGCTGGA GGCGGGGAAC CTGTGGGTTA CAGGCGTAAT GCCCATCGTC
GAGGATGGCT ACGGGATCAG TAATGTTAAC CAGCGTATCA AGCTCTACTA TGGCTCCGAC
TATGGATTGT CGATCGAAAG CGAACATGGG CGTGGCACGT GTGTGGCGCT GATCATCCCG
CGCTACCGTG GCATTACCAC CCAACCGCCA CTGGCGCTTG CTGCTCGCTA A
 
Protein sequence
MDEKLPNHEP MGVVLARWAA APRLRLRLSI RDKILLALLM VVILMSVPYI FLIVPGLEYK 
TQYDVLIQNI TTANSINGYI KPSIDAELWE IIAGKKPFAQ GTQYAILNDI DHRIELMIDN
SSSQKGRVKL GIIQHTLQTL RRLIDKVGIQ IAQAKTFAEN LVLMEEIRSI TQLIEGNVQA
YALFEVNRTQ QQYQAMQSDL TRWAIGGIGV IMASILFSIV AAWRISKGIY IPIKKLHDVT
TTIARQDLEG LVMADNADEI TELGLSFNIM VGKIKELLDA KLEEHENLKK AELRVLQAQI
NPHFLYNTLD AIIWMAEAKR TAQIIDLVSA LSRFFRITLS KGRDWISVPD EIAHIESYLA
IQKIRYRDIL DYQIDIPEDT QSTEMLKLTL QPLVENALYH GIKNKRSGGT IVVRGRWLDG
DRLRIEVEDN GIGMTQERLQ QVRTLLEAGN LWVTGVMPIV EDGYGISNVN QRIKLYYGSD
YGLSIESEHG RGTCVALIIP RYRGITTQPP LALAAR