Gene Haur_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3933 
Symbol 
ID5735794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4926913 
End bp4928181 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content53% 
IMG OID641281084 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001546695 
Protein GI159900448 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAATC GATTGCGTTG GCGCTTAACC TTAATTTATG CTTGCACAGC CTTGCTGCTG 
TTGGCGGCAG TTGGCGGTAG CGTTTATTGG ATGACCGTGC GCTATTTTCG CTTTGCTACC
GAACGGGCCT TGCTTGAACG CATGACCTTG GAATTTGAGC AATTGGCAGC GCCCTTGCCG
CCCGCCCTGC AAGCCTATCG CCCCCAGCAA TTGCCCGATG AACGGCCCGT TGATAATGGA
ACACTGGCTA GCACGTTCGT TTTTACCATC GACCAAAACG GCCAAGTCTT CAACCCAAAC
CCGTGGAACC CGCCAATTCA GCCTGATCAA GCGGCGATTG AGGCCGCCAA GGCTGGCAAA
CTTGATCTGC GCACCATCAA CTTGGCTGAT GGAACTCAAG TCGCTTTGTT GACTGAAAAA
TTGACTCGCA GCGATGGCCC AGCTTTTTTG CAAGTTGGGC GGGTGCTCAA CGAGCAAGAA
GCCGCCTTGA GCACCTTGTT AAAAGGCTTG GTGGCTTTAT GGGCTGGAAG TGTAGTGGTT
TTGGGCTGGT TTGGTTGGTG GTTGGCCGGG CGATCGTTGC GGCCAGCCCA ACAAGCTTGG
GAACGCCAAC AAGCGTTTAT CGCCAATGCC AGCCATGAAC TGCGTGCTCC TCTGACCTTG
ATGCGAGCCA GCAGCGAAAT TGCCCTGCGC GAATCGACCG ACCCTGCCGA GCAACAAGAA
TTGCTTGAAG ATATTTTGGC CGAAACCTAT CACATGGCAC GTTTGGTTGA AGATTTGCTG
TTGCTCTCGC GGCTTGATGC TGCCAAAACC CATCTTCAGC GCGAAACGAT CGATCTGGCT
GAGTTATGCC AGGATGTGGC TAAGGATGCT GGACGCTTGG CACACGATGC TGGGGTGGAG
GTTCGTGTGG CGCATGCCGA GGGTCAGATC AAAGTTGATC GCACCCGTTT GCGCCAAGTG
TTATTAATTT TGCTGGATAA CGCAATTGCC CATACGCCGC GTGGTGGGAG CGTAGTGATC
CATGCTGAAC GCAAGCAAAC TAGCTATCAA ATTAGCGTAA TCGACACGGG CAAAGGCATT
GAGCCTAAAC ATCTCAAGCA TATTTTCGAG CGTTTTTATC GGGTTGATAG CGCTCGGATC
GCTGGCGGAC GTGGTAATGG GCTTGGTTTA TCAATTGCTC AAGCGCTGAT TCAAGCCCAC
AATGGCACGA TTGAGGCCCA AAGCAACGTG GGCACTGGCA CAACAATGTT GATCAACTTG
CCAAAATAA
 
Protein sequence
MLNRLRWRLT LIYACTALLL LAAVGGSVYW MTVRYFRFAT ERALLERMTL EFEQLAAPLP 
PALQAYRPQQ LPDERPVDNG TLASTFVFTI DQNGQVFNPN PWNPPIQPDQ AAIEAAKAGK
LDLRTINLAD GTQVALLTEK LTRSDGPAFL QVGRVLNEQE AALSTLLKGL VALWAGSVVV
LGWFGWWLAG RSLRPAQQAW ERQQAFIANA SHELRAPLTL MRASSEIALR ESTDPAEQQE
LLEDILAETY HMARLVEDLL LLSRLDAAKT HLQRETIDLA ELCQDVAKDA GRLAHDAGVE
VRVAHAEGQI KVDRTRLRQV LLILLDNAIA HTPRGGSVVI HAERKQTSYQ ISVIDTGKGI
EPKHLKHIFE RFYRVDSARI AGGRGNGLGL SIAQALIQAH NGTIEAQSNV GTGTTMLINL
PK