Gene Haur_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3937 
Symbol 
ID5735798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4931349 
End bp4933454 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content52% 
IMG OID641281088 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001546699 
Protein GI159900452 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACGC TTGCTTTATT TAGTAGCCTA GCCCAAACCT TGCTTGCAAC CACACCTTTT 
GAGCAGCGCA TGCAGCATTG TTTTGGGCTG TTGGCGGATA GCTATCCTCA GCTTGATTTA
CGCTTAACCT TATTAAACGA GCCTGATGCT CGTCCGCAAG TGGTTTTGCC GTTGCATCGT
ACCAGCGCCG TTTGGGATAA CACTCGCATG ATGCAGGTGG TGCGGCGGCG ACAACCAGTC
GTGATCGATC AACACACAGC TCCAGCCTTG CCACCCAGCC CGTTTACCAC CAGCGCAATT
GTTGCTAGCG AATGGCAAGC CGATATGCAA TGTTATTTGG GGCTGCCGAT TCAGTGGGAA
GGCCGCTTGT GGGGCGTGTT GGAAGCCCGC CGTAATGGTA CATTTAGCGC CAGCGAACGC
ACCTTATTGA GCAATTTGCT GCCCTTACTG GCCACCGCGA TCGGCGAAGC CCACTGGGGG
CGACCGATTC ATCATACCAG CAGCGAACAG CAGCTTGATG TACGGGCATT AACCCACGAT
TTAGAAATTG CGCCTGATGT GACAACGTTG TTGACAACGC TCTTGCAACG GGCCATCCAC
AGCGTCAAAG CCAGCGCTGG CGCGATTAAC TTGGTTGATC GTGAGCATGG CGAATATCGT
TTGATTGCCT CGCAAGGCTA CCCGCCAACC GCTGGAATTA GTGAGCGAAC ATCGTGGCCT
TGGAATGTTG GGGTGGTCGG GCGGGTGGCA CGCACTGGCA AGGCCGCCTT GTTGACCGAT
ATTGCCCACG ATAGCGAATG GCAACTTTCT ACCCCCGATG TGCGAGCCGA AATCGTCGTG
CCCGTGCGGG TCGAGGGCGA GGCATGGGCG GTGCTGGTGC TTTCGACCAA TCGTGAGCCA
ATCTTTACAA CCCGCGATCT TTATTTTATT CAGGCTTTGG CTGATGTGGC GGCGCGGCCA
TTACAACGGG CAACCAGTTA TAGCGAATTG CTCGAAGCCC GCATGCAATT GCAACAAACC
TTGGCTAGTT TGCCGCTGGG CTTGGCCTTG ACTGATGGCG AAGGCCGAAT TTTACGCACC
AATCCCGCCT GGTATCAACT TTGGCAAATT GAGCAGCCCG CCGATGAAAC TGCGCTCTAT
TTGCCGTGGG ATATTTTGCC CTTGCTGCTC AAACGGCTGT CGCATCCCTT GGAATTGACC
GATTTTTTTG CTGAATGCCA AGCTCAACCT GACGAAACCC TCGAATTAGC GCTGCGTTTG
AGCGAACCAC TTCAAGATTT AAAATTGCGT TCAACCCCAG TTAAAGATGC TCAGCACCAA
ATTACTGGGC GCTTGGTGGT GATTGAAGAT GTGACCCGCG AGCGTGAAAT CGACAAGATG
AAAAACGAGT TTGTGTCGGT GGTATCGCAT GAATTGCGTA CCCCGCTAAC CTCGATTTTG
GGCTATACCG AGTTGCTGTT AGCGCGTGAA TTCAAGCCAG TTGAACGTCA AGAGTTCGTC
CAAACCGTCT ATGATCAGGC CAACCAACTC TCGAAGATGG TCGATGATCT GCTGAATCTT
TCGCGTTTGG ATGCAGGCCA GATCAAGCTG AATCGTTGGG TGGTGTCGCT GCACCAAATT
ATTCGTGAAA TTACCAAGCA ACTTAACGAA ACATTGTCTG AAAAGCATCG TTTGTTAATT
GATATTCCCG AAGGCATTCC GCCAATCTTT GCCGATAAAG ATAAAGTGCG TCAGATTTTG
ACCAACCTGC TCTCGAATGC GATCAAATAT TCGCCCAATG GCGGCCAAGT AGCCTTGATT
GTACGTGAAT TGCGTAAAGT TCCGCCTGGT GCGCCGCCCT TGCCCAACGA GCGCTCGGTG
ATTATTGCGG TGCGCGACCA AGGTATGGGT ATTTCCGAAG AAGATTTGCC CAAGCTGTTT
ACGCGTTTTT TCCGCGTCGA TAACTCGACG ACTCGCAAAA TTGGCGGCAC AGGCTTAGGC
TTATCAATCA CCAAGGCCTT GATCGAGTTG CATGGCGGGC GAATTTGGGC CACCAGCACG
CTTGGCCGTG GCACAACCTT CTGGGTAACC TTGCCAATTG CCACTGAGTT AGCCCGCCGA
GGATGA
 
Protein sequence
MSTLALFSSL AQTLLATTPF EQRMQHCFGL LADSYPQLDL RLTLLNEPDA RPQVVLPLHR 
TSAVWDNTRM MQVVRRRQPV VIDQHTAPAL PPSPFTTSAI VASEWQADMQ CYLGLPIQWE
GRLWGVLEAR RNGTFSASER TLLSNLLPLL ATAIGEAHWG RPIHHTSSEQ QLDVRALTHD
LEIAPDVTTL LTTLLQRAIH SVKASAGAIN LVDREHGEYR LIASQGYPPT AGISERTSWP
WNVGVVGRVA RTGKAALLTD IAHDSEWQLS TPDVRAEIVV PVRVEGEAWA VLVLSTNREP
IFTTRDLYFI QALADVAARP LQRATSYSEL LEARMQLQQT LASLPLGLAL TDGEGRILRT
NPAWYQLWQI EQPADETALY LPWDILPLLL KRLSHPLELT DFFAECQAQP DETLELALRL
SEPLQDLKLR STPVKDAQHQ ITGRLVVIED VTREREIDKM KNEFVSVVSH ELRTPLTSIL
GYTELLLARE FKPVERQEFV QTVYDQANQL SKMVDDLLNL SRLDAGQIKL NRWVVSLHQI
IREITKQLNE TLSEKHRLLI DIPEGIPPIF ADKDKVRQIL TNLLSNAIKY SPNGGQVALI
VRELRKVPPG APPLPNERSV IIAVRDQGMG ISEEDLPKLF TRFFRVDNST TRKIGGTGLG
LSITKALIEL HGGRIWATST LGRGTTFWVT LPIATELARR G