Gene Haur_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3102 
Symbol 
ID5734974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3911222 
End bp3914857 
Gene Length3636 bp 
Protein Length1211 aa 
Translation table11 
GC content52% 
IMG OID641280246 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001545868 
Protein GI159899621 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTATGC AATCCCAAAA ACAGGCATTT GAAGATTCCT CAGCCCACTT TTTGCGCGAG 
TGGCTCCCAC GTTTGGCAAC CGCCACGAGC ATTAATGGCT TGCACACAAA TTTAAGCGCC
GCCCTTCGCC AATTTCGCCA TAATCCCCAC TACCATTTGA TTTGGCTTGA TCTGGCCGAA
ACTCAGCCCG AAATTCCTTT AGAATTAATG CTCTCCAGCA ACCAACAGGA GCAGCTTGAG
CAAGGTGCAA TTTTAACCAT CACCACCGCT GATGCCACGC AATGGACGAT GCTGCCCCTG
TTGCACATTA CTTTGCGCGG TTGGTTGGCC TTGCCTGAAG CAACCCAAAA TGATCAATTG
CTCTTGCCCT TAGCGCTGCA AACTGCCGCC AGCATGTGTA CGATCACCAA CAATCAACAA
TTGGTGGCCG AACTGCGCGA GCTGACCCAA TTTAATTTGA TCAGCCAACG CCTGAATAGT
TCGCTCGAGC TTGGGCCGTT GATCGAGGCG GTGTATCGCG GCATCCAACA GATGTTTGGT
CCGCAAAATC TGTTTGTCTC GTTGTATGAT CCTGAGTCGC AGCTGTTTCA AGTGGCAGCG
CATTTTTACG CCGATGGCCG CAGCACGACC CCCAATTCGC AATGGACAGT TGCCCATGGC
TTAACTGGGG TGATTATTCG GCGCAACGAG CCAATTATCA CCGATCGCTA TCTCGAAACT
TGTGCCCAAT ATCATGTTAC GCCATTGTTT ATTGATGATG ATATTCCGCC GCTGTTTTGG
GCAGGTGTGC CGCTACGCTT CGGCGAACGG GTGCTTGGCA CGTTAGTGAT CTACACGCTT
GAGCCAGAGG TACGCTACAA CGCCGACACG TTACGCAAAT TGGAGATGTT GGCTCATCGT
GCGGCTGATA ATTTTGAGCA GGCACGGCTC TACGAACGCA CCACACGCCA AGCTCGCCAA
CTTGAATTAT TAAACGAACT TGGCCGCACC ATCACTTCAA CCCTCGATTT TGAGGCCGTG
CCATCGTTGA TCATGGGGCG GGTGCAAGAG CTTTTAAATG TTGAAGAGGG TTCGTTGTTG
TTGCTCGACG AAGCCACCAA CGAATTAGTC TTTCGTTATA GTCTGAGTCC AGTTGGCCAG
CAACTTTTGG GCAATCGTAT CCCGTCTAAT GTGGGGATTG CTGGCTTGGT GTTACGTACA
GGCGAATCAT TGATCGCCAA TCGTGCTGGC GATCATCCGG CGTTTTATAC GGGCATTGAT
ATGTTGGTGG GCCATGAAAC CCGCGATCTG CTGTGTGTGC CGTTGCTTGG CGCTGGCGGA
GTCAAAGGGG TGATCGAAAT TCTCAATCAT CGCAATGGCT TGCCATTTAC TGCCGCCGAC
CGCGCCTTGC TCGAAGCAGT GGCCGATCAA GCAGTAATTG CGATTGAAAA CTCTAATCTC
TACACCCAAA CCGACCAAGC CCTGACTCGC CGAATCAGCG AACTCGACCA GCGTAACAAG
CAATTTCAAG AGATTGTGCA AATTGGCAAT GCGCTCAAAG CTGCTTCAAA CCTTGGGTCG
GTCTTGCCAG CCTTAGCCGA GGCTGTGCAA AGTGCTACGG GCTTCAACCA AGTGACGATT
AGCCTCGTTC AAACTACGCC AGGCCATAAA GCAATTATCA AACGGGTCGT CAGCGCAGGT
ATCGATCGCC AAATTTTCGA GGCCCAAGCT AGCATTACGA TTGAGCCTGA GCGCGTCGAG
GCTTTGCTCA AACCGCAATA TCGCCGGGGT GAATCGACCT ACTATATCGA TCATCGGCGC
AATGGTGATG CGCGTTTATG GCCTGATCCC TCAAATGGAG TCGATGTGCC AGAGCTACGG
ACTGGCATGT GGCATCCCAA CGATAGCTTG TTTGCGGTGT TACGCAGCAC CAGCGGCGAT
CTTTTAGGTC TTTTGACGGT CTATGCACCA AAAAATGGCC TTCAGCCAAC CACTGATCAA
ATTCAAATGC TGGAGATTTT CGCCACGCAA TTGGCAGTTG CGCTCGAAAA CAATCGGCTC
TACGAACGCC AACGCCAAAC CGTCGAAGGC TTGACTGCCT TGAGTGCGCT TGGCATGGCG
ATCAACAGCA CCTTCAGCAA TGCGCAATCA ATTTGGCAAT TTACAGTTGG CGGGATGGTC
GATTGGACTG GCGCACTCGG CGCTGGGGTA TTACTCAGCG ATCCCAATGA TCCCACGACA
TTGCAACCAA TTGTCGGCTT GGGAATTGAG CACACACCTG ACGAGGCGAT TATTCAATTT
GCCCATAAAG TGCTGGAGCG TGATAAGCCC TTGTTGCTTG GCGATCGCAG CAAATTGCCG
AGTGTCTTAC GCGATTTAGG TGGTCAAGCC TTGGTGATGT TACCGTTGCT AGCAACCCAC
CAAACCTTGG GCGTGATCTA CTTGTGGTAT CCCGAAACCT TGCCCAACCG CGAAGCCCAA
GATTTGCTCT CGCTGTTTGT GGGCCAAGCC GCTGTGGCGG TCGAAACTCG GCGTTTGGCC
GAAGAAGTGA GCAATGGCCG CGATCGCTTA GCCTCGATTT TGGCTTCAAC CGAAGAAGGC
ATTATTTTGC TGAGCGCCGA TTTGCGGGTG GTCGAGGTCA ATGCCGCCGC TCAAACCATG
ATCGGCAGCA ACGAAGTCGC CGAATTGTTG GGAGAGCCAG TCGATTTGCT GCTTACCCAT
TGGTGCGCCC AGTGGAATAA ATCTGAGGAA GCTTGGAGCG AACTAACGCT GGCGATCTAC
AAAGTTAGTC GCGGGCGTTT GGCTGAGGCG CGGGGCCAGT TAGAATTGGG CGGTTTGCGC
AGCCAATGGC TCGAATGGAC AACTCTGCCA GTTCAAAGCG CGGCGAACCA AAGCCCTCAC
CCGATTATTA TTGTGCTACG CGATATTACC ACCGAGATCG AGGCCGAAAA TCTGCGCCAC
GACCTGACCT ATATGATGAT TCATGATTTG CGTGGGCCAC TGAGTTCGGT GATGACCTCG
TTGGATATGC TGGCCAAAAA GATGGTTGGC GACCTCAGCG AAGGCCAAGA TAAAATTGTC
AAAATCGCTT TGCGCAGTAG TGCTCGTCTG CTCGATATGG TTAATTTGCT GATGGATATT
AGCAAACTTG AAGCAGGCCA GATGCCAATC ACACCAATTA CGGTGGCGGT TGATGATGTG
GTGCGTTCGG TGATGCAAAA TTATGAGCCA TTGCTCAACG AGCGCAAAGT CCATGTAGCG
CTGAATATTG CTGAAAATAC CCCTAAAGCC TCGGTTGATA TCCAAACCCT AGAGCGGGTC
GTGCAAAACT TGCTGGATAA CGCGATTAAG TTCAGCCCAG CCTTATCAAC AATCACGATT
AGCGCCAACA AAGTTCAAGC CAATCAACTA CCAAGCGATC ACCCAACTGG TCAATGGATT
TTGTTGGGCG TGCGTGATGC TGGGCCTGGC ATTCCAGCCC AATATCGCGA ACGGGTCTTT
GAAAAGTTTG CCCAAGTCAA ACAAACTGGG ATCAAAGGCA CTGGGCTAGG CTTGACCTAC
TGTCGCTTGG CGGTCGAAAC CCACGGCGGG CGCATTTGGG TTGCCAATGA CGATGGCCCT
GGGGCGCTCT TCCTGCTGAC TATTCCGATA GCCTAA
 
Protein sequence
MCMQSQKQAF EDSSAHFLRE WLPRLATATS INGLHTNLSA ALRQFRHNPH YHLIWLDLAE 
TQPEIPLELM LSSNQQEQLE QGAILTITTA DATQWTMLPL LHITLRGWLA LPEATQNDQL
LLPLALQTAA SMCTITNNQQ LVAELRELTQ FNLISQRLNS SLELGPLIEA VYRGIQQMFG
PQNLFVSLYD PESQLFQVAA HFYADGRSTT PNSQWTVAHG LTGVIIRRNE PIITDRYLET
CAQYHVTPLF IDDDIPPLFW AGVPLRFGER VLGTLVIYTL EPEVRYNADT LRKLEMLAHR
AADNFEQARL YERTTRQARQ LELLNELGRT ITSTLDFEAV PSLIMGRVQE LLNVEEGSLL
LLDEATNELV FRYSLSPVGQ QLLGNRIPSN VGIAGLVLRT GESLIANRAG DHPAFYTGID
MLVGHETRDL LCVPLLGAGG VKGVIEILNH RNGLPFTAAD RALLEAVADQ AVIAIENSNL
YTQTDQALTR RISELDQRNK QFQEIVQIGN ALKAASNLGS VLPALAEAVQ SATGFNQVTI
SLVQTTPGHK AIIKRVVSAG IDRQIFEAQA SITIEPERVE ALLKPQYRRG ESTYYIDHRR
NGDARLWPDP SNGVDVPELR TGMWHPNDSL FAVLRSTSGD LLGLLTVYAP KNGLQPTTDQ
IQMLEIFATQ LAVALENNRL YERQRQTVEG LTALSALGMA INSTFSNAQS IWQFTVGGMV
DWTGALGAGV LLSDPNDPTT LQPIVGLGIE HTPDEAIIQF AHKVLERDKP LLLGDRSKLP
SVLRDLGGQA LVMLPLLATH QTLGVIYLWY PETLPNREAQ DLLSLFVGQA AVAVETRRLA
EEVSNGRDRL ASILASTEEG IILLSADLRV VEVNAAAQTM IGSNEVAELL GEPVDLLLTH
WCAQWNKSEE AWSELTLAIY KVSRGRLAEA RGQLELGGLR SQWLEWTTLP VQSAANQSPH
PIIIVLRDIT TEIEAENLRH DLTYMMIHDL RGPLSSVMTS LDMLAKKMVG DLSEGQDKIV
KIALRSSARL LDMVNLLMDI SKLEAGQMPI TPITVAVDDV VRSVMQNYEP LLNERKVHVA
LNIAENTPKA SVDIQTLERV VQNLLDNAIK FSPALSTITI SANKVQANQL PSDHPTGQWI
LLGVRDAGPG IPAQYRERVF EKFAQVKQTG IKGTGLGLTY CRLAVETHGG RIWVANDDGP
GALFLLTIPI A