Gene Haur_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3521 
Symbol 
ID5735382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4432572 
End bp4435640 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content52% 
IMG OID641280668 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001546285 
Protein GI159900038 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGGC CGCTCTTGCT GAGTTTTTGC AGTGCGACCA TAACCGCGCT CTTGTTTGGA 
TTGGGTTGGC TAGCCAATCT GCCGATGCCT TGGATTTTAG CCTTAGGGGT TGTGCAAGCA
GGCATTATGT GGTGGCGCAC ACCATGGGCT GTTGGCAGTG CCATCAGCAT GGCGCTCATC
GCATGGTTTG CGGGCATTAA TCAGGTTGGG TGGCTAGCAC TGTTGGTTGG TGGTTGGTTG
GCTCAAGGCG GCTTGCCAGC CCTCATTTTT ATTCTGTATC GGTATCCCTA CGATATGTCG
CGCGAATCGG CCCTGAATGG TTTTTTGGCC AGTGGGGTGT TGTTCAACAC GGCGATTAGC
GCATTTTGTA TGAGCATTGT CGGCTGGCAA GTCGGCTGGA TCGAGGAACA TTCGTTGGTG
GTAACCACAG CATTGATGTG GCTCACCAAT AGTTTGGGTT CGTTGATCAT CACCTTGCCA
ATGCTGCGTT TAGGCACGCC ATGGCTCTCA ACCCGTGGCT GGTTTGGCAC GCCCGCCGCT
TTGCAACGCA GCATTCATCG CAGCACCATC ACCCGTAGCG ACGTATTGCA ATTGATCTTT
ATGCTGACCA TCGTCGGCCT GATCGGCTGG GGTATCGATC GGGTTGCCTA TATTCCTGTG
CAATTTTTAA ATGTGTTGTT TCTCTTGCCA ATCGTGGCCT TTACTGCCCG CCATGGGTTT
GATGGCGCAT TGCTCTCGGC CACAGGCAGT ATGTTAATCG CACTCTCATT TTTCCTCGAT
GAAACCAGCC GCATGCGCCG CGTTGGCACG ATGGGCAACG ACCGTGTAGC CTTTGCTAGC
ATTGTTTTTG AATTAGGCGT GTACTACTTA ATTAGCATGA TCGGCGGTTT ATTAATTGAT
ATTCAACGCG CCGAAAAGCA ACGCCTCGCC ATGCTGAGCC ATGTCAATCG GATTTTCAAC
CAAGCGCAAA AAGAAGATTT GCTTGATCAA TTGGTGCAAA CGGTCTGTGA AGCGCTCCAA
ACCAACGTTG GAATTATGTT TCAATATGAT GCCCACAGCC AAAGCCTTAA GCCGCTAGCC
TGGCAAATGC CCAAAGATTG CTCAATTGAT GTGATCGCCA CCGCGCTATT AACGTATTTT
CCCGATTTGA AACAGATGAC CCAAACCTCG GCCAGCGCCT ACCATCACAA CGATCGTAAT
GCAGCCACCA ACACCAGCGC ATTTTGGCTC GAAACTGGCT TAGTATCGGC CTTACTCATC
CCATTGGTTG GACGCAACGG CACCTTGGGC TTGATGGCGC TGTTTGATCA ACGACCTGAG
CGCTATTTTG GCCGCAGCAA TATCAATTTT GCTGAAGCAA TTTGTTCGCA AGCAGCGATC
GCCCTCGAAC AACGCGATTT GATTACCACG CTCCAGCAAC AAACCGAGCA ACTGAATGCG
GTTTCGAATA TTACCGCCAC ACTCAACGCA ACCCTCAACC TTGAAGTGGT TTGCCATCGA
ATCGCCCAGC AAATCGAACG GGTCGTGCCC TACGATTGGG CTTGTGTGGC CTTGGCAACC
GAGCAAACCC GCTTTTTCAG CGTACTGATG AAAACTGGCC GTGCCGAAGT TGACCTGCTT
GATAAAACCT TGGTGCTTTC ACAGGAAATT TGGCAGGATT TTGGGCCGAT CGATGCGCCG
CCCTATCGCA TGATTATGAA TTCATCGCCA GTTAGTCGCG CAGCGGAGTT GCGCCAAATT
GGCTTGGCGG TAGCACTGTT GGTTCCATTG CGCCGCGATG ATCGCTGGCT GGGTGTGTTG
GTGCTTTCCA GCTTTGATCC TGATGCCTTT CCACCAGCAC ACCAACAGTT GTTGCAAATT
TTGGCGCGAC ACATGGCCTT GGCAATTTCC AATGCGCAAC TGTACCAGGA GCTTGAGCAA
GCCTACCGCG CCAAACAAGA AGCCCAAGAT GTGTTGCTGC AAACCGAGCG CTTGCGGGCT
TTGGGTGAAT TATCGAGCGG GATCGCCCAT GATTTCAACA ATTTGATTGC GGGAATTTTG
GGGCACACCC AATTATTGCT GATCGAAGCG CCCGAGGAGC AACGCGAAGG CTTGGCGGTG
ATCGAACAAG CAGCGCGTGA TGGTCGGCAT ATGGTCGAGC GGATTCAGCA ATTTACCCGT
GCTCAGCAGC CCGAAGATCA TGAAATGGTC GATTTGAATA CAATTATTAA TGATGTGATT
AAGCTGATTC GCCCACGTTG GCGGAGCCGC CCTGCCAGCA CCATGATTCA AACCCGCATT
GAAAAAGGCC AAATTCCATT AATTTATGGT TCACCTTTTG CCTTGCGCGA AGTGCTAACC
AACGTTGTAC TGAATGCGAC CGATGCCATG CCCAAAGGCG GAATTTTGAC GATCCGCACC
GAGCAAGTTG AGGCTGATGT GATTATGGAA GTCAGCGACA CAGGCATTGG CATGGATGAA
GAAACCCAAA TTCGCATGTG GGAGCCATTT TTCAGCACCA AAGGGGAGCA TGGCACAGGC
CTAGGGCTTT CGATGACCCA TGCGATTGTG GTGCAGCATC ATGGCGGGCG GGTCGCAGTG
CAAAGCGAAG TAGGTACTGG CAGCACGATT TCGCTGGTGT TCCCAATTCC GCGCCCCGAA
AGCTCCAACG ACCCCTTGCA AGTTGGGGGA GCACAAGCCT TGCAACGGGG CACAATTCTG
GTGGTCGAGA GCGATACGCG GGTCCAATCG GCGTTGGCGG GCTTGCTCGA AAGTTTAGGG
CACACGGTGG TTTGCGCCGA TAGCGGCACC TTGGCGCTCG ATTTGGCCTA TGCCCGTGAT
TTTGATGCCT TGATTGCCGA TAGCGGCCTG ACCGACATCA ATGCTTGGGA TTTGACCGAA
CTGCTGAAAA CCCGCGACCC GCTGTTGGTG GCCATGCTGT TGACGGTTTG GAATGCGCCC
GATCTTGATA CTCGCCGCCA TGTGTTTGAT GCAGTGCTGC CCAAGCCATT TGAATCGCAG
GTGTTGGGCG AAACCATCGG TTTTTTGCTG AATAATCGCG CCAAACTAGC CAATAATATG
GAGAACTAA
 
Protein sequence
MTRPLLLSFC SATITALLFG LGWLANLPMP WILALGVVQA GIMWWRTPWA VGSAISMALI 
AWFAGINQVG WLALLVGGWL AQGGLPALIF ILYRYPYDMS RESALNGFLA SGVLFNTAIS
AFCMSIVGWQ VGWIEEHSLV VTTALMWLTN SLGSLIITLP MLRLGTPWLS TRGWFGTPAA
LQRSIHRSTI TRSDVLQLIF MLTIVGLIGW GIDRVAYIPV QFLNVLFLLP IVAFTARHGF
DGALLSATGS MLIALSFFLD ETSRMRRVGT MGNDRVAFAS IVFELGVYYL ISMIGGLLID
IQRAEKQRLA MLSHVNRIFN QAQKEDLLDQ LVQTVCEALQ TNVGIMFQYD AHSQSLKPLA
WQMPKDCSID VIATALLTYF PDLKQMTQTS ASAYHHNDRN AATNTSAFWL ETGLVSALLI
PLVGRNGTLG LMALFDQRPE RYFGRSNINF AEAICSQAAI ALEQRDLITT LQQQTEQLNA
VSNITATLNA TLNLEVVCHR IAQQIERVVP YDWACVALAT EQTRFFSVLM KTGRAEVDLL
DKTLVLSQEI WQDFGPIDAP PYRMIMNSSP VSRAAELRQI GLAVALLVPL RRDDRWLGVL
VLSSFDPDAF PPAHQQLLQI LARHMALAIS NAQLYQELEQ AYRAKQEAQD VLLQTERLRA
LGELSSGIAH DFNNLIAGIL GHTQLLLIEA PEEQREGLAV IEQAARDGRH MVERIQQFTR
AQQPEDHEMV DLNTIINDVI KLIRPRWRSR PASTMIQTRI EKGQIPLIYG SPFALREVLT
NVVLNATDAM PKGGILTIRT EQVEADVIME VSDTGIGMDE ETQIRMWEPF FSTKGEHGTG
LGLSMTHAIV VQHHGGRVAV QSEVGTGSTI SLVFPIPRPE SSNDPLQVGG AQALQRGTIL
VVESDTRVQS ALAGLLESLG HTVVCADSGT LALDLAYARD FDALIADSGL TDINAWDLTE
LLKTRDPLLV AMLLTVWNAP DLDTRRHVFD AVLPKPFESQ VLGETIGFLL NNRAKLANNM
EN