Gene Haur_4657 details

Gene Information

Locus tag: Haur_4657
Symbol:
ID: 5736504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5949947
End bp: 5952628
Gene Length: 2682 bp
Protein Length: 893 aa
Translation table: 11
GC content: 50%
IMG OID: 641281821
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001547416
Protein GI: 159901169
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
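
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(5949947, 5952628)
print(length)                  # 2682
print(protein_length(length))  # 893
```

Under these assumptions, the computed values match the Gene Length (2682 bp) and Protein Length (893 aa) fields.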


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTCA ACCTGCGGCG ACGGCTAAAA AGTCGCATTC GGTATAAAAT CACCCTGCCA 
TATTTGCTCT TGCTAGCATT TTTGTCGGGC TTATTGATTG TGATTATTTT TTGGCTGTAT
GCTCAGAGCT TGCAGAACCA ACTCAATCAA TCACTGCGCA CCAGTGCTAC CAATACTGGC
GTGGGCTTGG AAACCATCGA AGCCCAAATT CTCGATAATT TGCGTTTGAT CGTCACGTCG
CCTGCCAACG AAAAAGAAGC TTTGCTGAGT ACCGCCGAGG CATTTAGCAA AAATGATGTT
GATCAGGTTG AGCAGATCAT CAACAATGCC TTTAATTATT TCAAGCCCTC ACGCCTAATT
GCGCTGAATG CTGATGGTCA AGTACTGGTC GATGCCGCTA GTTCAACTGT GCTGAGCCGT
AGCCCTTCGT TGGTTGGTAG CACTGAGCTA GCCAAAGATT CCTTGATTCA GGCGGTCCTG
CGCGGCCAAG TTGATGATTA TGGCGATAAA TATGCCAGCC TCTTGCGCAT CGGTGCGAAT
CCGCCTTACA CGATGTTTTT TGTGGTTGCT CCGGTTAAAT TAACGGTCGG CAGCAGTGAA
CAAGTGGTTG GGGCAATTAT TTATGCCGAG CCACTTGAGC GGATTGTCAA CGAGGAATTA
CCGCCACGCA ACGGCGCAAC CATCACCGCG ATTCTCAATA GCGATGGCAG TGTGCTCACC
AGCAATCCAC CCCAAGAGGC CGATGAGTTG CTGATTGATG CCAACCAAAT CGAGCAAATC
AAGGCCAGCC AAACTAACCC CGACCCCGAA ACTCGTGGAA CCTTATTCAC CACGATTCAA
CGCGGCGACA CGAGCTATCA AGTGATGTAT AGCCCAATGC GGATTCGACG GGCGCTGAAT
GGCTATTTTG CGGTTGGTTT GCCACGTAGC AATATTGATC GGGCTTGGCA ACAAGCGCGG
GTGCTGATTT TCCTGTTCGG CGTGCTTTCG CTAATTTCAA TTATTTGGAT CAGCGTGCGG
GTAACCCGTA GCATTACTGT TCCCCTTAGC GAGTTGGTCA GTACCGCCTT GCGGATTAAA
AGCGGCGATT TGGAAGGCCG TAGTTTTGTT TCCGAGGAAA ACGAACTTGG CACGTTGGCG
ACAGTCTTAA ATGATATGAC TGATCGCTTG CTCGATTTGT ATCGTACCAG CCGCCAACTG
GGCACAGAAC TCACTCTTGA TGGCGTTTTG ACCCAAACCA CTGCTGCGGT TCAACGCTTG
ATTCCCGATT CTGAAGTTGA TGCCTTGGTG GTTGAGCAAG GCGTTTGGCA CTATGTCACA
ACTGAAAGCG TGCGCGAAAT CGAGCGTCCC TTCCCTGCGA CTAGCCTTGA AACGCTTGCG
CCAATGGTGA CGATTACCGA AAATCCAGCC TTGGATAGCG CCTTGCGGCC ACTTGCCCCA
AATCTGCAAT TAGTGTTGCC GTTGCGCACT CAGCAACAGG TGATTGGCAG CCTGTTGGTT
AAGAATGATC GCGCTTTGCC ACCAGCCAGC AGCATTCGCG AACCACTCAG CGCAATTGCC
AGCATGACCG CCACGGCCAT GCAAAATGCG GTGCTTTACA CCACCGTCCA AGATGAAGCT
AGCCGTAAAC AGGCAATTCT GCAAAGTATC GCCGACGGCG TAATTGTGCT TGATCCTGAT
GGCAAAGTGA TTTTGGTCAA CCACACCGCC ACGGCCATGC TGGGAGCCAG CGAAGCCGAT
TTGCTAGGCA AAAGCTTTGC GGAATTCCAG CTTACGCCAT TGTCGGGCGG CGCTGAATTA
TTTGCCACGC CCACAGCCAC AACCTTCTAT GAGACAACCT CGCAGCGCAT TTTGACTATG
AATGCAGCGC CCGTTGAGCG CGAAGGCTTG CAATCGGGCG AAGTTTTGGT GCTGCACGAT
GTGACCGAAG AACGGGCGAT GGATCGCGCC AAAACCGATT TTATCGCCAC AATTTCGCAC
GAACTACGCA CGCCATTGAC CTCAATTTGC GGTTATGCCG ATTTGCTATT GCGCGGTTTT
GTCGGGCCTT TGACCGACGA ACAAACCCAA TTTATGAGCA CCATTCGTCA GCAAGGCCAA
AGCATGGTCG AAGTGCTGCA AAATGTGATT GTGATTGCCA GTATTGAAGC TGGGAATATG
GAACCACAAA TTGAGCAGCA TTCGCTGGAC GAGTTGTTGC CGCCAATTGC CCAAGCTTTA
CAAAAAGGCT TTGATCAAAA GCAACTCAAA TTGATCATTG ACCTGCCTGA GAACATGCCG
TACATTGTAG TGGATCGCGA TCACTTCAAA ATTATGTTGA CCCAACTGCT GGAAAATGCC
CGTCGCTATA CCCAAACTGG CACAGTGACG GTGCAGGCCA AAGCGGTTGA GCAGCGAATT
CAAATTGCAG TAATTGATAC TGGGCCTGGG ATTGCTAGCC AAGATTTTGA GCGTTTGTTT
GAGCGCTTTC AACGTGGCGG TGAGCAAAGC GGCCTGACTT CCAAAGAGCG CGGTATCGGC
CTTGGTTTGG CAATTACCAA ACAATTGGTC GAGCAGAATA ATGGTAAAAT TTGGGTTGAG
AGCGAGCAAG GAGTTGGTTC AAGCTTTATT ATGCAATTTC CGCTGATGCA GCTTGAGCCA
AGCGAATATA ACTTAGTGAA TGCAACGGCA ACCAACGCAT GA
 
Protein sequence
MFVNLRRRLK SRIRYKITLP YLLLLAFLSG LLIVIIFWLY AQSLQNQLNQ SLRTSATNTG 
VGLETIEAQI LDNLRLIVTS PANEKEALLS TAEAFSKNDV DQVEQIINNA FNYFKPSRLI
ALNADGQVLV DAASSTVLSR SPSLVGSTEL AKDSLIQAVL RGQVDDYGDK YASLLRIGAN
PPYTMFFVVA PVKLTVGSSE QVVGAIIYAE PLERIVNEEL PPRNGATITA ILNSDGSVLT
SNPPQEADEL LIDANQIEQI KASQTNPDPE TRGTLFTTIQ RGDTSYQVMY SPMRIRRALN
GYFAVGLPRS NIDRAWQQAR VLIFLFGVLS LISIIWISVR VTRSITVPLS ELVSTALRIK
SGDLEGRSFV SEENELGTLA TVLNDMTDRL LDLYRTSRQL GTELTLDGVL TQTTAAVQRL
IPDSEVDALV VEQGVWHYVT TESVREIERP FPATSLETLA PMVTITENPA LDSALRPLAP
NLQLVLPLRT QQQVIGSLLV KNDRALPPAS SIREPLSAIA SMTATAMQNA VLYTTVQDEA
SRKQAILQSI ADGVIVLDPD GKVILVNHTA TAMLGASEAD LLGKSFAEFQ LTPLSGGAEL
FATPTATTFY ETTSQRILTM NAAPVEREGL QSGEVLVLHD VTEERAMDRA KTDFIATISH
ELRTPLTSIC GYADLLLRGF VGPLTDEQTQ FMSTIRQQGQ SMVEVLQNVI VIASIEAGNM
EPQIEQHSLD ELLPPIAQAL QKGFDQKQLK LIIDLPENMP YIVVDRDHFK IMLTQLLENA
RRYTQTGTVT VQAKAVEQRI QIAVIDTGPG IASQDFERLF ERFQRGGEQS GLTSKERGIG
LGLAITKQLV EQNNGKIWVE SEQGVGSSFI MQFPLMQLEP SEYNLVNATA TNA
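
Both sequences are printed in space-separated blocks of ten, so the whitespace has to be stripped before any analysis. The sketch below is a minimal illustration in plain Python, not the database's own tooling; the function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it does not handle alternative start codons, which is harmless here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGTTTGTCA ACCTGCGGCG ACGGCTAAAA AGTCGCATTC GGTATAAAAT CACCCTGCCA"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MFVNLRRRLKSRIRYKITLP
```

The translated residues match the first two blocks of the protein sequence. Applying `clean_sequence` to the complete gene sequence and passing the result to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.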