Gene Haur_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1601 
Symbol 
ID5733488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1856581 
End bp1858326 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content50% 
IMG OID641278740 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001544372 
Protein GI159898125 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATA AACTCAAAAT CGACCGCGAA CAAACCCAAT GGCGCAGCAG CACAATTTTG 
CGCTCGCTGC CAATTATGTT AATTCCAATT GCGGTGGCAA TTATTAGCCT GCTCTGGCTA
CGAATGAAGC TTCCCGCCGA GCAATTCAAT CAACAACCGC AACGTGATGG CACACCAATT
ACCGCAATTG TGGTGATCAT TATGTTTGTG TCGGCCTTAA TTATCTTTGT ACGTTTAGGC
CGACCAACCA TCTCAGCCTT TGCCTTAGTG GGCTTTTGGA CACTGATCAC CACCAGTTTG
GCAGTACGCA GCGGGATCAC CAGCTTTTTT CCGGCGCTGT TGGTCTTGCC AATTTGCGCC
GCAGGCTTGT TGATTGATCG GAGTGCCAGC ATTGGTTTAG CGGTCTTAGC CAGTTTGTTG
GTAATCAGCC TCGCTTGGGC CAAACTTCAG GGCTACGATT TAGCTCGCCA AGCGCCGCCC
TCAGTCTTCA CTCGCTATCC TTGGCTGGTT TCAACCGTAT TTTGGATTAC GCTCTTTTGG
ATGGTCGCAA GCATGACCTG GCTCTTAGCG GGGAATTTAC AACGAGCCTT GCAGCAAAGT
CGTGCTCATG CCCAAGATTT GGAGCAGTTG CGCAGCCAAT TGGAGCAGCG GGTGCAAGCC
CAAACTGCTG AGCTTGCTCA ACGCACCCAT CGCGCAGAAG CATTGTATAA CATCAGTAGT
GCCCTAGCCC TACCAGCAGA TATTACCCAA ATATTACCGT TGATTGCCGA GCAAGCCCAT
CAATTGCTGC ATGTCGATTA CTCGTGGATT ATGCTCAACC ATCAGTGTCT TGCGGCCTAT
CCCGAAGCGG CTGTTCACGA ACCACACTTG CAATTTGAGC CAACCCAAAA CACATCAGTT
ATGTCAATAT CCAGCAAATT TGGCGAGCAT ACCGCCTTGG TGCTGCCATT TCAACTTGAA
AACGACCAAG TGCAATTAAT TTTAGCTGGT GCAGCAATCG CCCAAACCAA TACCGATGAT
CGTGCCCTCG CCGAGGGTTT GCGCGATCAA GCAGTGTTGG CCTTGAACAA CCAACGTTTA
TTAGCCCAAG TGCGTGATAA AGCCATGCTC GAAGAACGCA CCCGGTTAGC CCGCGATATT
CACGATACTT TGGCTCAAGG CTTAACTGGT ATTGTGATTC AGCTTGGGGC AACTCAACGG
GCCTTAGTTT ATAGCCCCAG CGAGAGTAGC ACCCACTTGG CCTTAGCTAG CCGTATGGCC
CGCGAAGCCT TGGCCGAAGC TCGCGCCTCG GTTTGGAATT TACGTGCACC CTTGCTGGAT
GCCAATAGCT TGACCAAGGC CATTCAGGAA CTCGCGGCCC ATCCCATGCG CTCTGATTTA
CAGGTTGATG TGCAGATTCA TGGCCAAGTG CTAAGACTTG ATTTGCACGA CGAAACCGCC
CTGTTACGGA TCACTCAAGA AGCTTTGGCG AATGTGGTCA AACATTCCAA AGCTGAACAT
GTCAGCATTC AATTGATCTA TGGGAATAGC AGCGTGGAGC TATTAATTCA CGACAATGGC
ACAGGCTTCG ATCAGGCGAT TTTAACCAAC CAAATGCAGA TGACCAACCA TTTTGGCCTG
ATTGGCATCC GCGAACGGGT CGCCCAAATC GGTGGTAGCC TACAATTAAG CAATCAGCAT
GGCGCGTTGG TTCATGTGCA AATTCCCTAC AATCTGCCCC AAACAGCCTT GGTTGAAATG
GAGTAA
 
Protein sequence
MSNKLKIDRE QTQWRSSTIL RSLPIMLIPI AVAIISLLWL RMKLPAEQFN QQPQRDGTPI 
TAIVVIIMFV SALIIFVRLG RPTISAFALV GFWTLITTSL AVRSGITSFF PALLVLPICA
AGLLIDRSAS IGLAVLASLL VISLAWAKLQ GYDLARQAPP SVFTRYPWLV STVFWITLFW
MVASMTWLLA GNLQRALQQS RAHAQDLEQL RSQLEQRVQA QTAELAQRTH RAEALYNISS
ALALPADITQ ILPLIAEQAH QLLHVDYSWI MLNHQCLAAY PEAAVHEPHL QFEPTQNTSV
MSISSKFGEH TALVLPFQLE NDQVQLILAG AAIAQTNTDD RALAEGLRDQ AVLALNNQRL
LAQVRDKAML EERTRLARDI HDTLAQGLTG IVIQLGATQR ALVYSPSESS THLALASRMA
REALAEARAS VWNLRAPLLD ANSLTKAIQE LAAHPMRSDL QVDVQIHGQV LRLDLHDETA
LLRITQEALA NVVKHSKAEH VSIQLIYGNS SVELLIHDNG TGFDQAILTN QMQMTNHFGL
IGIRERVAQI GGSLQLSNQH GALVHVQIPY NLPQTALVEM E