Gene Haur_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2161 
Symbol 
ID5734034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2724359 
End bp2727421 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content50% 
IMG OID641279302 
Productadenylate/guanylate cyclase with GAF sensor(s) 
Protein accessionYP_001544929 
Protein GI159898682 
COG category[T] Signal transduction mechanisms 
COG ID[COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0428251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGATG CAAGCAATGG CCATGTGTTA ATCGTCGAAG ATGACCTCAG TATTAGCCGG 
ATGTTTCAAC TGCTGCTGCG CGATGCTGGC TATCGCGTAT CGTTGGCTAA TAGCAGTGAA
GAAGCTTTGC AATTCGTTCG TGTCATCACT CCCGATATTA TTTTGTTGGA TATTTCCTTG
CCTGGCATGA ATGGGGTTGA TCTGACCCGC CATTTGCGCA GTAACCCCGC AATTCCCTTT
ATTCCAATTA TTCAAATCAC GGCGCTTGGC GATTTGCGCA CCAAAGTTGC AGCGCTCGAC
GCAGGAGCCG ATGATATTTT GGTCAAGCCA ATCGAGCTAT CCGAGCTTTT GGCGCGTTTG
CGGGTGATGC TGCGGCTACA AAAAAATCGC CGTGGCCTCG AAGAATCGAC CCGCCAAATG
CATATTTTAT ATGCGATCAG CCAAACGCTC AACAGCTCCC TAGATATTAA TAGTATTTTG
CGCGATTTGG TGCTGCAATT AGCCAATGCT TTGGGCGCAG TTCGCACCTC GATTATTTTA
GTTAACGATC ATAACACGCC ATTTTATGCC TCATCGGTTA AAGAGACCCT CAACACCGAG
AGTATTCAAC GGGTCATTCG CGATGGTGTG GCGGGCTGGG TTATGCGGGC CATGAAACCG
TTAATCATCG CCGATACAAC CCGCGATCCC CGTTGGATCT CACTTGATCG CCGCACTGAA
ACCACCCGCT CGATTTTATC AGTGCCGTTG ATTCATCAAG GGATTGTGGT TGGTGTGGTC
ACCGCTGCAC ATACGCGTAC CAATTATTTT ACCCAAGAGC ATCTTGAATT AGCCCAAAAT
ATCGGCAATC AAAGCGTTAC GGCGTTTAAT CACGCTCAAC TTTTTCAAAC GACTGTTCAG
CAAAAGAGCT TGCTCGAACG CCGTTCGCAA ATGTTGGAGG AATTGTTGAA CGTCGGCGAG
CGCTTGAGTC TCAATTTGCC CTTACAAGAT GTCTTGAATG AGCTAGCTCA AGCGATTCAT
CGTTCGTTGG GCTTTCGCCA AGTGATTATT CATGTGTTTG GAATTGATGA TGTTGAGCCA
GCCCAAGGCG TAGCTGGGGT TCAGCGCGAA CAAATTCAGC AACTCTGGAC AAATGCTCAG
ATTCAGCAAA GCATTGTGCC CTTGCTGCGT GAGCGGTTTC GCATCAGCCG CTCGTATTTC
GTGCCTTCAG GCTATAAACT CGACGATGAA GAAACCAATC AACTGCCGTT TGGAGCGATT
GGCATTCGCC AAGCTGATTA TCTGTTTGTG CCCCTAGGCA GCCCGCAACA ATTGCTTGGG
CTGATGATCG TCGATCTGCC CAACAATCTG ATCACCCCCG ATTTGGCCAC GATTCAGGCG
CTCGAAGTCT TCGCCAACCA TGGCACAACC GCCGTCAACA ATAATCGCTT GTTTGCTCGC
GAACATACCC GGGCCAATCA ATTGCAATTA TTGGTTGAGC TAGGCCGTAA TTTCGCCGAG
TTAATGACTC CTGATCAATT ATTGCGCTTG GTTGCCTCAT TGGTTCGCCA TAGTTTTGGC
TTTAACAGCG TAGCAATTTT GCTCAAGCAC AACAATGAAT TTATGCTGCG GGCCGGAACT
CACAATTTCG CCCATCATCC CTACAACCAG CCACTGAGCA TTGATGCGCG GATTTTACAA
GCCTTAGAGC ATTACCAAAG TTGGCTCAGC AGCAATAGCG AGCAATTTTG CTTGAATTAT
GAGTGGGGCG AGCCAACGAT CGTCGCCGAA TTGATCGTGC CCCTGCATAC CCACGAAGGA
ATTCAAGGCC TCTTGGTAAT TGGGCATAGC GATGCAAGCG CCTTGGATGG CCTCACTCAA
TCAATTTTAG GGGCAATTGC CGTTCAGTTG ACGGTTGGTT TGGATAACGC CAAGCTGTTT
TTGCGCGAAC AAGAATATAT TCAACAACTC AATCGCGTTA ACGACATAGC CCTACAACTA
ACCAGCACCA CCACCGAGCC AGCATTTCAT GAGATTATGC AGCAAGTTGC AGCCATGTTT
CAGGCTCCCC AAAGTATGTT GGTGTTGGTT GATCCCGAGC AGCGCCAGAT TAACCGCAGC
ATTGCCTATC CGCAATCATG GTTGCACGAG CTACCAGCCT TGCCTGAACT GTTGGCTAGC
CTTGATGAAA GTCGGGTACA AATGCTCAAC CATCAACATG CCTCGGCCTT AGGTCAAACA
TTACGTCAAT CAGAAATTAA CACGATCGTG GTAGCCCAAT TGTATAGCGC CGAACGCTTG
CACGGCCTTT TGTTGATTAA GCCCGCTGAT ATACACTACG TTTGGCGCTC GAATGACCGT
AATTTGCTCC AAACCCTTGC CACCTTATTC GCCCAAGCCT TGGAAAACCA ACAGCTGCAA
GCCCAACGCC TAGAACGCTT ACGCGCCGAT TTGCAGCGCT ATATGGCTCC ACCACTAGTT
GAACAATTGC TGAGCGAAGG TGGTTTTGGC GAGGCCAGCG AGCGTGATAT TGTGGTGTTA
TTTGCCGATT TACGCGGTTT TACGGCGCTC AGCGAAAATC TCGCGCCCCA AGTCGTGGTC
AAGCAAATTC TCAATCGATT TTTCGATGAA ATGACTGCCG TGCTCTATCG TTATGATGCA
ATTATCGATA AGTTTTTGGG CGATGGCCTG ATGGCGGTTT TTGGCTCAGT GCGACCATTG
CCCAACGATG CAGAACGCGC CATGAACGCC GCGATCGATA TGCAACAAAC CTTTGCCAAA
TTGCAAACAG AATGGCAAGC CAGCTTTGGC TATGCGATTG GCCTAGGCAT TGGCATGAGT
TGTGGGCGGG CGGTGGTTGG CAATATCGGC TCAGCCCAAC GCATGGATTA CACCGTGATT
GGCGATGTGG TAAATACAGC TAGCCGCTTG GTGGGGATTG CCGAGGCTGG GCAGATTATC
ATCACCCAGC CGTTGGCCCA ACGTTTAAAG CGGCATAAAC GCCAACTAGA GGAGCTTGAA
CCAGTCCAAC TCAAAGGTAA ACGCGAATTA CAAGCGATCT ACGCCATTCG CAAGTCGCGC
TAA
 
Protein sequence
MPDASNGHVL IVEDDLSISR MFQLLLRDAG YRVSLANSSE EALQFVRVIT PDIILLDISL 
PGMNGVDLTR HLRSNPAIPF IPIIQITALG DLRTKVAALD AGADDILVKP IELSELLARL
RVMLRLQKNR RGLEESTRQM HILYAISQTL NSSLDINSIL RDLVLQLANA LGAVRTSIIL
VNDHNTPFYA SSVKETLNTE SIQRVIRDGV AGWVMRAMKP LIIADTTRDP RWISLDRRTE
TTRSILSVPL IHQGIVVGVV TAAHTRTNYF TQEHLELAQN IGNQSVTAFN HAQLFQTTVQ
QKSLLERRSQ MLEELLNVGE RLSLNLPLQD VLNELAQAIH RSLGFRQVII HVFGIDDVEP
AQGVAGVQRE QIQQLWTNAQ IQQSIVPLLR ERFRISRSYF VPSGYKLDDE ETNQLPFGAI
GIRQADYLFV PLGSPQQLLG LMIVDLPNNL ITPDLATIQA LEVFANHGTT AVNNNRLFAR
EHTRANQLQL LVELGRNFAE LMTPDQLLRL VASLVRHSFG FNSVAILLKH NNEFMLRAGT
HNFAHHPYNQ PLSIDARILQ ALEHYQSWLS SNSEQFCLNY EWGEPTIVAE LIVPLHTHEG
IQGLLVIGHS DASALDGLTQ SILGAIAVQL TVGLDNAKLF LREQEYIQQL NRVNDIALQL
TSTTTEPAFH EIMQQVAAMF QAPQSMLVLV DPEQRQINRS IAYPQSWLHE LPALPELLAS
LDESRVQMLN HQHASALGQT LRQSEINTIV VAQLYSAERL HGLLLIKPAD IHYVWRSNDR
NLLQTLATLF AQALENQQLQ AQRLERLRAD LQRYMAPPLV EQLLSEGGFG EASERDIVVL
FADLRGFTAL SENLAPQVVV KQILNRFFDE MTAVLYRYDA IIDKFLGDGL MAVFGSVRPL
PNDAERAMNA AIDMQQTFAK LQTEWQASFG YAIGLGIGMS CGRAVVGNIG SAQRMDYTVI
GDVVNTASRL VGIAEAGQII ITQPLAQRLK RHKRQLEELE PVQLKGKREL QAIYAIRKSR