Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2161 |
Symbol | |
ID | 5734034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2724359 |
End bp | 2727421 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279302 |
Product | adenylate/guanylate cyclase with GAF sensor(s) |
Protein accession | YP_001544929 |
Protein GI | 159898682 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0428251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGATG CAAGCAATGG CCATGTGTTA ATCGTCGAAG ATGACCTCAG TATTAGCCGG ATGTTTCAAC TGCTGCTGCG CGATGCTGGC TATCGCGTAT CGTTGGCTAA TAGCAGTGAA GAAGCTTTGC AATTCGTTCG TGTCATCACT CCCGATATTA TTTTGTTGGA TATTTCCTTG CCTGGCATGA ATGGGGTTGA TCTGACCCGC CATTTGCGCA GTAACCCCGC AATTCCCTTT ATTCCAATTA TTCAAATCAC GGCGCTTGGC GATTTGCGCA CCAAAGTTGC AGCGCTCGAC GCAGGAGCCG ATGATATTTT GGTCAAGCCA ATCGAGCTAT CCGAGCTTTT GGCGCGTTTG CGGGTGATGC TGCGGCTACA AAAAAATCGC CGTGGCCTCG AAGAATCGAC CCGCCAAATG CATATTTTAT ATGCGATCAG CCAAACGCTC AACAGCTCCC TAGATATTAA TAGTATTTTG CGCGATTTGG TGCTGCAATT AGCCAATGCT TTGGGCGCAG TTCGCACCTC GATTATTTTA GTTAACGATC ATAACACGCC ATTTTATGCC TCATCGGTTA AAGAGACCCT CAACACCGAG AGTATTCAAC GGGTCATTCG CGATGGTGTG GCGGGCTGGG TTATGCGGGC CATGAAACCG TTAATCATCG CCGATACAAC CCGCGATCCC CGTTGGATCT CACTTGATCG CCGCACTGAA ACCACCCGCT CGATTTTATC AGTGCCGTTG ATTCATCAAG GGATTGTGGT TGGTGTGGTC ACCGCTGCAC ATACGCGTAC CAATTATTTT ACCCAAGAGC ATCTTGAATT AGCCCAAAAT ATCGGCAATC AAAGCGTTAC GGCGTTTAAT CACGCTCAAC TTTTTCAAAC GACTGTTCAG CAAAAGAGCT TGCTCGAACG CCGTTCGCAA ATGTTGGAGG AATTGTTGAA CGTCGGCGAG CGCTTGAGTC TCAATTTGCC CTTACAAGAT GTCTTGAATG AGCTAGCTCA AGCGATTCAT CGTTCGTTGG GCTTTCGCCA AGTGATTATT CATGTGTTTG GAATTGATGA TGTTGAGCCA GCCCAAGGCG TAGCTGGGGT TCAGCGCGAA CAAATTCAGC AACTCTGGAC AAATGCTCAG ATTCAGCAAA GCATTGTGCC CTTGCTGCGT GAGCGGTTTC GCATCAGCCG CTCGTATTTC GTGCCTTCAG GCTATAAACT CGACGATGAA GAAACCAATC AACTGCCGTT TGGAGCGATT GGCATTCGCC AAGCTGATTA TCTGTTTGTG CCCCTAGGCA GCCCGCAACA ATTGCTTGGG CTGATGATCG TCGATCTGCC CAACAATCTG ATCACCCCCG ATTTGGCCAC GATTCAGGCG CTCGAAGTCT TCGCCAACCA TGGCACAACC GCCGTCAACA ATAATCGCTT GTTTGCTCGC GAACATACCC GGGCCAATCA ATTGCAATTA TTGGTTGAGC TAGGCCGTAA TTTCGCCGAG TTAATGACTC CTGATCAATT ATTGCGCTTG GTTGCCTCAT TGGTTCGCCA TAGTTTTGGC TTTAACAGCG TAGCAATTTT GCTCAAGCAC AACAATGAAT TTATGCTGCG GGCCGGAACT CACAATTTCG CCCATCATCC CTACAACCAG CCACTGAGCA TTGATGCGCG GATTTTACAA GCCTTAGAGC ATTACCAAAG TTGGCTCAGC AGCAATAGCG AGCAATTTTG CTTGAATTAT GAGTGGGGCG AGCCAACGAT CGTCGCCGAA TTGATCGTGC CCCTGCATAC CCACGAAGGA ATTCAAGGCC TCTTGGTAAT TGGGCATAGC GATGCAAGCG CCTTGGATGG CCTCACTCAA TCAATTTTAG GGGCAATTGC CGTTCAGTTG ACGGTTGGTT TGGATAACGC CAAGCTGTTT TTGCGCGAAC AAGAATATAT TCAACAACTC AATCGCGTTA ACGACATAGC CCTACAACTA ACCAGCACCA CCACCGAGCC AGCATTTCAT GAGATTATGC AGCAAGTTGC AGCCATGTTT CAGGCTCCCC AAAGTATGTT GGTGTTGGTT GATCCCGAGC AGCGCCAGAT TAACCGCAGC ATTGCCTATC CGCAATCATG GTTGCACGAG CTACCAGCCT TGCCTGAACT GTTGGCTAGC CTTGATGAAA GTCGGGTACA AATGCTCAAC CATCAACATG CCTCGGCCTT AGGTCAAACA TTACGTCAAT CAGAAATTAA CACGATCGTG GTAGCCCAAT TGTATAGCGC CGAACGCTTG CACGGCCTTT TGTTGATTAA GCCCGCTGAT ATACACTACG TTTGGCGCTC GAATGACCGT AATTTGCTCC AAACCCTTGC CACCTTATTC GCCCAAGCCT TGGAAAACCA ACAGCTGCAA GCCCAACGCC TAGAACGCTT ACGCGCCGAT TTGCAGCGCT ATATGGCTCC ACCACTAGTT GAACAATTGC TGAGCGAAGG TGGTTTTGGC GAGGCCAGCG AGCGTGATAT TGTGGTGTTA TTTGCCGATT TACGCGGTTT TACGGCGCTC AGCGAAAATC TCGCGCCCCA AGTCGTGGTC AAGCAAATTC TCAATCGATT TTTCGATGAA ATGACTGCCG TGCTCTATCG TTATGATGCA ATTATCGATA AGTTTTTGGG CGATGGCCTG ATGGCGGTTT TTGGCTCAGT GCGACCATTG CCCAACGATG CAGAACGCGC CATGAACGCC GCGATCGATA TGCAACAAAC CTTTGCCAAA TTGCAAACAG AATGGCAAGC CAGCTTTGGC TATGCGATTG GCCTAGGCAT TGGCATGAGT TGTGGGCGGG CGGTGGTTGG CAATATCGGC TCAGCCCAAC GCATGGATTA CACCGTGATT GGCGATGTGG TAAATACAGC TAGCCGCTTG GTGGGGATTG CCGAGGCTGG GCAGATTATC ATCACCCAGC CGTTGGCCCA ACGTTTAAAG CGGCATAAAC GCCAACTAGA GGAGCTTGAA CCAGTCCAAC TCAAAGGTAA ACGCGAATTA CAAGCGATCT ACGCCATTCG CAAGTCGCGC TAA
|
Protein sequence | MPDASNGHVL IVEDDLSISR MFQLLLRDAG YRVSLANSSE EALQFVRVIT PDIILLDISL PGMNGVDLTR HLRSNPAIPF IPIIQITALG DLRTKVAALD AGADDILVKP IELSELLARL RVMLRLQKNR RGLEESTRQM HILYAISQTL NSSLDINSIL RDLVLQLANA LGAVRTSIIL VNDHNTPFYA SSVKETLNTE SIQRVIRDGV AGWVMRAMKP LIIADTTRDP RWISLDRRTE TTRSILSVPL IHQGIVVGVV TAAHTRTNYF TQEHLELAQN IGNQSVTAFN HAQLFQTTVQ QKSLLERRSQ MLEELLNVGE RLSLNLPLQD VLNELAQAIH RSLGFRQVII HVFGIDDVEP AQGVAGVQRE QIQQLWTNAQ IQQSIVPLLR ERFRISRSYF VPSGYKLDDE ETNQLPFGAI GIRQADYLFV PLGSPQQLLG LMIVDLPNNL ITPDLATIQA LEVFANHGTT AVNNNRLFAR EHTRANQLQL LVELGRNFAE LMTPDQLLRL VASLVRHSFG FNSVAILLKH NNEFMLRAGT HNFAHHPYNQ PLSIDARILQ ALEHYQSWLS SNSEQFCLNY EWGEPTIVAE LIVPLHTHEG IQGLLVIGHS DASALDGLTQ SILGAIAVQL TVGLDNAKLF LREQEYIQQL NRVNDIALQL TSTTTEPAFH EIMQQVAAMF QAPQSMLVLV DPEQRQINRS IAYPQSWLHE LPALPELLAS LDESRVQMLN HQHASALGQT LRQSEINTIV VAQLYSAERL HGLLLIKPAD IHYVWRSNDR NLLQTLATLF AQALENQQLQ AQRLERLRAD LQRYMAPPLV EQLLSEGGFG EASERDIVVL FADLRGFTAL SENLAPQVVV KQILNRFFDE MTAVLYRYDA IIDKFLGDGL MAVFGSVRPL PNDAERAMNA AIDMQQTFAK LQTEWQASFG YAIGLGIGMS CGRAVVGNIG SAQRMDYTVI GDVVNTASRL VGIAEAGQII ITQPLAQRLK RHKRQLEELE PVQLKGKREL QAIYAIRKSR
|
| |