Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1458 |
Symbol | |
ID | 5736869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1697229 |
End bp | 1701044 |
Gene Length | 3816 bp |
Protein Length | 1271 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278596 |
Product | adenylate/guanylate cyclase |
Protein accession | YP_001544230 |
Protein GI | 159897983 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCAC GCTGGTTTGC TTATGTGCCG CCGTATCTTG TCCCACGATT GCTTGATCAA CGGGCCTCTT CTGCAACCCA AACGACCAAA CATGGCCAAG CTGTGGTGCT TTTCGCCGAT ATCGCGGGCT TTACGCCTCT CAGCGAGGCG CTTGGTCAAC ACGATTCGCA TGGAACTGAA ACCCTAACCC GCTTATTAAA TCGCTGTTTT GCCCCATTAA TTGATGTGAT CGAGCGCTTT GGCGGCATGA TCAGCACATT TGGCGGCGAT GCCATTACGG CGTTGTTTCC GATTAATCAG CCGCAACGGA CTCAACGGGT CGCTGCTCGC GCTGTTCGTT GTGCCCTCGA AATGCAAAGC CTGATGGAGC AATTGGGCGC AATCGAAATT CAAGCAACCA AATGGCGTTT GACCCTCAAA ATTGGGATTG CCGCAGGCCA TGTGCTCTAT ACCACGGTCG GCGATCCGAC GAAGCGCTGT GTTGCGGTGG TGGGTGGCTC GGCGTTGTTA CGCTCTGCCG AGGCCGAAAA TCAGGCTCGT TCTGGCGATG TGATTATTGA TTATGCCTCG TTGGATTGCG AGCAGTTCAA GCGTGAAACG CTCGATCAGC GTTTTGCCAA GGTGCTTGGG CTTGAGCAAA GCGTGCGGGC GCAGCCAATT CGCTGGCCTG AACCAATCAC CCCGCATCCA CCTGCCCAAT TCGATGCCTA CTTGCACCCC ACAATTGCCC AACAAGTGCG TGAAGGTCGT ACCGCGTTTA TCAACGAGCG CCGCTCAGTT ACGCTATTGT TTGTCAATTT CGATGCCCCC GATTATGATC GTGATCCATT TGCCGCCGAA CGGCTCAACC ATTACTTTCG CGAAGTGCTG CGGATCGTCG AGCGCTATGA TGGCTATCTG AATAAAGTTG AAATTGGCGA TAAAGGTAGT AGCTTTTTGG TGCTGTTTGG TGCACCAGTT GCCCACGAAA ATGATAGTGA TCGGGCGGCG CACTGTGCCT TAGAGCTTCG CGCCTTACCT GAGTTTGAGG CTCGTATCGG CATCAACACA GGATTTGTCT TTTGTGGCTT GATTGGCTCG GAGCGTCGCC AAGAATATAC CGTGATTGGC GATACAGTCA ATTTGGCGGC GCGATTGATG CAGCAAGCTG GTCAAGGCCA AATTTTGCTG ACCGAAGCTA CCCAAGAAAC CCTTAGCTCA ACCTTTTTAA CTGCGCCTTT GCCAGCCGTT CGAGTCAAAG GCCGCAGCGA ATTGGTCGAG TTACACGAAT TAACCGATTT ACGCCAGACG GCGATTCGTG GCCAAGAGCC AATTTATGCT TTGCCAATGG TCGGGCGCAG CCAAGAGTTA CAATCTGTCG GCGAATTGCT GAGTCGCATC AAGCAGGGCC ATGGCCAAGT GTTGGGGATT ACGGGCGAAG CTGGCATGGG CAAATCGCGG CTGGTGGCCG AAATTCGCCG GATTGCGCGG GCGCAACGAG TGGCGGTGTA TAGCGGCGAG TGCCTTTCGT ATGGCACAAC CATTAGCTAT TTGTTGTGGC ATAATTTGTG GCGCTCGTTT TTTAATGTTG ATCCTGAGTG GCCGCTCGAT TTGCAAATGT TGCAGTTGCG GGCGCAATTA GCCCTGATCG ATCAAGATTT ATTAGATTGG ATGCCGCTGC TGGCGAGCGC TCTACGCCTG CCGATTCCCG ATCAATCCTT GACCAAATTG CTGGATATTA AGTCGCGCAA GATGCTACTA GAATCGCTGC TGGTGGATTG TTTGCGCTAC CGTGCCAATG AAACGCCGTT ATTACTGGTG CTCGAAGATT GCCATTGGAT CGATTCGCTC TCGAACGATT TGTTGGCGCG AATTACTAAG GCGATTCGCG ATGTGCCCGT GCTGATTGTG CTGGCCTATC GACCTTCCGC CGAGCGTGAT CAAACGATGT GGCACGAGTT ACGCCAGCTT GAACATTGGC ACGAAATCGA ATTACAAGAA TTTACGCTCG ACGAAACTGC TGAATTAGTG CGACTCAAAA TCAAACAACT GCTCGGCAAT CGCCAAGCAC CATCAGAGCA GTTGGTTAGC AAACTGACCG ATCGGGCGCA GGGCAACCCA TTTTATATTG AAGAATTGAT TAATTTGGTG GTTGAACGCC AGCCCGATTT GAGCGATCCC AAGGCCGTGG CTCAACTGGA GTTGCCCGAT AGCTTACACC GCTTGATTAT CAGCCGGATC GACCAACTTG AAGAGGCGAC CAAGCATACG CTCAAAGTTG CCAGCGTGAT TGGTCGTTTA TTCAAAGCCA ATTGGCTCTG GGGCGCGTAC CCGCAATTGG GCAGTGCCGA GCAAGTTAAG CAGCAACTCA ACACTCTGAG CCGCATGGAT CTTACGCCAC TTGATCGCGA TGAGCCTGAG CTTGAATATC TGTTTAAACA TGTGGTTACC CAAGAAGTTG CCTACCAAAG CTTGCCATTG GCCACGCGTG CCGCGCTCCA CGAGCAGGTT GGCGATTATT TGGTTGAAAC CTATGGTGTG GAAGGTGCTG CCGATTTGTT GGCGCACCAC TATGGCATGA GCAATAATCT TGATAAACAG CGCAACTATT TTCGCAAGGC TGGCGATGCG GCGGCGGCAC GCTATGCCAC CGATGTGGCC TTGAGCTATT ACGAACGCTG TTTGCCCTTG CTCGCCGCCC ATGAAACCAT CGATATTTTC TTTGCAATGG GCGAAATCTA CAAACATACT GGGCGTTGGC ACGAGGCCGA TGCAATTTAT CGGCGTTTGC TGAACCAAGC CCAACAGCAA CAGCAACTTT CAGCGTTGGC TCGCGTCTGG TGTGAAATTG CCGATGTCCA AAGCAGTCAA GGTTTGCATA ACGATGCACT GCAAAGCATT CAACCAGCCT TGGAATATGC CAAACAAGCC CATGACCAGC CGACCCATTG CAAAATTTTG ATGCGCATGA GTTGGATTGC CAGCTATCAA GGCCAATTGC AGCAGGCTTG GCAAACCAGC CATGCGGCGG TGGCGATTGC CCGTGAAGCC AATGATGCCC ATAGTTTGGC TTTGGCACTG AAAACTCATG GCTATATGAA TGTGCTGCAA GGCCAATTTG CTCTAGCCGA GCAGCTATTT TCCGAAGGCC TCGCCTTACA CCGCCAACTC GGCCAGCGCG AAGATGAAGG GCGCATGTTG AATGTGATGG GCGAGGCTGC TCGTCATCGT GGCGATTTTC AGCATGCCGT CGAGCTATAT CAACAAGCCT TGGTCATTGG CCGCGAGTTG CGCAATCCTG AACGGTTAAT CATGTTTTTG AGTAATTTGG GCGGCGCATT GGTGGGGCTT GGCACGTATG AAACCGCGAT TGCCACCTTG AACGAGGCTT TTGAACTCGC TCGTTCGACC ACGTGGTATG GGATCGCGGA AACCTATCGT TTTCGGGCTG AGGCGGCGAT TGGTTTGGGC TATCCTGATG AGGCAATTCG CCATATTATC CAAGCGCATC ATGCTGCTCG CGAACGCCAA CAAACCCTCG AATTATGCGC GGCACTGCGG GTTTTGGGCA GTGCCTTGAG CCAATTGAAT GTGCCAATCC TCTTGCCGCC AAGCTTGCCA ACTACCCCAC TAAACTGCTT TGAACAGGCC TTGAGCATGG CCGAAGCTAG TGGTTCGCAA GCTGAAGTTG CGGCGATTCA GCTGGCCTTG GCCGAGCATT GGCAGCGCCA ACAACAACCA ACCAAAGCCC AAGCGGCTTG GCAGCAAGCC TACACCATCT ATGGTCAATT AGGCATGGAT GCGCTGCAAC AACGGGTAGC CCAGTGGTTG AGCTAG
|
Protein sequence | MQARWFAYVP PYLVPRLLDQ RASSATQTTK HGQAVVLFAD IAGFTPLSEA LGQHDSHGTE TLTRLLNRCF APLIDVIERF GGMISTFGGD AITALFPINQ PQRTQRVAAR AVRCALEMQS LMEQLGAIEI QATKWRLTLK IGIAAGHVLY TTVGDPTKRC VAVVGGSALL RSAEAENQAR SGDVIIDYAS LDCEQFKRET LDQRFAKVLG LEQSVRAQPI RWPEPITPHP PAQFDAYLHP TIAQQVREGR TAFINERRSV TLLFVNFDAP DYDRDPFAAE RLNHYFREVL RIVERYDGYL NKVEIGDKGS SFLVLFGAPV AHENDSDRAA HCALELRALP EFEARIGINT GFVFCGLIGS ERRQEYTVIG DTVNLAARLM QQAGQGQILL TEATQETLSS TFLTAPLPAV RVKGRSELVE LHELTDLRQT AIRGQEPIYA LPMVGRSQEL QSVGELLSRI KQGHGQVLGI TGEAGMGKSR LVAEIRRIAR AQRVAVYSGE CLSYGTTISY LLWHNLWRSF FNVDPEWPLD LQMLQLRAQL ALIDQDLLDW MPLLASALRL PIPDQSLTKL LDIKSRKMLL ESLLVDCLRY RANETPLLLV LEDCHWIDSL SNDLLARITK AIRDVPVLIV LAYRPSAERD QTMWHELRQL EHWHEIELQE FTLDETAELV RLKIKQLLGN RQAPSEQLVS KLTDRAQGNP FYIEELINLV VERQPDLSDP KAVAQLELPD SLHRLIISRI DQLEEATKHT LKVASVIGRL FKANWLWGAY PQLGSAEQVK QQLNTLSRMD LTPLDRDEPE LEYLFKHVVT QEVAYQSLPL ATRAALHEQV GDYLVETYGV EGAADLLAHH YGMSNNLDKQ RNYFRKAGDA AAARYATDVA LSYYERCLPL LAAHETIDIF FAMGEIYKHT GRWHEADAIY RRLLNQAQQQ QQLSALARVW CEIADVQSSQ GLHNDALQSI QPALEYAKQA HDQPTHCKIL MRMSWIASYQ GQLQQAWQTS HAAVAIAREA NDAHSLALAL KTHGYMNVLQ GQFALAEQLF SEGLALHRQL GQREDEGRML NVMGEAARHR GDFQHAVELY QQALVIGREL RNPERLIMFL SNLGGALVGL GTYETAIATL NEAFELARST TWYGIAETYR FRAEAAIGLG YPDEAIRHII QAHHAARERQ QTLELCAALR VLGSALSQLN VPILLPPSLP TTPLNCFEQA LSMAEASGSQ AEVAAIQLAL AEHWQRQQQP TKAQAAWQQA YTIYGQLGMD ALQQRVAQWL S
|
| |