Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3586 |
Symbol | |
ID | 5735447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4511037 |
End bp | 4514408 |
Gene Length | 3372 bp |
Protein Length | 1123 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280735 |
Product | adenylate/guanylate cyclase |
Protein accession | YP_001546350 |
Protein GI | 159900103 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0258664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTTC AAACAACTAC GTGCTGCTCA ACCTGCAATT TTGCCGAAAA CCCGCCGAGT GCGCGGTTTT GTGGCAATTG CGCTAGTCCT TTACCGCTGC ACTGTTTTAG CTGTGGCGCG GCCAATCCAC CTGGCTTTCG CTTCTGCGGC ACATGCGCCA CTGGCTTGAT TGAGTATATT CCTGCCGCCA CAACACGGCG CGTGGTCACA ATTCTGTTTG CCGATGTTTG CAACTACACC AACCTGACCA ATCGGCTGGG TGCTGAGCAT ATGTATAGTT TGCTCGACCC ATGTTTGCGA CGCTTGGGCG AAACGGTACG GCGCTTCGAA GGTACAATCG ATAAATTTAC TGGCGATGGT TTGATGGCGG TTTTTGGCAT GCCAGTCGCG CTCGAAGCGC ATGCCGCCCA AGCTGCCTAC GCCGCATTAG CCATGTTTGA AGAATTAGCC GCCTACAACG AAACAATTGC CCATGCTGGA GTGCAATTTC AGATTCGGGT CGGCTTAGCT AGTGGTGAAG TGATTGCAGG GTTGCTTGGG TCTGATCGCT ACAGCGATTT GACCGTGATC GGCGGCCCAG TCAACTTGGC GGCGCGGCTG CAACAAGTAA CCGATCCTGG CACGATTATG GTCGATGGCG ATTTAGCCCA ATCATTACAA ACCAGTTTTA TGTTCAACGA GCAGCATCCA GTTGCGCTCA AAGGCTTTGA GCAACCTATC ACTGCGGCTA TTTTAGCGGG CAAACAATTG GCCCAGGCGG TCGTTGATGG CAGTTTTGGC TTGCCTTTGA TTGGCCGCGA GGCCGAATTG CGGAATCTGA TCGGCGCAAC CGAGCTACTA CACAACGGCA TGGGCGGCGT GATTGGCCTA ACTGGCCGGG CTGGCGTGGG CAAAACCCGC CTGACCGCCG AAATTATTCA GCATTTGCAT AGCCGTGAGG TCAAAGTATT GACCAACGAT TGTAGCAGCG CCACTCGAAT TGTGCCCTAT AGCGCCTTTT TAGGCCTGAT TCGCCAACTT TTCCAGATTG ATCATGCCGA TTCTGCCGCC ACAATTCATC ATAAAATCCA ATTGATGATT CATAGTTTGG GCAATATGCC GCTCGATTTG CTGCCCTACA TTGAATATTT GCTCTCGCTG GATTTTATCG ATCAAAGTTT GGTTGAGCGG GTGCAACATC TTGACGCAGC TCAATTGAAG CGCCATGTCT TTTTGGCAAT TCGCGAATTA TTGGTAGCCT GTACCCGCCA ACAATCGATG GCAATTGTGA TCGACGATGT GCAATGGGCC GATGATCTAT CGTTGGAATT GCTCGAATTT ATTGCCGAAA CCTTTGATGC TTCGTCGTTG CTGCTCTATT TGGTAGCGCG TGATGATGAA ACGCCGCAGA TTCGCCAAAC CTTCAATCGG ATTTTAGCGC AAGCCCGCCA ACGTGGCTTA GTGCTTGAAC TTAATTCGCT TAGCCCTGAA GCAGCCGAAG CCTTGATCAA ACAAATTCTG CCGCAAGCTT CAGCCGTGAT GATCGATCAT TTGGTGCGCC AAGCCGATGG TGTGCCCTTC TATTTAGAAG AATTAGCGCG TCATGCCCTG CATGCTGGCC TTGATATTCA GGCTCAAAGC AGTGATAGCC TACAAATACC GTCAATGCCG CTTTCGCTGC AAGCGCTGAT GCGTTCGCGC TACGATCGTT TGCCCAACGA TTTGGCGCAT AGTTTGGCGC TGGCGGCGGT GATTGGCCGC CGTTTTAGCC AGCAACTACT CTGCCAACTA CGGCCTGAGC AACCGATCAA TCAGCAACTC CAACAATTGC AGCAACGAGG GTTTGTACAG CCAATCGCCA ACCATCACGA TGATTGGGCA TTTAGTCATT TGCTGCTGCA AGAAACAATT TACGCCAGCC TGTTGCAGCA ACAACGCTAT AACTTGCACG GCGAGGTTGC GCTAGCACTC GAAACCCACG ATGAACAACG GCCTGAATTA TATCTTGATG CCTTGGCCTA TCACTTCAGC CGTTCGCAAC TGACTCACAA AGCGTTGGAA TATTTGTTGT TGGCGGCGCA ACGAGCAGCT AGCCGTTTTG CTAACGACGA TGCGCTGCGC TTGTATGATC AAGCTATGCC CTTGCTCAGC GAGTTGGATC ACCCCGCAGC CGAACAAGCC GTGAGTTTGT TTCATGGGCG TGGCGATGTG CTCAGCTTTG TTGGGCGCTA CGATGAAGCT CGCATGGCCT ATCAACAAGC GATGAGCATG ATTCAGCCAA ACCCAGAAAC CCAAGCGACC CATGCCACGA TTTTGCGCCA ACTAGCGGCA ACCTATGAAA AACAGGGCAA TTACGACGAA GCCATGCTGC ATTTAGACCA AGCTCGTTTG GTTTTGGCCG ATGGAGCCTT GTTTGATCAT GCCCGCATTG ATTGCGATGC CGGCTGGATC GCCTTTCATC GCGGCAATTT GACCCAGGCT GAGCGGTTGC TCAACAACGC CTTGACCATC AGCGAATTGA ATCAACACCA TGGCTTGCAA GCGCTGGCCG CCAATCGTTT GGCGGGGGTC TATTGGCAAC GGGGAAGCCT TAAAGCCGCC CAAGCCTTGG TCAACCAAAG TTTGGCGGTC AGTCGCTATT TAGGCGATTT ACCAGCAATG AGTCGGGCGC TCAATAACCT TGGGGTGATT GCGATGGAGC TGGTCGATTG GCATGCTGCC GCCAATTACT ACGAGCAAAG CCTAGAAACC ACCCAAGCTA TCGGCGATTT GAATGGCCAG ATTTTGGCGA TTAACAATCG TTCGCATTGT ATGTTGATGC TTGGATTGCT CAACGATGCG CTACGATTCT CGCAAATGGC TTTTCGTTTG GCCCGCCAAA TTGGCGCAAA ATTACATATG GCCACAGCAT TAATTCAACA AGGCACGATT TCATTTTATG TTCAAGATTA TCCACGTGCT CGGCGCTACT ATCTCCAAGC CGAGCAGCTC TTTCGTGAAT TAGGCCACTA CGACCCCAAG CGAGTAATTT TGGCCGAAAT GCTTGGGCGA ATCGCGTTCG TGGAAGGTCG GCGAGCTTGC GCCAATCGGA TGGCTGCTCG TGCTTTGCGG CTGGCCCAAC AACTCAATGA ACCACAATCG CTATTTCGCA GCCAAGCCTT GCAGATCTAT TTGCAAGCCT ATAACGGCCA ACGCCGCCAA GCCAGCGAGG CGATTGATCA ACTGCTGCAA TTATCGGTTA ACAATAATTA TTTGTTGGCT TTAGGCTTGA TCATTGGCGC GACGATTAAG CGCCTGAATG GCGATTACAC GGTTGCCCTA GCCCATCAAG AACGTGCTGA CATTTTATTT GAGCAGATGA ATACCCCAGC GATGCTCCGC GAATACATGT AA
|
Protein sequence | MTVQTTTCCS TCNFAENPPS ARFCGNCASP LPLHCFSCGA ANPPGFRFCG TCATGLIEYI PAATTRRVVT ILFADVCNYT NLTNRLGAEH MYSLLDPCLR RLGETVRRFE GTIDKFTGDG LMAVFGMPVA LEAHAAQAAY AALAMFEELA AYNETIAHAG VQFQIRVGLA SGEVIAGLLG SDRYSDLTVI GGPVNLAARL QQVTDPGTIM VDGDLAQSLQ TSFMFNEQHP VALKGFEQPI TAAILAGKQL AQAVVDGSFG LPLIGREAEL RNLIGATELL HNGMGGVIGL TGRAGVGKTR LTAEIIQHLH SREVKVLTND CSSATRIVPY SAFLGLIRQL FQIDHADSAA TIHHKIQLMI HSLGNMPLDL LPYIEYLLSL DFIDQSLVER VQHLDAAQLK RHVFLAIREL LVACTRQQSM AIVIDDVQWA DDLSLELLEF IAETFDASSL LLYLVARDDE TPQIRQTFNR ILAQARQRGL VLELNSLSPE AAEALIKQIL PQASAVMIDH LVRQADGVPF YLEELARHAL HAGLDIQAQS SDSLQIPSMP LSLQALMRSR YDRLPNDLAH SLALAAVIGR RFSQQLLCQL RPEQPINQQL QQLQQRGFVQ PIANHHDDWA FSHLLLQETI YASLLQQQRY NLHGEVALAL ETHDEQRPEL YLDALAYHFS RSQLTHKALE YLLLAAQRAA SRFANDDALR LYDQAMPLLS ELDHPAAEQA VSLFHGRGDV LSFVGRYDEA RMAYQQAMSM IQPNPETQAT HATILRQLAA TYEKQGNYDE AMLHLDQARL VLADGALFDH ARIDCDAGWI AFHRGNLTQA ERLLNNALTI SELNQHHGLQ ALAANRLAGV YWQRGSLKAA QALVNQSLAV SRYLGDLPAM SRALNNLGVI AMELVDWHAA ANYYEQSLET TQAIGDLNGQ ILAINNRSHC MLMLGLLNDA LRFSQMAFRL ARQIGAKLHM ATALIQQGTI SFYVQDYPRA RRYYLQAEQL FRELGHYDPK RVILAEMLGR IAFVEGRRAC ANRMAARALR LAQQLNEPQS LFRSQALQIY LQAYNGQRRQ ASEAIDQLLQ LSVNNNYLLA LGLIIGATIK RLNGDYTVAL AHQERADILF EQMNTPAMLR EYM
|
| |