Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1214 |
Symbol | |
ID | 5733107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1398604 |
End bp | 1401996 |
Gene Length | 3393 bp |
Protein Length | 1130 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278354 |
Product | adenylate/guanylate cyclase |
Protein accession | YP_001543990 |
Protein GI | 159897743 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCCA ACCCTCAGAT CAGCAGCTTT CTGCCACGCT ATGTCCTGCA ACGGCTCATC GAAGAGCATC AACCGCGCCA GCCGCTCGCC GAAGATCAAC TGGCGACGGT GTTGTTTGCC GATTTTTCTG GGTTTAGTCG CCTCAGCGAG CAATTTGCCT ACGACCAAAC CACCATCCTC GAATATATTT CCGATGTACT GAACGACTCA TTTAGCCAAT TAATCGACAC CGTGATTGCC CATGGCGGCG ATGTGGTGAA GTTTGCTGGC GATGCATTAA TTGCAATTTG GATGGCCGAC GATCTTGATC AATTAACTCA AAATACCCAG CTTGCGGCTC AATGTGGCCT AGCGATTCAA GCAAGCTTTA ATAATCCTAG TGAATCTGAG CGCTTGGCGA TTCGGGTGCA AATTGGGGCT GGCTCGATTT CAAGTTTTAT CGTGGGTGGC GTGAATAATC ATTGGGAGCA ATTGATTACG GGCGAGGCGT TGTTACAAGT GCATTTGCTT GGCTCGCAAA GCTTTGCTGG TCAAGTGATC GTTTCGCCCG AAGCGCGTTC GTTAATTCGC GAATATGGGC TGGGCGAAGA TTTACGTGCT CAGGCTTTTC GGCTGACTGC AATGTCCGAA AATCTGCCCT TGCCAATGAT GAACCAAACA CTGCCCGAAT ATCCAACTGA GCAGTTGACT CCGTTTATGC CTACCGCGGT GATTGCTCGC ATTAATGCAG GTTTGCACGA ATGGCTGGCT GAGCTACGCC ATCTGAGCGT GATGTTTATT AATGTGCCAG CTTTGAGCTT TTTTACCACG TTAGAGCAAG TGCAAGCCGC AGTTAGTGCT TTGCAAACCG TTATTTTTCG CTACGAAGGC AGCATCGACA AACTATCGGT CGATGATAAA GGCGTAAGTT TGCTAGCGGC TTTTGGTTTG CCGCCACTCT CGCATCGCGA TGATGCTGAA CGAGCGGTGC GCTCGGCCTT AGAAGCCTCA GCCGCATTGA GCAAACTGAG TTTAACCCAC ACGATTGGCA TTGCCACTGG CCCAGCCTTT TGCGGCGAGA TTGGCAATAG CCAGCGCCGC GAATTTACCA TGATTGGTGA TGTCGTCAAT CGGGCTGCCC GTTTGATGGA AGCTAATCTT GCACCAATTC TCTGCGATCA AACCACCGCC CAAGCTAGCC AACAACATGT GCGCTTTCAA GTACTACCGC CAATTACGAT TAAAGGCAAA AGCCAGCCAA TTACGATCTA TCGCCCGCAA CAAACCAACC ATAGCCCCGA CCAAATTCGG CCTCGACGCT TGATTGGGCG ACGGCGCGAG CGTGGTCAAA TTGAGGCGCT ATTCAATCAA ACCCAACCAA GCAGTCAGCA TGTGGCGATC ACTGGCGATA GCGGTATGGG TAAATCGGCC TTGCTGTATG AGGCCGTTGA AATTGCTCAG CATTACGAAC GCCAAGCTAT GTTGATTACC AGCAGCAGCC TGCGTCAATC GGCTCAGTAT CCAGGCTGGC GTTTGTTGCT TGAGGCCTGT TTGGCGGTTG AGCAATGGCC GAATCAGGCG GTTCAATCTT ATCAATTAAT TATTCAGCGT TTGCAATTGC CTGACCAGCT CGCTAAATCG GTGCACTTGC TCCACGATGT TTTAGAGCTG CCAGGTCGCG CCGAAGCTAA CACCACCGAT CATGCCCAAC AGGTGCAAAT CATCCATGAG TTAATTGGGC ATGCCCTCTA TCAACTGCAT GCTCAACGCC CGTTAGTCTT ATGCATTGAT AATTTGCAAT GGTTCGATTC ATTAGCCTTG GCGACGCTGG AATTATTGTT GCAGCAGCAT GACGACATTA TTCTGATTAC AACTGCGCCG TCTGCTGTGG CATGTCTCGA ACCACAACAA ACCATCCATC TGCAAGCGCT TGATCCTGTG GCTTGTATCG CGGTGGTTGC TCAATCATTG GGTGTGCAGG CGATTCCTCC AAGTGTGGCT TTATTTATCA ATCAGCGTGC GGCAGGCCAT CCGTTGTGGA GCATCGAGCT AGCCCAAGCC TTGCGCAATG CAGGGATGAT TCGAGTCAAC AATGGGGTTT GCAAGCTTGA TCAATTTAGC CAGTTAGAAA AACTCAATCT CCCAAGCACA ATTCAGGGTG TGCTGGTCAG CCGCATCGAT CAACTGCCGC CGCAACCCCA ATTAACCCTC AAAATTGCTA GTGTGATCGG CCATGATTTT AGTTTGGCAG TGCTTGATGC GATTTATCCA GTGGCCCACG AGCGTGAGCA TATTCCAGCT CACCTTGACT TATTGCTGCA ACAAGGTTTT ATTCACGAGG CTGCCGAGGG CTATCAATTT AGCCAAGCAA TTATCCACGA TGTGGCTTAT TCATTGTTGT TGTTTGGTCA ACGACGAGCC TTGCATCGAG CGATTGCTGA ATGGTACACC CGCGAACATC CCCATTTGGT TGAGGCAGGC AGCAGCCTTT TGGCCCATCA CTGGAGCCAT GCGATCGATC CTGATGAGCC AGAAAGCCGT CAACCTGCGA TCGATGCCTT ACGGCGAGCA GGCGAACAAA CCCTTGTGCG CTGTAGCTAT CGCGAGGCGA TTCCGTTTTT CGAGCGAGCC TTGCATTTGT TGGCAATTGA TGATGATTTT GCCTCGCAGC AACAAATTGT GCGCATGCAA TTTAATCTGA GTCAAGCCCG CTGGCGGTTG GGTGACCACA GCATGGCCTT GACCAATTTG GATGCAGCAT TGGTAACCGC GCAACAAATT GGCGATGGGA TTGGCGAGGC CGATGTGCTG CGCCAATTTG GCAATATTGC CTATGTCCAA GGCGACCTTT ACACTGCCCA ACAACATTTT CTAGCCAGCG TCGCGCGAGC ACGCAAGGCC AATTATCCCA GCGGAATTAT TAGTGGGGTT AGTAATGTCG GCGTGGTGGC GTTTGCACGG GGCGATTATC AGGTTGCCCG CGAAGCTTAT CGTGATGGTT TAGCAATTTC GATCGAGCAA GGTCACGATT TTGGCATCGC CGTCAATCAG CTCAATTTAG GTGGTTTGGC GATTGTTGAA CAAGCTTGGG ATGAAGCACG CTCCTATTTG CAGCAAGCAT TGAGCTTGGG CTATGCCAAA CACATGACTT TAGTCTGTCT GCATAGTTTG GTGGCCTTGG CTGAATGGCG TTTAGCGACC AATCAAGCCG AGGCCAGTGC TAACTTGATT CAGATTGTGC TGCATCACGA AGCGATCGAT AGCGAAATTC ATGCAGCAAT TGATAAACTC AAACCCAAGC TGATTCAAAT TTTAGGCGAG ACCCAATGGC TGATTCTGAG CCAACGCCCA ACAACCCCGT TTGAACAAGT ACTGCCTGCG ATTATGCAAG AATTGGCTAC TGAAAAAGCC TAA
|
Protein sequence | MAANPQISSF LPRYVLQRLI EEHQPRQPLA EDQLATVLFA DFSGFSRLSE QFAYDQTTIL EYISDVLNDS FSQLIDTVIA HGGDVVKFAG DALIAIWMAD DLDQLTQNTQ LAAQCGLAIQ ASFNNPSESE RLAIRVQIGA GSISSFIVGG VNNHWEQLIT GEALLQVHLL GSQSFAGQVI VSPEARSLIR EYGLGEDLRA QAFRLTAMSE NLPLPMMNQT LPEYPTEQLT PFMPTAVIAR INAGLHEWLA ELRHLSVMFI NVPALSFFTT LEQVQAAVSA LQTVIFRYEG SIDKLSVDDK GVSLLAAFGL PPLSHRDDAE RAVRSALEAS AALSKLSLTH TIGIATGPAF CGEIGNSQRR EFTMIGDVVN RAARLMEANL APILCDQTTA QASQQHVRFQ VLPPITIKGK SQPITIYRPQ QTNHSPDQIR PRRLIGRRRE RGQIEALFNQ TQPSSQHVAI TGDSGMGKSA LLYEAVEIAQ HYERQAMLIT SSSLRQSAQY PGWRLLLEAC LAVEQWPNQA VQSYQLIIQR LQLPDQLAKS VHLLHDVLEL PGRAEANTTD HAQQVQIIHE LIGHALYQLH AQRPLVLCID NLQWFDSLAL ATLELLLQQH DDIILITTAP SAVACLEPQQ TIHLQALDPV ACIAVVAQSL GVQAIPPSVA LFINQRAAGH PLWSIELAQA LRNAGMIRVN NGVCKLDQFS QLEKLNLPST IQGVLVSRID QLPPQPQLTL KIASVIGHDF SLAVLDAIYP VAHEREHIPA HLDLLLQQGF IHEAAEGYQF SQAIIHDVAY SLLLFGQRRA LHRAIAEWYT REHPHLVEAG SSLLAHHWSH AIDPDEPESR QPAIDALRRA GEQTLVRCSY REAIPFFERA LHLLAIDDDF ASQQQIVRMQ FNLSQARWRL GDHSMALTNL DAALVTAQQI GDGIGEADVL RQFGNIAYVQ GDLYTAQQHF LASVARARKA NYPSGIISGV SNVGVVAFAR GDYQVAREAY RDGLAISIEQ GHDFGIAVNQ LNLGGLAIVE QAWDEARSYL QQALSLGYAK HMTLVCLHSL VALAEWRLAT NQAEASANLI QIVLHHEAID SEIHAAIDKL KPKLIQILGE TQWLILSQRP TTPFEQVLPA IMQELATEKA
|
| |