Gene Information |
Locus tag | Haur_2256 |
Symbol | |
ID | 5734143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2881809 |
End bp | 2885696 |
Gene Length | 3888 bp |
Protein Length | 1295 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641279397 |
Product | TPR repeat-containing adenylate/guanylate cyclase |
Protein accession | YP_001545024 |
Protein GI | 159898777 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
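
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2881809
end_bp = 2885696

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported 3888 bp and 1295 aa.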
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCAT TATTGCCATT GCTTCCCTCA CCATTGCTCA GACTTCTGAA CGCCTCTGAC CAGCCAAACA CATTAATGGC CGAACAGCAG TTTAACGCGG TTGTCTTATT TGCTGATATT GCAGGGTTTA CCACATTGAG CGATCAGGTG CGCGACGTTG GTGCTCGTGG GACTGAACAG CTAACTCTGC TGTTAAATCA TGTTTTTGAG GCGATGATCG ACTTGATTCA GGCAGCGGGC GGGATGGTTT GGGGCTTTAG TGGCGATGCA TTAACTGCGC TTTTTCGGTA TGAAACTGAG AATCAAGCGG CGATTGCCCA ACAAGCCTTA ACGTGTAGTC TTGCAATGCA AGCAGCCATG GCCCAATTTC AGACAAGTGC TATGCTTGAT GAGCTGGCAA CTATTCGCTT AACCATGAAA ATTGGTTTGG CTCATGGTGT GCTTATGTGT GGTATGGCAG GTATTCCCCA CCAGCGATTA GTCGCATTAT TGGCTGGCCA GCCTTTAATT GAAAGTAGTT TGGTTGAACA CCAAGCAAGC GAGGGCTCAA TTGGCTTAAC CACTGAGCTT TGGCTGGTTG TTCAGAGCTA TGCTCAGGCA CAATTGCGCG ATGGTTATTA TTGGTTGCAA GGCTGCGAAA CGGGAGTTGC AACTGCTTAT CCATCATTAG CAGCGCCGCC ACTGCCTACA TGGGCTGAAA GCTTTATTCA CCCCCTAATT GCCAAACGGC TGCACGATAA TCAAGCTGCG TTCATTAATG AACATCGGCT GGTCTATGTT TTATTTGCCC AATTATGTAT CGAACAACCA GCCCATGAGC TTGCGATCAT TCGTGAATGG GCGAGCATGC TTACAACAGT TAGTGCTCAG TATGATGGTT ATCTTAGCCA TGTCGCAGTT GGTGATAAAG GCATCACGAG CATGATTTTG TTTGGAGCAC CAATTGCTCA CGAAGATGAT CCTTTGCGCA TTTTAGATTG TGGCTTGGCT GTACGTGAGC AAACCAGCCA AATGGGCTAT CAATCGGCTT TAGGCTTAAA TAGTGGTGTA GTGTATTGTG GGCTGGTGGG TTCAGTTATT CGCCGCGATT ATACAACGAT TGGCGATGCA ATTAATGTGG CGGCCCGCTT GATGGAGCAT GCTAAGGCTG ATCAGATTTT GGTGAGTATG GAGCTTGCCA AGCTGGCCCA AACCCAATTT CGTTGGCAAC TCTTACCTGA TGCTCGACTC AAAGGCAAAC CACAACCAAT TGCCCTGATG GCCTTACAGC AACGAGCTTT TGAAGCCTCA GTCCGCTTGC CTGAATTTGT TAGTAATCTG CGATTAATTG GGCGTGCCAG CGAATTAACC ATGTTATTGC AAGCAACCGA GCAAACCTTG ACTGGTCATG GGCAAATTAT AGCGATTGTT GGCGATGCTG GGATCGGAAA ATCGCATTTG ATTAATACAT TCATCCAACA ACTTGACGCT GAGCGCTGGA ATATTTATCG TGGCGAGTGC GAGGCCTATG GTGAAAATAG CCCTTATTTG GTCTGGAATG CAATTTGGCG AGCCTTCTTC TGTATTGAGC CAATGTGGGA TCTGGCTACC CAAATTCGGG TTTTGAGCTT CCAATTAGAA CTAATCGATC CAAGTTTACT CAATCGCCTA CCGTTACTGG GAACGATCTT CAACCTAATG ATTCCACCCA ACGAAACGAC AGCCTTACTT GAACCCCAAC TGCAAAAACT GGCTCGTGAA GCGATTTTGG TTCAATGCTT ACGAACTCGC GCTACCAATC AACCAATGAT CTTAGTATTA 
GAGGATTGTC ATTGGATCGA CCCATTGTCG GCGGATTTAC TATGGGCGAT TAGTCAAATA ATTGAAGCGT TACCAATTTG TATTATTTTA GGATTTCGCC CATCGATACA GAATGAAGTA ATTGCTCAGC GCTTACCGCA ATTGACGTAT TGGCATCGCT TGAACCTTAA TGAATTTAAT CAGAGCGAAG CGAGCCAATT GGTTGCTTAT AAAATGCAGC ATTGGCCTGG CGAGCACCAT ATCTCAAGTC TATTAGCCGA CCATTTAATT AATCAGGCCG AGGGAAATCC TTTTTACTTG GAGGAATTAC TCAATTATGT GTATACGCAA GGCTTAGATC TCAATGATGC TGCGGTTATT GGTAAATTAC AACTTCCTGC AAGTTTATCG AGCTTGATTC TAAGCAGAAT TGATCAGCTT AATCAGCGCC AGCAATTGAC AATTAAAGTT GCGAGTATTA TTGGTCGGTT GTTTAAATTG CACTGGTTAT GGGGTGTTTA TCCTCAGCTG GGTGAGCATC AGACAATTCA AACCGATCTT GAAACGCTTG ATCGACTGGA GTTAACAATT AAATATAGTT TTGAACCAGA GGTTGCCTAT ATTTTTAAGC ATATGTTGAC CTATGAAGTA ACCTACGAGA GTTTAACCTA TGCGACACGC TCAACCTTAC ATGAGCAATT TGGCCATTTT CTTGAGCAAC ACTATGCTGA TGATCCCGCC TATCTTGATC TGATTTGCTA TCATTTTATG CGTAGCGATA ATCGTAGTAA GCAGATTGAA TATTTATGGA AGGCTGCTGA TGCTGCCCAA CGATCCTATG CCAATGAAGC AGCCTTATTG TTTTATCAAC AATTATTGAG TTTATTAGCG GACGTTGAGC ATTGGTCAGT GTTGTTGCCT ATTGGTGAGA TTCTCCAAAT TATTGGTAAG CCGCATGCAG CGATTGAAAC CTATCAAACG ATTATTAATG GGCTATTAGC TCCTGATGCG GCTATTGGTA AAAGCCATTG GAGCATTGGC AAGATTTATG GCGAACTTGG ACAATTTCGT CAAGGCTTGG ATTGGCTTGA ACAAGCTCGA ACCCGTTATT TAACGCGAGG CAATGCGGAA GGTGTTGCCG AAGTGTTAAT CGATATTGCG AATATTCTGT GGCAACAAGG CCATTATGAG CAAGGTCTTG CTCATGTGGA GCAAAGCCTT GCGCTTTGGC GACAGTTATC TAACTCGTTG GGAACAGCTC GCGCCTTGTT TCAATTAGGA GTGATTTTAT CCGATCAACG GCGATATACC GAGGCTTATC ATGCCTTAGA GCAAAGTTTA GAGCTACGGC GTGAGGCCGG AGACTTGTTC GCGATGGCGA GTTCGCTGAA CGATCTCGGC ATTATTGCTT TTGATCGTGG TGATCATACG ACAGCTGAGC AATTGTATAC CGAGGCTTTT ACTATTCGCC GTGATTTGGG TTTTGTCCGT GGGATGGCTC AATCGTTAAG CAATTTAGCC AATGCAGTGT TTGTTTCGGG TGATTATCAG CGAACTCGTC AGTTGCTTGA GGAAGCTTTA GTTTATCGGC GACAAGTCGG CCATCAACGC GGGATTGCCA TCTCACTGGC GCATTTGGGC AATACATACG CGGCTTTAGG GGATTTCAAA GCGGCTTGGA TGCATCATCG TGATGCATTT ACGATTCGTT GCGCGATTAA TCATCGTTTA GGAATTGCTC AATCGCAGGT AGCAATGGGC TTTTTGGCAC TGCGAACAGA CGATTTTTAT CAGGCTTATG GCTTGTTCCA GCAAAGTATT CATGGTTTTT 
TAGCACTTGA TGATCAGCGT GGATTGGCTG AAAATTTGGT CGGGTTTGGC TGTGTCGCGG CTGGAATGCT TAAATATCGG CTTGCACATC AGTTTATGCT CGCAGCTGAA ACAATTATTG CAAGCCTCGA TACGATTTTT GAGCCAGAAT TTCGCGATGG TCATGCTTGG TTGAAACGCC AATTAGATCA AAATTCATCC GACGTGGTTG GTTTAAGTGA GTCAAACACT GCGCGTGCTA TCGCAACACT GCTGAACTTG AGTTATCATT TGTATTAG
|
Protein sequence | MSSLLPLLPS PLLRLLNASD QPNTLMAEQQ FNAVVLFADI AGFTTLSDQV RDVGARGTEQ LTLLLNHVFE AMIDLIQAAG GMVWGFSGDA LTALFRYETE NQAAIAQQAL TCSLAMQAAM AQFQTSAMLD ELATIRLTMK IGLAHGVLMC GMAGIPHQRL VALLAGQPLI ESSLVEHQAS EGSIGLTTEL WLVVQSYAQA QLRDGYYWLQ GCETGVATAY PSLAAPPLPT WAESFIHPLI AKRLHDNQAA FINEHRLVYV LFAQLCIEQP AHELAIIREW ASMLTTVSAQ YDGYLSHVAV GDKGITSMIL FGAPIAHEDD PLRILDCGLA VREQTSQMGY QSALGLNSGV VYCGLVGSVI RRDYTTIGDA INVAARLMEH AKADQILVSM ELAKLAQTQF RWQLLPDARL KGKPQPIALM ALQQRAFEAS VRLPEFVSNL RLIGRASELT MLLQATEQTL TGHGQIIAIV GDAGIGKSHL INTFIQQLDA ERWNIYRGEC EAYGENSPYL VWNAIWRAFF CIEPMWDLAT QIRVLSFQLE LIDPSLLNRL PLLGTIFNLM IPPNETTALL EPQLQKLARE AILVQCLRTR ATNQPMILVL EDCHWIDPLS ADLLWAISQI IEALPICIIL GFRPSIQNEV IAQRLPQLTY WHRLNLNEFN QSEASQLVAY KMQHWPGEHH ISSLLADHLI NQAEGNPFYL EELLNYVYTQ GLDLNDAAVI GKLQLPASLS SLILSRIDQL NQRQQLTIKV ASIIGRLFKL HWLWGVYPQL GEHQTIQTDL ETLDRLELTI KYSFEPEVAY IFKHMLTYEV TYESLTYATR STLHEQFGHF LEQHYADDPA YLDLICYHFM RSDNRSKQIE YLWKAADAAQ RSYANEAALL FYQQLLSLLA DVEHWSVLLP IGEILQIIGK PHAAIETYQT IINGLLAPDA AIGKSHWSIG KIYGELGQFR QGLDWLEQAR TRYLTRGNAE GVAEVLIDIA NILWQQGHYE QGLAHVEQSL ALWRQLSNSL GTARALFQLG VILSDQRRYT EAYHALEQSL ELRREAGDLF AMASSLNDLG IIAFDRGDHT TAEQLYTEAF TIRRDLGFVR GMAQSLSNLA NAVFVSGDYQ RTRQLLEEAL VYRRQVGHQR GIAISLAHLG NTYAALGDFK AAWMHHRDAF TIRCAINHRL GIAQSQVAMG FLALRTDDFY QAYGLFQQSI HGFLALDDQR GLAENLVGFG CVAAGMLKYR LAHQFMLAAE TIIASLDTIF EPEFRDGHAW LKRQLDQNSS DVVGLSESNT ARAIATLLNL SYHLY
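
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so no reverse complement is applied, and it does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest, matching the
# conventional genetic-code listing. Table 11 uses these same assignments
# for internal codons; only its set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCATCAT TATTGCCATT GCTTCCCTCA"))  # MSSLLPLLPS
```

The first ten residues produced this way match the start of the listed protein sequence, and the final codons of the gene sequence (TTG AGT TAT CAT TTG TAT TAG) translate to LSYHLY followed by a stop, matching its end.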