Gene Information |
Locus tag | Haur_2671 |
Symbol | |
ID | 5734566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3424400 |
End bp | 3425812 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279813 |
Product | DNA/RNA non-specific endonuclease |
Protein accession | YP_001545437 |
Protein GI | 159899190 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1864] DNA/RNA endonuclease G, NUC1 |
TIGRFAM ID | |
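
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention, but the listed values are consistent with it) and that the CDS is complete and ends in a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record above.
start_bp = 3424400
end_bp = 3425812
gene_length_bp = 1413
protein_length_aa = 470


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both checks agree with the listed Gene Length and Protein Length.
print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```
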
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0295423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGATC GTCGGCTGAT TACCCTGCTC GCGTTGTTCG TCCTTGGGTT TGTTGGGGCT AGCTTGGTAC AACCAACGCA GGCCAAAACC GTTAGTAGTG ATAACTTGGT CTTGGGCAAT CCCAGCGGGG CCGTCGCCAG CAGCAGCTAT CCGACCAATT ATTTAATTCA ACGCAACCAA TATGCGCTCT CGTATCACCG CGATAATGGG ATTGCCAATT GGGTCAGTTG GCACCTCGAT AGCGGCGATA TTGGTAGCGT TTCACGCAGC GATTTTCAAA CCGACACTAG CTTGCCCAGT GGTTGGTATC GCGTTGCCAC TGGCGATTAC AGCGGCAGCG GCTATGATCG CGGCCATATG ACTCCCTCAG GCGATCGCAC CGCCACCACC GCCGATAACC AAGCCACCTT CTACATGACC AACATTATTC CCCAAGCGCC CGATAACAAC CAAGGCCCAT GGGTTGACCT CGAAACCTAT GCTCGCGAGT TGGTCAGCGC TGGCAACGAG TTGTATATTA TCAGCGGCGG GGCTGGCTCA CGTGGCACAA TCGCCAGTGG CAAGGTGCGA ATTCCGAATT CAACCTGGAA AATCATCGTC GTGCTTAGCC AAGGCAGTAA CGACCTCAGC CGCGTCAGCA ACAACACCCG CGTCATCGCG ATCAACATGC CCAATGTGCA AGGTATTCGC GATAACGATT GGCGCGATTA TCTGACCACG GTTGATGCTC TCGAAAGCTT GACGGGCTAT AACTTCCTTT CAAATGTCTC AACCAACATC CAAAATGTGA TTGAAGCCCG CGTCGATGGC TCGACCACAC CAATTCCGAC AGCTGTACCA ACCTCGGGAA CCAACCCAAC CGCCACTCCA GTACGCACGG CCACCCCAAC CCCCAGCACC GGCTGTACAT CGAGCCGCCT GTTCTTCTCA GAATATGTCG AAGGCAGCAG CAACAACAAA GCTTTGGAAC TTTACAATAA TACTGGAGCC AGCGTCAGCC TCAGTGGCTA TAGCATTCAG TTGTATGCCA ACGGCTCGAC CAGCGCCAGC AGTAGCGTGA ATTTGAGTGG CTCGGTCGCC AATGGCGCAA CCTATGTGAT TGCCAACGCC TCGGCATCAA GTAGCGTACA GAATCTTGCT AACATCACCA GCAGTGTGGC CAACTTCAAT GGCAATGATG CACTTGTGCT GACCTACAAT GGCACGGTGG TTGATAGCTT TGGCCAAGTT GGCAACGACC CAGGTAGCAG CGGTTGGGGT GGCACAACCA CCGATCGCAC GTTGCGCCGT AAAGCAACAA TCAGCGCAGG CGATACCAAT CGCAGCGATA GCTTCACCCC AAGCAGCACC TGGGATAGCT ATAGCCTTGA TACATTCAGT GGCTTGGGCA ACCACAGTGT CAGTTGCCCA TAG
|
Protein sequence | MKDRRLITLL ALFVLGFVGA SLVQPTQAKT VSSDNLVLGN PSGAVASSSY PTNYLIQRNQ YALSYHRDNG IANWVSWHLD SGDIGSVSRS DFQTDTSLPS GWYRVATGDY SGSGYDRGHM TPSGDRTATT ADNQATFYMT NIIPQAPDNN QGPWVDLETY ARELVSAGNE LYIISGGAGS RGTIASGKVR IPNSTWKIIV VLSQGSNDLS RVSNNTRVIA INMPNVQGIR DNDWRDYLTT VDALESLTGY NFLSNVSTNI QNVIEARVDG STTPIPTAVP TSGTNPTATP VRTATPTPST GCTSSRLFFS EYVEGSSNNK ALELYNNTGA SVSLSGYSIQ LYANGSTSAS SSVNLSGSVA NGATYVIANA SASSSVQNLA NITSSVANFN GNDALVLTYN GTVVDSFGQV GNDPGSSGWG GTTTDRTLRR KATISAGDTN RSDSFTPSST WDSYSLDTFS GLGNHSVSCP
|
| |
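
The gene and protein sequences are shown in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips the spacing and translates the first three blocks of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in the start codons it permits). The helpers `clean` and `translate` are hypothetical names, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and the first block of the
# protein sequence, copied from the record above.
gene_prefix = clean("ATGAAGGATC GTCGGCTGAT TACCCTGCTC")
protein_prefix = clean("MKDRRLITLL")

print(translate(gene_prefix))                    # MKDRRLITLL
print(translate(gene_prefix) == protein_prefix)  # True
```

The same two helpers can be applied to the full gene sequence to compare it against the full protein sequence.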