Gene Information |
Locus tag | Haur_2644 |
Symbol | |
ID | 5734524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3391775 |
End bp | 3392863 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641279786 |
Product | C-5 cytosine-specific DNA methylase |
Protein accession | YP_001545410 |
Protein GI | 159899163 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | |
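The coordinate and length fields in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3391775
END_BP = 3392863


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 1089
print(length_aa)  # 362
```

Under these assumptions the computed values match the reported gene length (1089 bp) and protein length (362 aa).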
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAA TTACGAATCA CTATACACGT AATGTGGGGA TGCAAGTTCA AGAGCGGATT AATCATCTTG GTATTGGGCA AAAAATGCAG GATCTACCCG AACACCTTTG GCATGAAAGT TTTCGCTATT ATGTTAAAGA AGATCCATCA CGTCAAGGTG GGCCAAATCT ACGGATGATT CGCTTAGATC CTAATAAGCC ATCGTTAACA GTTACTGCCT TTATCTTCAA TAAATTTGTC CATCCTTTTG AAAATAGATT TATTACAACT CGTGAGGCTG CACGATTACA AGGCTTTCCT GACAACTTCG AATTTATTGG CTCATTAACT AGTATTCAAC GTCAAATCGG TAATGCAGTT CCAGTTCCAT TAGCAACGGC TATATTACAA ACAATCTTAC AGCATGCAAA AGCTCATCAT CCAGAAAAAA GCTTATTTTC AGCATTAAGT ATTTTCAGTG GGGCAGGTGG GCTTAATTTA GGTGCTCAAC AGGCCAAATT ACCCAATGCA AAATGGCAAA CAATTGCATC AATTGATATC GATCGTGATG CATGTACCAG TCTTGAACAT CATTTCGCGA ATAAAAATGT GATTTGTCAA AATATTATTG ATATTACTCA ACCTAAATCA TTAATGAGCC AACCACTCGA TCTATCATAT GGTGGGCCTC CTTGTCAATC ATTCAGTCAG GCTGGTAAAC AAAAAGGGTT ATCCGATCCT CGTGGAAATT TAATTTACGA GTTTATTCGT TTTCTAAGTG ATCTTAATCC AAATTACTTT TTGTTAGAGA ACGTAAAAGG TTTACAGGGA ATTAATAATG GCCAATTATT ACATCTGATA ATCGATGATA TTCGTAAGCT TGGGTATAAC GTAACATTCG GTGTAGTCAA TGCTGCTGAT TATGGAACAC CGCAGCTTCG CAAAAGAATT ATTATCCTAG GTTGTAAGCA AGATTTAGGA TTTGTAAATC TGCCGCTACC AACGCATGCG ACTGAGGCTA ATTTGCTATT ACAGCCCTAT AAAACAGTTG GTCAAGCTTT GCAAAATCTA CCACCCGCCT TAGTTTGGCA AAAAACAACC ACCAAGTAA
|
Protein sequence | MSKITNHYTR NVGMQVQERI NHLGIGQKMQ DLPEHLWHES FRYYVKEDPS RQGGPNLRMI RLDPNKPSLT VTAFIFNKFV HPFENRFITT REAARLQGFP DNFEFIGSLT SIQRQIGNAV PVPLATAILQ TILQHAKAHH PEKSLFSALS IFSGAGGLNL GAQQAKLPNA KWQTIASIDI DRDACTSLEH HFANKNVICQ NIIDITQPKS LMSQPLDLSY GGPPCQSFSQ AGKQKGLSDP RGNLIYEFIR FLSDLNPNYF LLENVKGLQG INNGQLLHLI IDDIRKLGYN VTFGVVNAAD YGTPQLRKRI IILGCKQDLG FVNLPLPTHA TEANLLLQPY KTVGQALQNL PPALVWQKTT TK
|
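The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any downstream use. A minimal sketch of that cleanup, together with a simple GC-fraction calculation, follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the value reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, as displayed in the record.
sample = clean_sequence("ATGTCAAAAA TTACGAATCA CTATACACGT")
print(len(sample))                    # 30
print(round(gc_fraction(sample), 2))  # 0.3
```

Passing the full gene sequence through the same two functions would give a figure that can be compared with the reported GC content.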
| |