Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3043 |
Symbol | |
ID | 5734915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3843996 |
End bp | 3845123 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280187 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_001545809 |
Protein GI | 159899562 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00264807 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATT TAACCTTGAA CCACGATCAA ATCCTTTTAG GTGATTGTCG TGATGTTCTT CCATTACTGC CACCAGCCAG CGTTGACCTC ATTTTCGCTG ACCCGCCCTA TAATTTGCAA TTGCGTGGCG ATTTATTACG CCCCAACATG ACGCATGTTG ATGCAGTTGA TGATGATTGG GACTCATTTC GTGATTTTGC TGCCTATGAT GCGTTTACAC GTGCTTGGCT GCAAGCATGT CAGCGTGTTT TAAAAGATAA TGGCACAATG TGGGTGATTG GTAGTTATCA CAATATCTAT CGGGTTGGTA CAATTTTACA AGACCTTGGC TTTTGGATTT TAAACGATAT TGTTTGGATT AAGCGTAATC CAATGCCAAA TTTTCGTGGT GTACGCCTAA CCAATGCCCA TGAGACATTA ATTTGGTGTG CGAAATTGCC AGGCCAGAAG TATACCTTTA ATTATCATGC CTTGCGCCAT TTGAACGACG ATAAGCAAAT GCGCAGCGAT TGGGAATTTC CGCTGTGTAC TGGCAACGAA CGCCTGCGGA TCAACGGCAA CAAAGTGCAT AGCACCCAAA AGCCCGAAGC GTTGCTCTAT CGAGTATTAT TGGCAAGCTC GAATGTTGGT GATGTGGTGC TTGATCCATT TTTTGGCACG GGCACGACGG GCGCGGTAGC CAAACGTTTG GCGCGTCACT ACATTGGCAT CGAGCGTGAT CCCAGCTATG TTGAAGCAGC GCGAGGCCGG ATTGCCGCGA TTGAGTCGCC TAGCAGCACC GATGCCCTGC AAGCCTTGCC AAGCAACAAA CGGCGGATTC CACGGATTCC GTTTGGCAAT TTGTTGGAGC ATGGCTTGTT GCAAGCGGGC CAACAATTAT GGTTTAACCG CGATCCAAAC TTGGTTGCCA CGCTGTTGGC TGATGCTTCG CTGCGCATGT CCGATGGCAC ACGCGGATCG ATTCACAAGC TTGGTACAAT TTTGACAGGC CAACCAAGTT GCAATGGCTG GGAACATTGG TTTTTTCAGG CGAGTGATGG TACTTTAACT TCGATTGATG TGTTGCGCCA AGAGGTGCGG CGTTTACGCG AACAAACTCC AAGCGCCGAT GATTTAAGTG AGTTATGA
|
Protein sequence | MADLTLNHDQ ILLGDCRDVL PLLPPASVDL IFADPPYNLQ LRGDLLRPNM THVDAVDDDW DSFRDFAAYD AFTRAWLQAC QRVLKDNGTM WVIGSYHNIY RVGTILQDLG FWILNDIVWI KRNPMPNFRG VRLTNAHETL IWCAKLPGQK YTFNYHALRH LNDDKQMRSD WEFPLCTGNE RLRINGNKVH STQKPEALLY RVLLASSNVG DVVLDPFFGT GTTGAVAKRL ARHYIGIERD PSYVEAARGR IAAIESPSST DALQALPSNK RRIPRIPFGN LLEHGLLQAG QQLWFNRDPN LVATLLADAS LRMSDGTRGS IHKLGTILTG QPSCNGWEHW FFQASDGTLT SIDVLRQEVR RLREQTPSAD DLSEL
|
| |