Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2920 |
Symbol | |
ID | 5734791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3693623 |
End bp | 3694552 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280063 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_001545686 |
Protein GI | 159899439 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT CAGAAGCGCA ACAGCCCCAC CGTCGCAACC AGTTAAACGA TTTATCGGGG CGCGAGTGGC TTTATGCAAC GCGTTCGATT GCGCTAACTG GCACGCCACC AACGCCACTC AACGATTTAA CTGCTGAAGA ATGGGCGGCT TGGTCAGCCC CAATTTTAGC CAGCGCTTAC CCAACCCGTG GCCCTGCTTC GGCTGCCCAT CATATTCGCA AAGCCCATCC ATCGCCTAAA CCACCAGCCT TGCTTAGCGA AATTATTCGT TTTTTTACCA AGCCCAACGG TTGGGTGCTC GACCCATTTG CTGGGGTTGG CGGCACGCTG CTGGCTTGTG CCCAAACCCA ACGCCAAGCA ATCGGCATCG AATTATCGCC GCACTATATC GATTTGTATG GACAGGCGGC TGATGCACTG AATTTGGCTC GTTTGCCAGT TTTGCAAGGC GATGCCCGTG AAGTACTCAA CAGCCCAAGC ATCCAGCAAC AACGCTTTGA TTTGGTGTTG ACCGACCCAC CCTATAGTTC AATGCTCAGT CGAGCACGCA CTGGTCAGCG TCGCAAACAG GGTAACGGCG CAGCCACGCC CTTCACCGAT GATCCCGCCG ATCTTGGCAA TGTCGATTAT CCAACCTTTC TGCGTGAATT GACTTCAATT GTGGCGCAAA CCTTGCAATC GTTGCGGGTT GGCGGCCATT TGGTGTTGTT TGTCAAAGAT CTGCAACCAA CCCCAGAGCA TCATAACATG CTCCATGCCG ATATCGTGTC AGCGTTGTAT CAATTATCGC AGCTGCGCTA TCGCGGCTAT CGCATCTGGT ACGATCAAAG TGCAATGCTC TATCCATTTG GTTATCCCTA TAGTTTTGTT GCCAATCAGG TGCATCAATT TTTGTTAATT TTTCAGAAGC AAGCAGAGGG TGAATTATGA
|
Protein sequence | MSDSEAQQPH RRNQLNDLSG REWLYATRSI ALTGTPPTPL NDLTAEEWAA WSAPILASAY PTRGPASAAH HIRKAHPSPK PPALLSEIIR FFTKPNGWVL DPFAGVGGTL LACAQTQRQA IGIELSPHYI DLYGQAADAL NLARLPVLQG DAREVLNSPS IQQQRFDLVL TDPPYSSMLS RARTGQRRKQ GNGAATPFTD DPADLGNVDY PTFLRELTSI VAQTLQSLRV GGHLVLFVKD LQPTPEHHNM LHADIVSALY QLSQLRYRGY RIWYDQSAML YPFGYPYSFV ANQVHQFLLI FQKQAEGEL
|
| |