Gene Information |
Locus tag | Haur_0528 |
Symbol | |
ID | 5732445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 614213 |
End bp | 615277 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641277655 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001543304 |
Protein GI | 159897057 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.555926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGGCG CAGTCATTGA TCTATTCTGT GGAGTTGGTG GGCTAACCCA CGGGCTTATT CTTGAAGGTT TTGGCGTACT GGCAGGGATT GATAACGATC CTTCTTGTAA GTATGCCTAT GAGCAAAATA ATAGAACTCG TTTTATTGAA AAGTCTATTA CTGAAGTTGA TGGCAGAGAG TTAAATGCCC TCTATCCCAA TAATCAACAT AAAATCTTAG TAGGCTGTGC CCCATGCCAA GATTTTTCTC AATATACGAA GAAGAGTCGT ACTGGAACAA AGTGGCAATT ACTCACAGAA TTTTCGCGCC TTATTAGAGA GATCGAGCCT GATATTATAT CAATGGAAAA TGTTCCTGAG GTTCGAACAT TCAATAGAGG GGAAGTTTTT AACAATTTTA TTCAAGCACT TGAACAATTA GGGTATCACG TTTCGCATAG CGTGGTGCAT TGTCCTGATT ATGGAATTCC GCAGCAACGT GACCGACTAG TCTTATTTGC TGCTAAACAG GGCGTTATTA AGATTATACC CCCAACCCAC ACTCCTGAGA ACTATCGAAC TGTACGTGAA GTTATTGGTT CTCTGCCACC AATTACTGCT GGCGGACACT GGGAGGGTGA TAGCATGCAT GCAGCCAGCA GACTAGAAGA TATAAACCTT CGACGTATTC AGCATTCTGT GCCTGGTGGT ACTTGGGCCG ATTGGCCGGA AGAATTGATT GCAGAATGTC ATAAAAAGGA AAGCGGTGAG AGTTATGGGA GCGTCTATGG TCGAATGGAA TGGGATAAAG TAGCACCTAC CATCACTACA CAATGCAATG GGTATGGTAA TGGTCGCTTT GGTCATCCAG AGCAGGATCG CGCTATTTCG CTCCGTGAGG CTGCCTTACT TCAGACATTT CCGCGAAGTT ATCAATTTGC CCCTGAAGGC CAACTGAAAT TTAAGACAGT TAGTCGTCAA ATAGGGAATG CTGTTCCGGT CGCACTAGGT CGTGTTATTG CAAAAAGTAT TAAGCGTTTT TTGGAGGGTT TACATGAGCG ACAGCGGGTA CGAATTATCA TTTAG
Protein sequence | MVGAVIDLFC GVGGLTHGLI LEGFGVLAGI DNDPSCKYAY EQNNRTRFIE KSITEVDGRE LNALYPNNQH KILVGCAPCQ DFSQYTKKSR TGTKWQLLTE FSRLIREIEP DIISMENVPE VRTFNRGEVF NNFIQALEQL GYHVSHSVVH CPDYGIPQQR DRLVLFAAKQ GVIKIIPPTH TPENYRTVRE VIGSLPPITA GGHWEGDSMH AASRLEDINL RRIQHSVPGG TWADWPEELI AECHKKESGE SYGSVYGRME WDKVAPTITT QCNGYGNGRF GHPEQDRAIS LREAALLQTF PRSYQFAPEG QLKFKTVSRQ IGNAVPVALG RVIAKSIKRF LEGLHERQRV RIII
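The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon, which is consistent with the values listed above. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(614213, 615277)
print(length, protein_length(length))  # prints "1065 354"
```

Passing the Gene sequence field to `gc_percent` (the spaces between 10-base blocks are stripped) gives a value that can be compared with the GC content field.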