Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1333 |
Symbol | |
ID | 5733225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1543504 |
End bp | 1544751 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641278471 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001544106 |
Protein GI | 159897859 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.235826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATTA CCCTTAAGAC AATAGAATTA TTTGCTGGAG CTGGAGGCCT CGGGCTAGGG TTTCTTCTCG CAAATCATCC AGGTGTCAAT TTTAGGCCTT TATGTGCAGT CGATTTTAAC GTAGACGCAT GTACTAGCTA TAATATGAAT ATGCAATGGC TGCATCAGAA TGCTCCTCAT TTACAGACAA CACAAGCTTC TAAGGCTTAT CTGCGGAAAG TTGAATCCTT AAACGTTAAT GCAGTGAAGA GGCTTTTCCA GTTACAACAA GGTGATCTCG ATATTTTAAT GGGTGGTCCT CCTTGTCAAG GATATTCATC TTCAAATCGC CAGGCATCAA AAGAAACACG CGATGAACTT AATAATATGG TGAAATCCTT TCTTGATCGA GTTCAAGATT TTTCACCAAA AATGTTTCTC TTAGAAAATG TCCAAGGAGT CACATGGACT GCCTCGACTG ACGAAATGAG AATACCTAGT GAGCAATTAT CCTTTATAGA TAATGAAGAG ATTGCTGATG TTAAAGACTA TTTAGTTCAT AGAGCACGCG AGCTGGGTTA TCACATATGG TATTCGGTGC TTGATGCAGC GGATTTTGGT GTCCCTCAAC ATAGAAAACG ATTTTTTCTT TTTGGTATTC GTACAGACTT GACAACTGAC CCAAATATTC GGCTTGAAAA ATTTATCAAT CCTTATAGAA CGAGCACACT TACAACAGTT GCCCAAGCTA TTGAGGATCT TCCTGTTATT AATAATGGCG AGCATTGGAA AGGTAATAAC TATAATCCGG TGGCGAATGG GTATATCACT ATGATGCGTA GCTTTATGAA TAATAATGTT TTATTTGACC ACTTTACAAC AAATCATCAA GAATATGTTC TTGAGCGTTT CAGAAATATT CCTGAAGGCG AAAATTGGAA ATCTATAAAA AATATTATGA ATACGTATAA AAATGTAAAC AAAACCCATA GTAATATTTA TAGAAGATTA CAACGGAATG CCCCATCGCA TACTATTAGT CATTACCGCA AAGCAATGAC TATCCATCCT GTACAGAATA GAGGATTATC ATTTAGAGAA GCCTGTAGAT TGCAGTCTTT TCCAGACTGG TATCGATTTA GTGGAACAAG AGAAAGTGCC CAACAGCAAC TAGCGAATGC AGTGCCACCT TTGCTTTCGT CAAAGGTGGC ACTGGCTATC GCAGATTATT GGTTATCTCT GCCACATAAT GCTCTTATGA AAGATTAA
|
Protein sequence | MPITLKTIEL FAGAGGLGLG FLLANHPGVN FRPLCAVDFN VDACTSYNMN MQWLHQNAPH LQTTQASKAY LRKVESLNVN AVKRLFQLQQ GDLDILMGGP PCQGYSSSNR QASKETRDEL NNMVKSFLDR VQDFSPKMFL LENVQGVTWT ASTDEMRIPS EQLSFIDNEE IADVKDYLVH RARELGYHIW YSVLDAADFG VPQHRKRFFL FGIRTDLTTD PNIRLEKFIN PYRTSTLTTV AQAIEDLPVI NNGEHWKGNN YNPVANGYIT MMRSFMNNNV LFDHFTTNHQ EYVLERFRNI PEGENWKSIK NIMNTYKNVN KTHSNIYRRL QRNAPSHTIS HYRKAMTIHP VQNRGLSFRE ACRLQSFPDW YRFSGTRESA QQQLANAVPP LLSSKVALAI ADYWLSLPHN ALMKD
|
| |