Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0080 |
Symbol | |
ID | 4570647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 94386 |
End bp | 95501 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639764682 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_910574 |
Protein GI | 119355930 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.35686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAC AATCGGAGCC TACTTGCTCA AAAATTCGTG TTTATGACTT TTTCTCAGGT TGTGGGGGAA CAAGTGTCGG TTTTGGGCGA GCAGGTATAC AGCACGCATT GGCTGTTGAT TCTTGTTCTG ACGCCATAAG TACCTATCAA AAAAATTTTA TTGGTGTTCC CGTCATAACT GATCCAATAG AAACACTTAA TGTTGACCGA ATACAAAATT ATTTCAGTCA TAACCCGGAA GTGAAGTTAT TCTGTGGCTG TGCACCATGT CAGCCGTTTA CCAAGCAGAA AACAAATACA AAAAAAGATG CGGCTTCGGA TGATAGACGC GGATTACTTA TATATTTCTC AGATATTGTT CATGCGTGTT TGCCTGAGCT TGTTTTTGTT GAGAACGTGC CTGGTTTGCA AAAATTTTCT CTTGAAGATG GCGGGCCCTT GGCTATGTTT ATAAGCCGAT TAAAGCAAAA CGACTACTTT GTCGATTTTG ATGTGATAGC AGCTCAGGAC TATGGTTCAC CTCAGGTTCG TAGACGGTTC GTGTTAATCG CAAGTAGGTT GGGAAAGATT ACTTTACCTG CACCAACGCA TGGCCCAAAT ACAAAGAATT CGTATGTAAC TGTTCATGAT GCCATTGGCA ACTTACCGTC CGTCAAACAT GGCACAGAAC ATCCAGACAA TCAAAATTAC CCTAATCACC GGGCCGCAAT GCTGTCAGCA TTAAATCTTG AGCGCATTAG ACACACTGGC GCGAACGGAC GGCGAGATTG GCCTGAAAGA CTATTGCCAA AATGTTATGC ACAAAAGAAA GACGGAAAAC GCTATGAAGG GCATTCGGAC TGTTATACTC GATTAGCATG GGGCGAACCC GCACCAGGGT TGACAACTCG TTGCATCAGT TATTCAAACG GTCGATTCGG ACACCCAGAA CAGGATCGTG CCATTACGAT CAGGGAGGCA GCAAAGCTGC AAGGATTTCC TGATGATTTC ATTTTTACTG GTTCACTTAA CTCTATGGCT CGCCAGATTG GCAATGCAGT TCCTGTGTCT GTCGCGGAGG TATTCGGGAG ACATTTTCTG AATCACGTTA AAGCCATGGA GAGTACAAAT GGCTAA
|
Protein sequence | MNQQSEPTCS KIRVYDFFSG CGGTSVGFGR AGIQHALAVD SCSDAISTYQ KNFIGVPVIT DPIETLNVDR IQNYFSHNPE VKLFCGCAPC QPFTKQKTNT KKDAASDDRR GLLIYFSDIV HACLPELVFV ENVPGLQKFS LEDGGPLAMF ISRLKQNDYF VDFDVIAAQD YGSPQVRRRF VLIASRLGKI TLPAPTHGPN TKNSYVTVHD AIGNLPSVKH GTEHPDNQNY PNHRAAMLSA LNLERIRHTG ANGRRDWPER LLPKCYAQKK DGKRYEGHSD CYTRLAWGEP APGLTTRCIS YSNGRFGHPE QDRAITIREA AKLQGFPDDF IFTGSLNSMA RQIGNAVPVS VAEVFGRHFL NHVKAMESTN G
|
| |