Gene Information |
Locus tag | Cpha266_2524 |
Symbol | |
ID | 4569689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2888446 |
End bp | 2889552 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639767084 |
Product | DNA methylase N-4/N-6 domain-containing protein |
Protein accession | YP_912936 |
Protein GI | 119358292 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
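The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2888446, 2889552)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Both values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields listed in the record.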
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000139446 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCAAGT ATTTTACCAC AGAAGAGGCC GCTTTGTATC TGGGCGTCTC CTCAGCAAGA ATCCGGCAGT TTATTCTGGA AGAGCGCCTG CAGACTGATA AATCCGGCAG AGATCATCTC ATTGCGGAAT CTGTTCTTTC TGATTTTGCA AAGTTTGGCA GAAAAAAGGT AGGTCGTCCC TTCCTTGATT CGGGCAACAC AGATTTTGTT GGTGTAGAGT CAGAAAGAGC ATCAGCGTCA CACCTTCTCA TCAATAGAGA TCAGCTTGAT GAGGATGGTA TTCAGGTGAT CAATGGCGAT ACAAGGGATA TCATTAAAAG CCTTCCAGAT AACACGTTCA GATGCGTTGT TACCTCTCCG CCCTATTGGG GTGTAAGGGA TTATGGCGTT GAAAATCAGA TTGGAGCTGA GCCTGACCTT CAGGATTATA TCAAGGCTCT TGTTGAAATT TTCTCTGAAG TGCGGCGTGT CCTGCAACCA GACGGGACTT TCTGGCTGAA TATCGGTAAT ACGTATACTT CAGGCGGAAG AAAATGGCGG CAGGAGGATT CAAAAAATAA AGGCAGAGCT ATGTCGTATC GCCCGCCTAC ACCTGATGGG TTGAAGAAAA AAGACCTGAT CGGCGTAGCA TGGATGGTGG CAATGGCTTG CCAGCTTGAC GGGTGGTATT TGAGAAATGA CATCATCTGG CACAAACCGA ATTGCCAGCC TGAAAGTGTG AAAGACCGCT TAACGGTAGC TCATGAATAT CTTTTTATGT TCTCAAAATC AGAACAATAC TACTTTAATC AGGAGGCGAT CAAGGAGTCG TATACAAACG GAAACGGCTT CAAAAACAAG CGGACAGTAT GGTCTATAAA TACTGAATCT TGTGCTGAAG CTCATTTTGC GGTTTTCCCG AAAAATCTGG TAAGGCCATG CATATTAGCC GGATCAGAGG AGCGCGATTT GATTCTTGAT CCTTTCTATG GAGCCGGGAC GGTTGGAATT GTTTCGCAGG AACTCAACAG AAAATGTGTC GGTATTGAAA TTAATCCGGA TTATGTTTAC ATATCAAGCC GACGCAACGC CCGTGTACAA GGCGCACTTA TACTGCCGGA ATCGTAA
Protein sequence | MSKYFTTEEA ALYLGVSSAR IRQFILEERL QTDKSGRDHL IAESVLSDFA KFGRKKVGRP FLDSGNTDFV GVESERASAS HLLINRDQLD EDGIQVINGD TRDIIKSLPD NTFRCVVTSP PYWGVRDYGV ENQIGAEPDL QDYIKALVEI FSEVRRVLQP DGTFWLNIGN TYTSGGRKWR QEDSKNKGRA MSYRPPTPDG LKKKDLIGVA WMVAMACQLD GWYLRNDIIW HKPNCQPESV KDRLTVAHEY LFMFSKSEQY YFNQEAIKES YTNGNGFKNK RTVWSINTES CAEAHFAVFP KNLVRPCILA GSEERDLILD PFYGAGTVGI VSQELNRKCV GIEINPDYVY ISSRRNARVQ GALILPES
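The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so its first codons can be read directly against the start of the protein sequence. The following is a minimal sketch, not a full translator: the `CODONS` dictionary holds only the six codons needed for the first 18 bases, and `clean`, `gc_fraction`, and `translate` are hypothetical helper names introduced for illustration. A complete translation would need the full codon table for translation table 11.

```python
# Partial codon table: only the codons that occur in the first 18 bases.
CODONS = {"ATG": "M", "AGC": "S", "AAG": "K", "TAT": "Y", "TTT": "F", "ACC": "T"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str, table: dict) -> str:
    """Translate complete codons in frame 1 using the supplied codon table."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(table[s[i:i + 3]] for i in range(0, usable, 3))


prefix = "ATGAGCAAGT ATTTTACC"  # first 18 bases of the gene sequence
print(translate(prefix, CODONS))  # MSKYFT
```

The output matches the first six residues of the protein sequence (MSKYFT). Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field.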