Gene Information |
Locus tag | Caul_4756 |
Symbol | |
ID | 5902218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5141119 |
End bp | 5142246 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641565275 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001686374 |
Protein GI | 167648711 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
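The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts a single terminal stop codon; neither convention is stated in the record, though both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 5141119
END_BP = 5142246
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```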
| |
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0971048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGGA ACGCCCCCCC TTCCTTCTAC GAATTCTTCG CCGGCGGCGG CATGGCCCGC CTCGGCCTGG GCGAGGCCTG GACCTGCGCC TTCGCCAACG ACTTCGATCC GGTCAAGGCC GCGACCTATC GCGACAACCA CAAAGACGCC GCGACGCATT TCCACGAAGG CGACGTCTGG AAGATCGCCG CCGCCGACCT GCCGGGCCAA GCCGACCTGG CCTGGGCCTC CAGCCCCTGC CAGGACTTCA GCCTGGCCGG CGCCCGGGCG GGCCTGCAGG GCGGGCGCTC CTCGGCCTTC TTCGGCTTCT GGAAGCTGAT GGAGGCGCTG AACGCCGAAG GCCGCGGCCC CGACACCGTC GTGGTCGAAA ACGTGGTCGG CCTGCTGACC TCGCACGGCG GCGCCGACTT CACGGCCCTG TGCCGGGCCC TGGCCGACCA GGGCTACCGC TTCGGCGCGC TGGAGATCGA CGCCGCGCGC TTTGTCCCGC AGTCGCGACC TCGGGTGTTC GTGGTCGCCA CGCGCCTGCC GGTCGGGGAC CTCGCCGGCC CCAGCCCCTT CCAGACCCGC GCCGTGCGCG AGGCCCATGC CCGCCTGCCC CCCGACCTGC AAGGCTTGTG GACCTGGTGG GGCGTCGATG CGCCGCCGGC TCGCAACACC GACCTGACCG CCCTGCTCGA ACCCGACGAC GCCGTGACCT GGCGATCCGA AACCGCGACC CAGTCCCTGC TCCACCTGAT GGCGCCCCTG CATCGCGCCG CCGTAGAGGC CCGTGTCGAG AGCGGCCAAC GCGCGGTCGG GGCGGTGTTC CGGCGGATGC GCGACGGCGA GCAGCGGGCC GAGGTGCGGT TCGACGGCTT GGCCGGCTGT CTGCGCACCC CGCGCGGCGG CTCCTCGCGC CAGACGCTGC TGGTGATCGA CGGCGGGCAG GTCCGCTCGC GCCTGCTCAG TCCCCGCGAG GGCGCCCGGC TGATGGGCCT GCCCGAAACC TATCGCCTGC CCAAGAGCGC GACCGCCGGC CTGCACGTGA TCGGCGACGG CGTGGCCGCG CCGGTGGTGC GCTGGCTGTC GGAGCGGCTT TTGACTCCTC TCGCCCTCAG GAAAACTGCG CCGCTCGCCG CCGAATGA
|
Protein sequence | MNRNAPPSFY EFFAGGGMAR LGLGEAWTCA FANDFDPVKA ATYRDNHKDA ATHFHEGDVW KIAAADLPGQ ADLAWASSPC QDFSLAGARA GLQGGRSSAF FGFWKLMEAL NAEGRGPDTV VVENVVGLLT SHGGADFTAL CRALADQGYR FGALEIDAAR FVPQSRPRVF VVATRLPVGD LAGPSPFQTR AVREAHARLP PDLQGLWTWW GVDAPPARNT DLTALLEPDD AVTWRSETAT QSLLHLMAPL HRAAVEARVE SGQRAVGAVF RRMRDGEQRA EVRFDGLAGC LRTPRGGSSR QTLLVIDGGQ VRSRLLSPRE GARLMGLPET YRLPKSATAG LHVIGDGVAA PVVRWLSERL LTPLALRKTA PLAAE
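The relationship between the two sequences can be illustrated with a short sketch that strips the 10-base spacing, computes a GC fraction, and translates codons into residues. The helper names are hypothetical. The codon table used here is the standard one; translation table 11 assigns the same amino acids to codons and differs mainly in which start codons it permits, so this sketch does not model alternative starts. Translation stops at the first stop codon, and a codon containing an unexpected character is reported as `X`.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, third base over "TCAG".
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    # Any trailing partial codon is ignored.
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGAATCGGA ACGCCCCCCC TTCCTTCTAC"
print(translate(prefix))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content field of the record.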