Gene Caul_4756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4756 
Symbol 
ID5902218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5141119 
End bp5142246 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content73% 
IMG OID641565275 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001686374 
Protein GI167648711 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0971048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGGA ACGCCCCCCC TTCCTTCTAC GAATTCTTCG CCGGCGGCGG CATGGCCCGC 
CTCGGCCTGG GCGAGGCCTG GACCTGCGCC TTCGCCAACG ACTTCGATCC GGTCAAGGCC
GCGACCTATC GCGACAACCA CAAAGACGCC GCGACGCATT TCCACGAAGG CGACGTCTGG
AAGATCGCCG CCGCCGACCT GCCGGGCCAA GCCGACCTGG CCTGGGCCTC CAGCCCCTGC
CAGGACTTCA GCCTGGCCGG CGCCCGGGCG GGCCTGCAGG GCGGGCGCTC CTCGGCCTTC
TTCGGCTTCT GGAAGCTGAT GGAGGCGCTG AACGCCGAAG GCCGCGGCCC CGACACCGTC
GTGGTCGAAA ACGTGGTCGG CCTGCTGACC TCGCACGGCG GCGCCGACTT CACGGCCCTG
TGCCGGGCCC TGGCCGACCA GGGCTACCGC TTCGGCGCGC TGGAGATCGA CGCCGCGCGC
TTTGTCCCGC AGTCGCGACC TCGGGTGTTC GTGGTCGCCA CGCGCCTGCC GGTCGGGGAC
CTCGCCGGCC CCAGCCCCTT CCAGACCCGC GCCGTGCGCG AGGCCCATGC CCGCCTGCCC
CCCGACCTGC AAGGCTTGTG GACCTGGTGG GGCGTCGATG CGCCGCCGGC TCGCAACACC
GACCTGACCG CCCTGCTCGA ACCCGACGAC GCCGTGACCT GGCGATCCGA AACCGCGACC
CAGTCCCTGC TCCACCTGAT GGCGCCCCTG CATCGCGCCG CCGTAGAGGC CCGTGTCGAG
AGCGGCCAAC GCGCGGTCGG GGCGGTGTTC CGGCGGATGC GCGACGGCGA GCAGCGGGCC
GAGGTGCGGT TCGACGGCTT GGCCGGCTGT CTGCGCACCC CGCGCGGCGG CTCCTCGCGC
CAGACGCTGC TGGTGATCGA CGGCGGGCAG GTCCGCTCGC GCCTGCTCAG TCCCCGCGAG
GGCGCCCGGC TGATGGGCCT GCCCGAAACC TATCGCCTGC CCAAGAGCGC GACCGCCGGC
CTGCACGTGA TCGGCGACGG CGTGGCCGCG CCGGTGGTGC GCTGGCTGTC GGAGCGGCTT
TTGACTCCTC TCGCCCTCAG GAAAACTGCG CCGCTCGCCG CCGAATGA
 
Protein sequence
MNRNAPPSFY EFFAGGGMAR LGLGEAWTCA FANDFDPVKA ATYRDNHKDA ATHFHEGDVW 
KIAAADLPGQ ADLAWASSPC QDFSLAGARA GLQGGRSSAF FGFWKLMEAL NAEGRGPDTV
VVENVVGLLT SHGGADFTAL CRALADQGYR FGALEIDAAR FVPQSRPRVF VVATRLPVGD
LAGPSPFQTR AVREAHARLP PDLQGLWTWW GVDAPPARNT DLTALLEPDD AVTWRSETAT
QSLLHLMAPL HRAAVEARVE SGQRAVGAVF RRMRDGEQRA EVRFDGLAGC LRTPRGGSSR
QTLLVIDGGQ VRSRLLSPRE GARLMGLPET YRLPKSATAG LHVIGDGVAA PVVRWLSERL
LTPLALRKTA PLAAE