Gene Caul_4367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4367 
Symbol 
ID5901828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4743157 
End bp4744233 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID641564885 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001685985 
Protein GI167648322 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.721631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTCG GACCCGAAAC CATCATCCAG GGCGACTGCA TCGAGGCGAT GAACGCCCTG 
CCCGAAAAGT CCGTGGACCT GATCTTCGCC GATCCGCCCT ATAATCTGCA ACTGGGCGGC
GACCTGCTGC GGCCCGACAA TTCCAAGGTC GATGCGGTCG ATGACCACTG GGACCAGTTC
GAGAGCTTCG CCGCCTACGA CAAGTTCACC CGCGAGTGGC TGAAGGCCGC CCGCCGGGTG
CTGAAGGACG ACGGCGCGAT CTGGGTGATC GGCAGCTATC ACAACATCTT CCGCGTCGGC
GTGGCCGTTC AGGACCTGGG CTTCTGGATC CTCAACGATG TGATCTGGCG CAAGTCCAAC
CCCATGCCCA ACTTCAAGGG CACCCGCTTC GCCAACGCCC ACGAGACCCT GATCTGGGCC
TCCAAGAGCC AGGGCGCCAA GCGCTACACC TTCAATTACG ACGCCCTGAA GATGGCCAAT
GACGAGGTGC AGATGCGCTC GGACTGGACC CTCCCGCTGT GCACCGGCGA GGAGCGCATC
AAGGGCGCGG ACGGGAACAA GGCCCACCCG ACCCAAAAGC CGGAAGCCCT GCTCTACCGG
GTGATCCTGT CGACCACCAA GCCGGGCGAC GTGATCCTGG ATCCGTTCTT CGGCGTTGGC
ACCACCGGCG CGGCCGCCAA GCGCCTGGGC CGCCGCTTCG TCGGCATCGA GCGCGAGAGC
GACTATATCG AGCACGCCAA GGCCCGCATC GCCAAGGTGG TCCCGATCGC CCCCGGCGAC
CTGGACGTCA TGGGCAGCAA GCGCAGCGAA CCGCGCGTGC CGTTCGGCAA TATCGTCGAG
GCCGGCCTGC TCAATCCGGG CGACACGCTC TACTGCGCCA AGGGCTCGCA CGCCGCCAAG
GTCCGGGCCG ACGGCTCGAT CACGGTGGGC GACCTGTCGG GCTCGATCCA CAAGGTCGGC
GCCCTGGTGC AATCGGCCCC GGCCTGCAAC GGCTGGACCT ACTGGCACTT CAAGACCGAC
AAGGGCCTGG CCCCGATCGA CGTCCTGCGC AGCCAGGTGC GGGCGGGGAT GAACTAG
 
Protein sequence
MSFGPETIIQ GDCIEAMNAL PEKSVDLIFA DPPYNLQLGG DLLRPDNSKV DAVDDHWDQF 
ESFAAYDKFT REWLKAARRV LKDDGAIWVI GSYHNIFRVG VAVQDLGFWI LNDVIWRKSN
PMPNFKGTRF ANAHETLIWA SKSQGAKRYT FNYDALKMAN DEVQMRSDWT LPLCTGEERI
KGADGNKAHP TQKPEALLYR VILSTTKPGD VILDPFFGVG TTGAAAKRLG RRFVGIERES
DYIEHAKARI AKVVPIAPGD LDVMGSKRSE PRVPFGNIVE AGLLNPGDTL YCAKGSHAAK
VRADGSITVG DLSGSIHKVG ALVQSAPACN GWTYWHFKTD KGLAPIDVLR SQVRAGMN