Gene Caul_0324 details

Gene Information

Locus tag: Caul_0324
Symbol:
ID: 5897598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 369308
End bp: 371281
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 52%
IMG OID: 641560808
Product: N-6 DNA methylase
Protein accession: YP_001681959
Protein GI: 167644296
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
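
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon counted in the gene length but not in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 369308
end_bp = 371281

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1974 0 657
```

Under these assumptions the result agrees with the reported gene length of 1974 bp and protein length of 657 aa.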


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00528654
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGTCAG TAGCGACGAC CAGCCCGCCT TCTAGGCCTG TGGCTGTGAT TGTGCAGCAG 
GGCAAGGTGC TCGACTTCAT CGATAGCCAG ACCCAACGTG AGGAGACGCC AGAAGAATAC
GTTCGCCAGG AGATCGCCAA GTCGCTCGTA CGCGAATACG GCTATCCTAA AAGCGATATT
GCTGTCGAAT TCGTCTTGCG GCTGGGAAGC CGCAAACCTC GGGCCGACTT GGTTATATTC
AGTCACTCCG ATGAGCAGCA ACAGCCCCAC GCCTATATTA TCGTGGAGTG CAAAGAGCAA
AGAGTGAAGT CCAGCGACCG AAAAGAAGGC GTCGGGCAGC TCCACAGCTA TATGGCCGCG
TGTCCTAATG CCATCTATGG CATGTGGACG AATGGACTAG AGCGCTTTTG CTACCGGAAG
GTCGAAAGTG GTGGAAAATT ACTTTTCGAA GAAATTCCCG ACCTCCCAAG CTTCGGTCAA
TCGAGCGAAG ACGCAGACCG TCCCGAGTTT GATCAGCTTA AGCCAGCCTC GTCGGATGCA
CTTCTCTTTG CATTTCGCCG ATGCCACAAC TACATCGCCG GAAATCAGGG CCTCCAAAAG
CCTCAGGCAT TCTGGGAGTT GCTTAAGCTA ATTTTCTGCA AAATTCATGA CGAGCGCGAC
AGCCCCGAAG TCGAATTCTT CGCGTCGGCA AATGAGCGGA CGGGCATCAA CGGTCCGCTG
AAAGTGAAAA AGCGGATTGA CGGGCTTTTT GATGCCGTCA AGGAAGATTA TCCCGCCATA
TTTCAGGCGA ATGACGTCGT CGCGCTGAAG CCACCTGTCC TCGCTTACAT CGTATCTCAA
CTGCAGATGT ACTCTCTACT CGAGAGCGAT GTCGACGTTA AAGGTCATGC CTATGAAGAA
ATAGTGGGAT CAAACCTTCG TGGTGACAGA GGCGAATTCT TCACGCCGCG CAACATTTGC
AACATGGCTG TCTCTATGCT CGACCCGAGC GAGGGGCAGA CAATCCTGGA TCCGGCCTGC
GGAACGGGCG GCTTTCTTAT TTCGGCGATG AACCACGTCA TCGAGAAAAT TCGGGTTGCA
GAACTCGAAA AGTGGAAAGG CGACTATGGA AGGGCGGACC CGAAGATAGC CGCTCGGATT
TCGAAGTTCG CTGGCGCTTG CATCGTTGGT CTCGACTTCA ACCCGGAATT AGTTAAAGCA
ACCAAGATGA ATATGGTCAT GAACAATGAT GGAGCTGGCG GGCTCTATCA GGCCAATTCA
TTAGAAAGCC CAGCCACATG GGAAGAAGCA CTCAGGGACA GAAAGTTGAT CGGGTCTGTC
GATCTGATTT TCACCAACCC ACCGTTTGGT TCGAAAATAC CCGTCGATGA CCCAGCAATC
CTGGAGAAGT ATGATCTCGG CCACTCCTGG TCGTACAATG AAGAAATCGA TTCATGGACC
ATGAATGAAT CCATTCAGAA GTCTCAACCT CCTGAGATTC TGTTCATCGA GCGCTGCGTA
AAATTCCTGA AGCCTGGAAC CGGCCGAGTC GCAATGGTTC TGCCCGACGG AATACTGGGC
TCACCTGGGC TGGGCTATGT TCGCGAATGG ATTCTCAAGA ATACCTGGGT CCTGGCGTCA
ATCGACCTGC ACCCTGATAC CTTCCAGCCA AATGTCAGTG TCCAGACAAG CGTGTTGGTC
CTCCAAAGAA AGACCGACGA ACAGATCGCC CTTGAAGACG CCGCAGGCCG AAAGAACGAT
TACAACGTCT TTATGGCGGT CGCCAATCAT ATAGGCCACG ACAAGCGTGG AAATAAGACG
TACGTGCGCG ATAGAAAAGG CAATGAAATA GTCGAGGAGA TTGAGGAGGA CACTAAAGAG
TATATCGATG GACAGCCAAT TTATAAGAAG CAGAAGACCC AAAGAAAAGT CTCCGACGAT
AATACTCTTC AGATTGCGCA GGCATTCCGC ACATGGCTCG TAGAGCAAGA CTAA
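
The gene sequence is printed in groups of ten bases, so the grouping has to be stripped before the sequence can be measured. The following is a minimal sketch of that cleanup and of a GC content calculation; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input. Run on the full 1974 bp sequence, the result can be compared against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here as sample input only.
first_line = "ATGGGGTCAG TAGCGACGAC CAGCCCGCCT TCTAGGCCTG TGGCTGTGAT TGTGCAGCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```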
 
Protein sequence
MGSVATTSPP SRPVAVIVQQ GKVLDFIDSQ TQREETPEEY VRQEIAKSLV REYGYPKSDI 
AVEFVLRLGS RKPRADLVIF SHSDEQQQPH AYIIVECKEQ RVKSSDRKEG VGQLHSYMAA
CPNAIYGMWT NGLERFCYRK VESGGKLLFE EIPDLPSFGQ SSEDADRPEF DQLKPASSDA
LLFAFRRCHN YIAGNQGLQK PQAFWELLKL IFCKIHDERD SPEVEFFASA NERTGINGPL
KVKKRIDGLF DAVKEDYPAI FQANDVVALK PPVLAYIVSQ LQMYSLLESD VDVKGHAYEE
IVGSNLRGDR GEFFTPRNIC NMAVSMLDPS EGQTILDPAC GTGGFLISAM NHVIEKIRVA
ELEKWKGDYG RADPKIAARI SKFAGACIVG LDFNPELVKA TKMNMVMNND GAGGLYQANS
LESPATWEEA LRDRKLIGSV DLIFTNPPFG SKIPVDDPAI LEKYDLGHSW SYNEEIDSWT
MNESIQKSQP PEILFIERCV KFLKPGTGRV AMVLPDGILG SPGLGYVREW ILKNTWVLAS
IDLHPDTFQP NVSVQTSVLV LQRKTDEQIA LEDAAGRKND YNVFMAVANH IGHDKRGNKT
YVRDRKGNEI VEEIEEDTKE YIDGQPIYKK QKTQRKVSDD NTLQIAQAFR TWLVEQD
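
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below builds the codon table from the NCBI base ordering (TCAG) and amino acid string for that table, then translates until the first stop codon. The function name `translate` is hypothetical, unknown codons are mapped to `X` as an assumption, and the alternative start codons of table 11 are not handled because this gene starts with ATG. Only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Base order and amino acid string for NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # assumption: X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both begin on a codon boundary.
first_line = "ATGGGGTCAG TAGCGACGAC CAGCCCGCCT TCTAGGCCTG TGGCTGTGAT TGTGCAGCAG"
last_line = "AATACTCTTC AGATTGCGCA GGCATTCCGC ACATGGCTCG TAGAGCAAGA CTAA"
print(translate(first_line))  # MGSVATTSPPSRPVAVIVQQ
print(translate(last_line))   # NTLQIAQAFRTWLVEQD
```

Both fragments match the corresponding start and end of the protein sequence listed above.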