Gene Caul_5352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5352 
Symbol 
ID5897156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp61573 
End bp65838 
Gene Length4266 bp 
Protein Length1421 aa 
Translation table11 
GC content70% 
IMG OID641550644 
Productmethylase/helicase 
Protein accessionYP_001672130 
Protein GI167621622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.686845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGA CATTCCCCCT CTTCAGCGCC CTTCAGACCC CAGGGGCCGC CCAGGACAAG 
GCCGCCAATA CGCTGGCCGC CGCCCGGGCG CTCACCCCGC ACCTCAATCG GTCGCGGGCG
CTGGATCGAA AGCTGGTGGC CAGCACCATG ACCATCTCCT TCGGGGGCTC GGACGCCGAG
GGCGCCTGGT CGTGGCGCGA TGCCTACGAC GCCATCGAGG CGGCCCTTGT CCTGCAGCTG
CGTCGGCTCA GCCCGCAGAT TGGCCGGCTC GAGGACGCTC CGGCGCAGAT CGCGGCGCTG
CTGGCCAGCG TCACCGACCT GACCTTGACC CACACCCGCC GCAGCGAGGA GCAGGTCGCG
CTCGACCAGT TCTCCACGCC GCCGGCCTTG GCCGCCTTGG CGGTGCTGGC CGCCCAGGTC
CGGCCTGGCG ACAAGGTGCT CGAACCGTCG GCCGGCACCG GCCTGATGGC GATCATCGCC
GAGGCGTGCG GGGCGACGCT GGAGCTCAAT GAGATCGCGC CGCACCGTTC GGCTCTGCTC
GATGGCCTGT TCCCGGCCGT CTCGCGCACC CGCCATGACG CGGCGCACCT GAAGGACATC
CTGCCCTCGT CGGGCAGCTT CCAGGCGGTG ATCGCCAATC CGCCGTTCCA ACGGCTGGAG
GGTCATCTGC ACGCCGCCAT CGACTGCCTT GCCGAAGGCG GACGGCTCAG CGCCATCGTA
CCCACGCGCC TCTTCGAGGA CGCCGGGGCG ATGCAGGCGC TCGCCCGCCG CGGCCGGGTC
GTGGCGTTGA TCGCCTTCCC GCCCCGGGCC TATGCCAAGC ACGGGACGTC CGTCGAGACC
GGCCTGCTGG TCTTGGACTG CGTCGAAGAG TGCGGCGGCG TCCCCGTCAA GGTCGTCAGC
GGCGAGACCC TGGCCGATAT CGCCAAGCTG ATCGCCGCGC TGGATCCGCG CCCGACCGCC
CAGCCGCGCC AGTTCCGATC GGTCTCCAAC GTCGCCTTCC TGGCGCCGCG CGCCCGCGCC
CTGGCCACCC CCTCGACCCG CCTGGGGTTC CTGGCCGGGG CGTCGCCGGT GGCGTTCGAG
ACAATCGCCT GGTCGGGCGA GGGCCACGAC GTCGGCCTCT ACCAAGCCTA CCGGGTCAGC
CGCCTGGACT TGCAGGGCAG CCGTCCACAC CCCTCCGCGC TGGTGGAGTC CGGCCCCATG
GCGTCGGTGG CGCCGCCGGC CCCGACCTAT CGCCCGCTCC TGCCGACCAA CATCGTGCAG
GAGGGGCTGC TGTCCGACGC CCAGATCGAG ACAGTGATCT ACGCCGGCGA GGCGCAATCG
GGGCTTCTGC CGGGCTGGTG GGTGCTCGGC GAAGCGCCGC ACAAGCTGGT GCTGGTCAAG
GAGGGCGCCC CCGGCGCCTT CCAGCTGCGG CGCGGCTTCT TCCTCGGCGA CGGCACCGGC
TGCGGCAAGG GCCGCCAGGC CGCCGGGGTC ATCGCCGACA ACATGGCCCA GGGCCGCCTG
AAAGCGGTGT GGATTTCCAG GAACGACACG CTTCTGGAGG ACGCACGGCG GGACTGGACG
GCGATCGGCG GCTCCGCCAC CGACATCACC CCGCAGAGCG CCTGGAAGCA GGCCGACGCG
ATCCGCATGG ACCGGGGGGT CCTGTTCACC ACCTATGCCA CCCTGCGTCA GCCGGCGCGG
GGGATGCGTC TTTCGCGCCT CGACCAGATC GTCGGCTGGC TGGGCGCCGA CTTCGACGGG
GTGATCGCCT TTGACGAAGC CCACGCCATG GCCAACGCGG CGGGCGGCGG CCAGGGGGCT
CGTGGACCCA AGAAGGCCTC GCAACAGGGG ATGGCGGGCC TGGCCCTGCA GAACAGGTTG
CCCAATGCGC GGGTGATGTA TGTCAGCGCC ACGGGTGCGA CCACGCCCGA GAACTTGGCC
TATGCCGCCC GTCTGGGCCT GTGGGGCGGG CCGGAGGCGC CGTTCAACAC CCGCGACGCC
TTCATGGACG CCGTCGAGAA GGGCGGCGTG GCGGTCATGG AGTTGATCGC CCGCGAACTG
AAGGCGCTCG GCCTCTACAT CGCCCGGTCG CTGTCGTTCG ACGGGGTGGA ATATCAGGCC
CTGCGCCATG TGCTGACCGG CGATGACGTC GAGATCTGGA ACGCCTGGGC CGACGCCTAC
CAGTTGATCC ACCAGAACCT GCGCGCCGCC CTGGAGGCGG TCGGCATCAC GCAGGACGGC
AAGGCCAAGA GCGGTCAGGC CGCTTGCGCG GTCATGTCGG CCTTCGAGGG CGCCAAGCTG
CGCTTCTTCG GCGCGCTGCT GGCGGGCCTG AAGTCGCCGA CCCTGATCGC GGCGATCCGC
GACGACCTCG TCCAGGGACG TTCCAGCGTC GTGCAGATCG TCTCGACCAA CGAGGCGGTG
ATGGAGCGCC GGCTGGCGCA GATCCCGCCC GAAGAGTGGA ACAACCTGAC CATCGATCTG
ACCCCCAAGG ACCAGGTGCT GGACTATCTG ATGGGGGCCT TCCCTATCGC CGCCATGGAG
GCGATCGATG ATAAGGAAGG CAATGTGACG ATGCGCCCGT TGATCGTGGA TGGTCAGCCG
GTCGTCAGCC AGGAGGCCCT GCGCCTGCGG GAGGCGCTGG TCGTCCACCT GGCCTGCCTG
CCCGCCGTTC CGGGTGTGCT GGATGCGGTC CTGGGCGCGC TCGGCCCGGA CAACGTCGCC
GAGATCACCG GCCGCTCGCG CCGGGTCGTC CTGCGCGATG GCCGCCGGGT GGTGGAGCGT
CGCAGCGCGT CGAGCGCCAA GGCCGAGACC GACGCCTTCA TGAGCGGCAA GAAGCGGGTG
CTGGTGTTCT CCGACGCCGG CGGCACGGGG CGCAGCTACC ACGCCGACCT GAACTGCGCG
AACCAGGACC GTCGCCGGCA CTATCTCTGC GAGCCCGGCT GGCGGGCCGA CGCGGCGATC
CAGGGCCTTG GCCGCTCGCA CCGGACCAAC CAGGCCAGCG CGCCGCTGTT CTGTCCGGTC
ACGACCGACA TCCACGGCGA GAAGCGGTTC ACCTCGACGA TCTCGCGGCG GCTGGACAGC
CTGGGCGCCC TGACCAAGGG TGAGCGCCGA ACAGCCGGCA ACGGCCTCTT CCGGCCCGAG
GACAATCTCG AAAGCCCCTG GGCGCACCGC GCACTGCAGG CCTTCTACGT CGCCTTGCAT
TGGGGGAACG TGCCGGCGAT GGACCGGGTG ACCTACGAGC AAAAGACCGG GCTTCAGCTC
CTCGACAGCG ATGGCCAGTT GAAGAAGGCC GAGGACCTGC CGCCGATGAA CACCTGGCTC
AACCGGCTCC TGGCGCTGCG GATCGAGGAT CAGAACGCCT TGTTCGAGAC CTTCGACGCG
GTGCTCACCA GCATCCTCGA ACGGGCCGCC GCCTCGGGCG CTCTGGACAA GGGCATGGAG
GACATCGTCG CCGATGACCT GACGGTGACG TCGGAGGAGG TGATCCGGAC TGACGCGGTG
TCGGGCGCGC AGACCAAGGT CGTGACCTTT GCCGTCCGCA CACGCCGCGT GCTGGCGTCG
GCGGCGGATG CGCTGGCCGG GCTCGATCCG CAAACGCTGG AGTATGTGGT CAACACCAAG
TCGCAGCGCG CCGGCCTGGT GGTGAAGGGC CTGACGACGA CCGACGACGA TGACCGTCTG
GTTCAGGCCG TCCGGCTGAT CCGCGCCGAG AAGGCCGCCG TGCTGCCCCT GAAGACCTAT
GAGGAATCGG CCTGGGAGGT GGTCGCCGAA CCCATCTGGC GCGCGACCTG GGACGCCGAG
GTCGCCGGCG CCGATCCCTG GCACACCCGC CAGCTGGCCC TGGTCACCGG TCTGCTGCTG
CCGGTGTGGA GCAGTCTGCC TAGCAAGCGG ACCTTCGTCC GCCGGCTGAA GGCTCCGGAC
GGGCGGCGCT GGCTTGGGCG GGTTCTGGGC CCCGCGGATG TCACCAAGCT GAAGATCGCG
CTTGGGATCA GCGACGTCGC CACGGCCGTC GGCTCTGGCA ACAACGCGGC CAGTATGGTC
CTGGGCGAGA ACATCTCCAT CGCCCTGGCG GGCGGGTTCT GGCTTCGCCG GGCCAAGGTG
ATGGACCGCT ACCGGCTTGA GGTTGTCGGC GCCGGCTCCC AGCGCGCGAT GTTCCAGGCG
CTCGGCTGCT TTGTCGAAAT CATCAACTAC ACCCCTCGCG TCTTCGTGCC TGTCGACCAG
CCACAGGTGC TCTGCGCCGT CCTGGCCAAG TGGCCAGCCC AGACCATCCT GCCGGCGGCG
GCGTAG
 
Protein sequence
MNATFPLFSA LQTPGAAQDK AANTLAAARA LTPHLNRSRA LDRKLVASTM TISFGGSDAE 
GAWSWRDAYD AIEAALVLQL RRLSPQIGRL EDAPAQIAAL LASVTDLTLT HTRRSEEQVA
LDQFSTPPAL AALAVLAAQV RPGDKVLEPS AGTGLMAIIA EACGATLELN EIAPHRSALL
DGLFPAVSRT RHDAAHLKDI LPSSGSFQAV IANPPFQRLE GHLHAAIDCL AEGGRLSAIV
PTRLFEDAGA MQALARRGRV VALIAFPPRA YAKHGTSVET GLLVLDCVEE CGGVPVKVVS
GETLADIAKL IAALDPRPTA QPRQFRSVSN VAFLAPRARA LATPSTRLGF LAGASPVAFE
TIAWSGEGHD VGLYQAYRVS RLDLQGSRPH PSALVESGPM ASVAPPAPTY RPLLPTNIVQ
EGLLSDAQIE TVIYAGEAQS GLLPGWWVLG EAPHKLVLVK EGAPGAFQLR RGFFLGDGTG
CGKGRQAAGV IADNMAQGRL KAVWISRNDT LLEDARRDWT AIGGSATDIT PQSAWKQADA
IRMDRGVLFT TYATLRQPAR GMRLSRLDQI VGWLGADFDG VIAFDEAHAM ANAAGGGQGA
RGPKKASQQG MAGLALQNRL PNARVMYVSA TGATTPENLA YAARLGLWGG PEAPFNTRDA
FMDAVEKGGV AVMELIAREL KALGLYIARS LSFDGVEYQA LRHVLTGDDV EIWNAWADAY
QLIHQNLRAA LEAVGITQDG KAKSGQAACA VMSAFEGAKL RFFGALLAGL KSPTLIAAIR
DDLVQGRSSV VQIVSTNEAV MERRLAQIPP EEWNNLTIDL TPKDQVLDYL MGAFPIAAME
AIDDKEGNVT MRPLIVDGQP VVSQEALRLR EALVVHLACL PAVPGVLDAV LGALGPDNVA
EITGRSRRVV LRDGRRVVER RSASSAKAET DAFMSGKKRV LVFSDAGGTG RSYHADLNCA
NQDRRRHYLC EPGWRADAAI QGLGRSHRTN QASAPLFCPV TTDIHGEKRF TSTISRRLDS
LGALTKGERR TAGNGLFRPE DNLESPWAHR ALQAFYVALH WGNVPAMDRV TYEQKTGLQL
LDSDGQLKKA EDLPPMNTWL NRLLALRIED QNALFETFDA VLTSILERAA ASGALDKGME
DIVADDLTVT SEEVIRTDAV SGAQTKVVTF AVRTRRVLAS AADALAGLDP QTLEYVVNTK
SQRAGLVVKG LTTTDDDDRL VQAVRLIRAE KAAVLPLKTY EESAWEVVAE PIWRATWDAE
VAGADPWHTR QLALVTGLLL PVWSSLPSKR TFVRRLKAPD GRRWLGRVLG PADVTKLKIA
LGISDVATAV GSGNNAASMV LGENISIALA GGFWLRRAKV MDRYRLEVVG AGSQRAMFQA
LGCFVEIINY TPRVFVPVDQ PQVLCAVLAK WPAQTILPAA A