Gene Caul_4852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4852 
Symbol 
ID5902314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5247270 
End bp5250269 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content74% 
IMG OID641565372 
Productdouble-strand break repair protein AddB 
Protein accessionYP_001686470 
Protein GI167648807 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.173842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGG CCGGCCCCTT CGACCGCCCC CTCTTCGGTC GTCCGGGGCC GCGCTGGTTC 
TCGATCCCCG CCCATCGGCC CTTCGTCGAG GACCTGGCGC GCGGCCTGCT GAGCGCCCTG
GCGCCGCTGG GTCCCGAGGC CCTGCCCGCC GCCACCGTCC TGACCCCGAC CCGGCGCGGG
GCCCGAGCCT TGGCCGACGC CTTCGTGGCG GCCGGCGGCG GCAAGGCCCT GCTGCTACCG
CAGATCCGCC CGCTGGGCGA CCTGGACGAG GGCGAACCGC CGTTCGAGCC GGGCGAGCTG
AGCCTGGACC TGCCGCCCGC CGTCTCCTCG CGCCGCCGCC GGTTCGAACT GGCCCGGCTG
GTCGCAGAAC ACGCCCCTCT GCTGTCGTTC CAGCCTCAGG CGGGGCAGGC GTTGGAGATG
GCCAAGGCCC TGGCCGACTT CCTCGACAGC TGCCAGATCG AGGAGGTGGT CGCCGACGAC
GGCCGCCTGG ACAGCCTGGC CGAAGGCGAC CTGGCCCAGC ACTGGCAGGT CTCGGCGCGG
TTCCTGAAGG CCGTCCTGAC CGCCTGGCCC AAACGGTTGG ACGAACTGGG CCTGATCGAC
GTCAGCGACC GCCGGGTGCG GCTGCTCAAC GCGCTGTCCG ACCAGTGGGA GCAGAACCCG
CCCCAGGGGG TGCTGGTCGC CGCCGGCTCG ACCGGCACGG CCCCGGCCAC CGCGCGCCTG
CTCGGCGTCA TCGCCAACCT GCCCCAGGGC GCGGTGGTGC TGCCCGGCCT CGATGACGGC
CTGGCCGACG ACGCCTGGGA CAAGATCGCC GGCGTCCAGG GCGAACAGCA TCCGCAGGGG
GCGATGAAGC GTCTGTTGGA CGGAGCCGGC GCGACCCGCG CCGACGTCCG CCCCTGGTGG
CCCGAGGCCG ACAGCCGGGG CCGCTGGCGC CGCCGCCTGA TCAACGAGGC CCTGCGCCCG
GCCGAGGCCA CCGCCGACTG GCTGGCCCAG ATCGATCGCC TTCGCGAGGA GGCTCCTGGC
CTCGATCCGA TCGCCGAGGG GTTGAAGGGC CTGTCCCTGG TCAGCACCCG CACCGAGGAG
GAGGCCGCCG TCGCCTGCGC GCTTCTCTTG CGCGAGGCCC TGGAGACCGA GGGCCTGACC
GCCGCCCTGG TCACCCCCGA CCAGGAACTG GCCCGCCGGG TCGGCGCCCG CCTGATGCGC
TGGGGCGTGA TCCCCGACAG CTCGGCCGGC GCGCCCCTGG CCGCCAGCCC CGCCGCCATC
CTGGCCCAGC ATGTCGCCGC CCTGGCGGTC GATCCGCTCG ACCCCGTGCG CCTGCTGGCC
GTGGCCAAGC ATCCCCTGCT GCGCGCCGAC GCCGTGGCCG CCCGCGACCT GGAACTGAAG
AGCTTGCGCG GCCCCGCGCC GCGCGACGCG GGCCAACTGC TGGGCAAGCT GAAGGACCAT
CCCGACGCCC TGGCCCTGGC CGAGCGCGTC CTGGCCGCCG CTCGCCAGGC CGCCGCGCCC
TATGTCGATG ACCAGGCCCC GCCATCCGTC GCCACCCGCG CCCTGGTCGA GAGCCTGGAG
GCCCTGGCCG ATCCCGCCGA CCTGTGGGCC GGGTCGGCCG GCGAATGCCT GGCCGGCCTG
TTGTCGTCGC TAATCACCGA CGGCGTCGTC CTGCCGCCCG CCTCGGCCCT GGGTTTCGCC
GACCTGCTCG ACCGGCTGGT CAACGAAGAG ACCCTGCGCG TCGGCGGCGC CACCCACCCG
CGCCTGCGGA TCTTCGGGGC CATCGAGGCG CGGATGGTGC GGGCCGACCG GTTGATCCTG
GCCGGCCTGG AGGAAGGGAT CTGGCCCAAG AACGCGCCGA TCGACCCGTT CCTGTCGCGG
CCCATGCGCG CCAAGCTGAA CCTGCCGCCG CCCGAGCGCC GCATCGGCCT GACCGCCCAC
GACTTCGCCC AGGCGGCCTG CGCGCCCGAC GTGATCCTGG TCCACAGCGA GCGACGCGGC
GGGGCGCCGG CCGTCGAGAG CCGCTGGCTG TGGCGGCTGA AGACCCTGGC CCGCGGCGCC
GGCCTGCGCC TGACCGAGCG CCCCGACGTC CTGGCCTGGG TCCGCGATCT GGACGCCGCC
GGTCTCTATG ACCCCATCAA GCGCCCCGCG CCGACGCCGC CGGTCGCCGA CCGACCGCGC
AAGATGGCCG TCACCCGGGT CGAGGCCCTG ACCCGCGACC CCTACGCCGT CTGGGCGCGC
GACATCCTCA AGCTCTATCC GCTCGATCGC CCCAACGAGC CGGTCGAGGC CCGGGCGCGC
GGCACGGCGA TCCACGCGGC GTTCGAGACG TTCGCGCTGC AACACCCCGG CCCCGTGCCC
GCCAACGCCG CCGAGATCTT CGCTGGCCTG TACCTGTCCG AACTGGTCGC CGCAGGCATG
CCGCCCGCCG CCCTGGCCCG CGAACGCGCG CTCGCGCGGG AAGCAGCGCT GTGGGTCGCC
GACCTGGAGA CGCGCCGGCG GGCCGGGGCC GAACGGATCG TGGTCGAGGC CGCCGGATCG
CTGACCTTCG ATATCGGCGG CCGCCCGTTC ACCGTCACCG CCAAGGCCGA CCGCATCGAG
CCCACCGCCG ACGGGATGGC CCACATCCTC GACTACAAGA CCGGCGCCGC GCCGTCCAAG
AAGCAGGTCG AGACCGGCTT CTCGCCCCAG CTGACCCTGA CCGCCGCCAT TCTGCGCGAG
GGCGGCTTCC CCGACATCGG CCCGCGCGAG CCCGGCGACC TGACCTATCT GCGGGTCACG
GGCCGCAAGC CGGCCGGGGT GGAAGAGGTG CGGGCCGCCG CCGGAGCCGA CGCCCAGGAG
GCGGCGATCA AGGCGCTGAA CGGGCTGCGC GAGCTGATCG AGCGCTACGA CGATCCGAAA
CAACCCTATC TCTCCCGCGT AGCTCCACAG TTCGTGCATG ACCATGTGGG AGACTACGGA
CATCTGGCGC GGGTGTTCGA GTGGTCGACC AGCGGCGATG ACGGGGAGGG CGGCGAATGA
 
Protein sequence
MTTAGPFDRP LFGRPGPRWF SIPAHRPFVE DLARGLLSAL APLGPEALPA ATVLTPTRRG 
ARALADAFVA AGGGKALLLP QIRPLGDLDE GEPPFEPGEL SLDLPPAVSS RRRRFELARL
VAEHAPLLSF QPQAGQALEM AKALADFLDS CQIEEVVADD GRLDSLAEGD LAQHWQVSAR
FLKAVLTAWP KRLDELGLID VSDRRVRLLN ALSDQWEQNP PQGVLVAAGS TGTAPATARL
LGVIANLPQG AVVLPGLDDG LADDAWDKIA GVQGEQHPQG AMKRLLDGAG ATRADVRPWW
PEADSRGRWR RRLINEALRP AEATADWLAQ IDRLREEAPG LDPIAEGLKG LSLVSTRTEE
EAAVACALLL REALETEGLT AALVTPDQEL ARRVGARLMR WGVIPDSSAG APLAASPAAI
LAQHVAALAV DPLDPVRLLA VAKHPLLRAD AVAARDLELK SLRGPAPRDA GQLLGKLKDH
PDALALAERV LAAARQAAAP YVDDQAPPSV ATRALVESLE ALADPADLWA GSAGECLAGL
LSSLITDGVV LPPASALGFA DLLDRLVNEE TLRVGGATHP RLRIFGAIEA RMVRADRLIL
AGLEEGIWPK NAPIDPFLSR PMRAKLNLPP PERRIGLTAH DFAQAACAPD VILVHSERRG
GAPAVESRWL WRLKTLARGA GLRLTERPDV LAWVRDLDAA GLYDPIKRPA PTPPVADRPR
KMAVTRVEAL TRDPYAVWAR DILKLYPLDR PNEPVEARAR GTAIHAAFET FALQHPGPVP
ANAAEIFAGL YLSELVAAGM PPAALARERA LAREAALWVA DLETRRRAGA ERIVVEAAGS
LTFDIGGRPF TVTAKADRIE PTADGMAHIL DYKTGAAPSK KQVETGFSPQ LTLTAAILRE
GGFPDIGPRE PGDLTYLRVT GRKPAGVEEV RAAAGADAQE AAIKALNGLR ELIERYDDPK
QPYLSRVAPQ FVHDHVGDYG HLARVFEWST SGDDGEGGE