Gene Information |
Locus tag | Caul_4852 |
Symbol | |
ID | 5902314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5247270 |
End bp | 5250269 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641565372 |
Product | double-strand break repair protein AddB |
Protein accession | YP_001686470 |
Protein GI | 167648807 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.173842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGG CCGGCCCCTT CGACCGCCCC CTCTTCGGTC GTCCGGGGCC GCGCTGGTTC TCGATCCCCG CCCATCGGCC CTTCGTCGAG GACCTGGCGC GCGGCCTGCT GAGCGCCCTG GCGCCGCTGG GTCCCGAGGC CCTGCCCGCC GCCACCGTCC TGACCCCGAC CCGGCGCGGG GCCCGAGCCT TGGCCGACGC CTTCGTGGCG GCCGGCGGCG GCAAGGCCCT GCTGCTACCG CAGATCCGCC CGCTGGGCGA CCTGGACGAG GGCGAACCGC CGTTCGAGCC GGGCGAGCTG AGCCTGGACC TGCCGCCCGC CGTCTCCTCG CGCCGCCGCC GGTTCGAACT GGCCCGGCTG GTCGCAGAAC ACGCCCCTCT GCTGTCGTTC CAGCCTCAGG CGGGGCAGGC GTTGGAGATG GCCAAGGCCC TGGCCGACTT CCTCGACAGC TGCCAGATCG AGGAGGTGGT CGCCGACGAC GGCCGCCTGG ACAGCCTGGC CGAAGGCGAC CTGGCCCAGC ACTGGCAGGT CTCGGCGCGG TTCCTGAAGG CCGTCCTGAC CGCCTGGCCC AAACGGTTGG ACGAACTGGG CCTGATCGAC GTCAGCGACC GCCGGGTGCG GCTGCTCAAC GCGCTGTCCG ACCAGTGGGA GCAGAACCCG CCCCAGGGGG TGCTGGTCGC CGCCGGCTCG ACCGGCACGG CCCCGGCCAC CGCGCGCCTG CTCGGCGTCA TCGCCAACCT GCCCCAGGGC GCGGTGGTGC TGCCCGGCCT CGATGACGGC CTGGCCGACG ACGCCTGGGA CAAGATCGCC GGCGTCCAGG GCGAACAGCA TCCGCAGGGG GCGATGAAGC GTCTGTTGGA CGGAGCCGGC GCGACCCGCG CCGACGTCCG CCCCTGGTGG CCCGAGGCCG ACAGCCGGGG CCGCTGGCGC CGCCGCCTGA TCAACGAGGC CCTGCGCCCG GCCGAGGCCA CCGCCGACTG GCTGGCCCAG ATCGATCGCC TTCGCGAGGA GGCTCCTGGC CTCGATCCGA TCGCCGAGGG GTTGAAGGGC CTGTCCCTGG TCAGCACCCG CACCGAGGAG GAGGCCGCCG TCGCCTGCGC GCTTCTCTTG CGCGAGGCCC TGGAGACCGA GGGCCTGACC GCCGCCCTGG TCACCCCCGA CCAGGAACTG GCCCGCCGGG TCGGCGCCCG CCTGATGCGC TGGGGCGTGA TCCCCGACAG CTCGGCCGGC GCGCCCCTGG CCGCCAGCCC CGCCGCCATC CTGGCCCAGC ATGTCGCCGC CCTGGCGGTC GATCCGCTCG ACCCCGTGCG CCTGCTGGCC GTGGCCAAGC ATCCCCTGCT GCGCGCCGAC GCCGTGGCCG CCCGCGACCT GGAACTGAAG AGCTTGCGCG GCCCCGCGCC GCGCGACGCG GGCCAACTGC TGGGCAAGCT GAAGGACCAT CCCGACGCCC TGGCCCTGGC CGAGCGCGTC CTGGCCGCCG CTCGCCAGGC CGCCGCGCCC TATGTCGATG ACCAGGCCCC GCCATCCGTC GCCACCCGCG CCCTGGTCGA GAGCCTGGAG GCCCTGGCCG ATCCCGCCGA CCTGTGGGCC GGGTCGGCCG GCGAATGCCT GGCCGGCCTG TTGTCGTCGC TAATCACCGA CGGCGTCGTC CTGCCGCCCG CCTCGGCCCT GGGTTTCGCC GACCTGCTCG ACCGGCTGGT CAACGAAGAG ACCCTGCGCG TCGGCGGCGC CACCCACCCG CGCCTGCGGA TCTTCGGGGC CATCGAGGCG CGGATGGTGC GGGCCGACCG GTTGATCCTG 
GCCGGCCTGG AGGAAGGGAT CTGGCCCAAG AACGCGCCGA TCGACCCGTT CCTGTCGCGG CCCATGCGCG CCAAGCTGAA CCTGCCGCCG CCCGAGCGCC GCATCGGCCT GACCGCCCAC GACTTCGCCC AGGCGGCCTG CGCGCCCGAC GTGATCCTGG TCCACAGCGA GCGACGCGGC GGGGCGCCGG CCGTCGAGAG CCGCTGGCTG TGGCGGCTGA AGACCCTGGC CCGCGGCGCC GGCCTGCGCC TGACCGAGCG CCCCGACGTC CTGGCCTGGG TCCGCGATCT GGACGCCGCC GGTCTCTATG ACCCCATCAA GCGCCCCGCG CCGACGCCGC CGGTCGCCGA CCGACCGCGC AAGATGGCCG TCACCCGGGT CGAGGCCCTG ACCCGCGACC CCTACGCCGT CTGGGCGCGC GACATCCTCA AGCTCTATCC GCTCGATCGC CCCAACGAGC CGGTCGAGGC CCGGGCGCGC GGCACGGCGA TCCACGCGGC GTTCGAGACG TTCGCGCTGC AACACCCCGG CCCCGTGCCC GCCAACGCCG CCGAGATCTT CGCTGGCCTG TACCTGTCCG AACTGGTCGC CGCAGGCATG CCGCCCGCCG CCCTGGCCCG CGAACGCGCG CTCGCGCGGG AAGCAGCGCT GTGGGTCGCC GACCTGGAGA CGCGCCGGCG GGCCGGGGCC GAACGGATCG TGGTCGAGGC CGCCGGATCG CTGACCTTCG ATATCGGCGG CCGCCCGTTC ACCGTCACCG CCAAGGCCGA CCGCATCGAG CCCACCGCCG ACGGGATGGC CCACATCCTC GACTACAAGA CCGGCGCCGC GCCGTCCAAG AAGCAGGTCG AGACCGGCTT CTCGCCCCAG CTGACCCTGA CCGCCGCCAT TCTGCGCGAG GGCGGCTTCC CCGACATCGG CCCGCGCGAG CCCGGCGACC TGACCTATCT GCGGGTCACG GGCCGCAAGC CGGCCGGGGT GGAAGAGGTG CGGGCCGCCG CCGGAGCCGA CGCCCAGGAG GCGGCGATCA AGGCGCTGAA CGGGCTGCGC GAGCTGATCG AGCGCTACGA CGATCCGAAA CAACCCTATC TCTCCCGCGT AGCTCCACAG TTCGTGCATG ACCATGTGGG AGACTACGGA CATCTGGCGC GGGTGTTCGA GTGGTCGACC AGCGGCGATG ACGGGGAGGG CGGCGAATGA
|
Protein sequence | MTTAGPFDRP LFGRPGPRWF SIPAHRPFVE DLARGLLSAL APLGPEALPA ATVLTPTRRG ARALADAFVA AGGGKALLLP QIRPLGDLDE GEPPFEPGEL SLDLPPAVSS RRRRFELARL VAEHAPLLSF QPQAGQALEM AKALADFLDS CQIEEVVADD GRLDSLAEGD LAQHWQVSAR FLKAVLTAWP KRLDELGLID VSDRRVRLLN ALSDQWEQNP PQGVLVAAGS TGTAPATARL LGVIANLPQG AVVLPGLDDG LADDAWDKIA GVQGEQHPQG AMKRLLDGAG ATRADVRPWW PEADSRGRWR RRLINEALRP AEATADWLAQ IDRLREEAPG LDPIAEGLKG LSLVSTRTEE EAAVACALLL REALETEGLT AALVTPDQEL ARRVGARLMR WGVIPDSSAG APLAASPAAI LAQHVAALAV DPLDPVRLLA VAKHPLLRAD AVAARDLELK SLRGPAPRDA GQLLGKLKDH PDALALAERV LAAARQAAAP YVDDQAPPSV ATRALVESLE ALADPADLWA GSAGECLAGL LSSLITDGVV LPPASALGFA DLLDRLVNEE TLRVGGATHP RLRIFGAIEA RMVRADRLIL AGLEEGIWPK NAPIDPFLSR PMRAKLNLPP PERRIGLTAH DFAQAACAPD VILVHSERRG GAPAVESRWL WRLKTLARGA GLRLTERPDV LAWVRDLDAA GLYDPIKRPA PTPPVADRPR KMAVTRVEAL TRDPYAVWAR DILKLYPLDR PNEPVEARAR GTAIHAAFET FALQHPGPVP ANAAEIFAGL YLSELVAAGM PPAALARERA LAREAALWVA DLETRRRAGA ERIVVEAAGS LTFDIGGRPF TVTAKADRIE PTADGMAHIL DYKTGAAPSK KQVETGFSPQ LTLTAAILRE GGFPDIGPRE PGDLTYLRVT GRKPAGVEEV RAAAGADAQE AAIKALNGLR ELIERYDDPK QPYLSRVAPQ FVHDHVGDYG HLARVFEWST SGDDGEGGE
|
| |
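The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates (which matches the reported 3000 bp) and assumes the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # Values taken from the record for Caul_4852.
    length = gene_length(5247270, 5250269)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
    # GC fraction of the first 10-base block of the gene sequence.
    print(gc_fraction("ATGACCACGG"))
```

Passing the full gene sequence from the Sequence table to `gc_fraction` should give a value close to the reported GC content, since the 10-base grouping spaces are skipped.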