Gene Caul_3011 details


Gene Information

Locus tag: Caul_3011
Symbol:
ID: 5902580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3276749
End bp: 3278566
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 71%
IMG OID: 641563512
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_001684636
Protein GI: 167646973
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
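
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3276749
END_BP = 3278566


def gene_length(start_bp, end_bp):
    """Length in bp of an interval with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1818 605
```

Under these assumptions the result agrees with the listed Gene Length (1818 bp) and Protein Length (605 aa).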


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.245556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCG ACGGTCACGT TTCGGTCGCG TCCGACCACT TCCTCGACGT TTCGCGCTCG 
CTGAGCGGCC GGGCCTGGCG CGAGCGGCCG GGTGACGCGG CCGTGGCGCG CAGCCACCAG
CAGTTGCACG GCCTGTCCGA ACCGCTCGCC CGCGCCCTGG CCTCGCGCGG CGTGGGTGTC
GATGACGCCG AGCACTATCT GCGGCCGACC TTGAAGGCGC TGTTTCCAGA CCCCTCCAGC
TTCACCGACA TGGACCGGGC CGCCGAGATC CTGATCGACG CGCTGGAGAG CGGCCGCTCG
ACCATGGTGT TCGCCGACTA CGACGTGGAC GGCGCCACCA GCGCCGCCCA GCTGGTGCGC
TGGTTTCGCT ACATGGGCGT CGAGCTGCCG ATCTATATTC CCGACCGCTT GACCGAGGGT
TATGGCCCCA GTCCGGCGGC CTTCAAGACC ATCCGGGACA CCGGCGCCGA CTTGGTCGTC
ACCCTCGACT GCGGGGCGGC GGCCTACGAC GCCATCGCCA GCGCCGCGAC GATCGGGCTG
GAGGTGGTGG TGATCGATCA CCACCTGATG CGCGAGGACC CGCCGGCCGC CGCCGCCGTG
GTCAATCCCA ACCGTCCGGG CTGCCAGAGC GGGCAGGGGG TGCTGGCCGC CGCCGGGGTG
ACCTTCGTGC TGCTGGCCGC CCTGAACCGC GAGGCTCGCC GGCGCGGCCT GTTCACCGAC
GAACGCCCCC AGCCGGATCT CCGCCAATGG CTGGACCTCG TGGCCATGGG CGAGGTCTGC
GACGTCACCC AACTGGTCGG CTTCAACCGC GCCCTGACGA CGCTTGGCCT CCGAACCATG
TCGAGCTGGG GCAATCCGGG CCTCAAGGCC CTGTTCGAGG TCGGCAAGGG CTCGGGTCCC
GCCTCGGTGT TCCACGCCGG CTTCATCCTG GGGCCGCGCA TCAACGCCGG CGGCCGGATC
GGCCGTTCCG ACCTCGGCGC GCGCCTGCTG TCCACGGACG ATCCCGAGGA GGCGCGCATG
CTGGCCGAGG AATTGGACGG CCTCAACACC GAGCGCAAGG CGGTCGAGGC CGGCGTGGTC
GAGGAGGCCG CGGCCGTGCT CGAACGCGGC AGCAACTTCA ATCCCGACGC CCCGGTGATC
GTGGTGGCCG GGGAGGGCTG GCATCCTGGG GTGATCGGCA TCGTCGCCGG CCGCCTGCGC
GAGCGCTATC GCAAGCCGGT GGTGGTGATC GGCATCGACC GCGCCGCCAA TGTCGGCAAG
GGCTCGGGCC GTTCGCAGCC GGGCGTCAAT CTGGGCGCGG CGATCCAGGC GGCCTTCGAG
GCGGGCCTGC TGATGGCCGG CGGCGGCCAC GCCATGGCGG CGGGCCTGTC GATCCGCCCC
GACTCCATTC CCGAGTTCCG CGCTTTCCTG GAAGAGCGTT TGGCGGGCGA GATGGAGGCT
GTTGGCGTCG AGGGGGTGGA TATCGACGCC CTGGTCCAGC CCCGCGCCGC CAACCGCGCC
CTGTGGACCG AGTTTCAGCA ACTGGCGCCG TTCGGCCCAG GCAACCCCGA ACCGATGTTC
GCCTTGGCCG GGGTCCGGCC AGAGCGGATG ATGGCGATGA AGGGCGGCCA CGTGCGCGTC
GATCTGGTCG GACCGTCGGG CGACCGAATC AAGGCGATCT CCTGGCGCTC GGTTGAAACC
CCGCTGGGAC AACGTCTGAT GGCGGGCGGC GGCGCTCTGC ATGTCGTAGG TCGCCTGAAA
CCGGACGATT ATATGGGCCG GGAAGGGGTG CAGTTGGAGA TCGAGGACGC CGCCGACCCC
CGCATGCGCG TGACTTGA
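
The sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. The following is a minimal sketch of how a GC fraction could be computed from such text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs only on the final line of the gene sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Final line of the gene sequence above, used as a short example.
last_line = "CGCATGCGCG TGACTTGA"
seq = clean_sequence(last_line)
print(len(seq), round(gc_fraction(seq), 3))  # 18 0.611
```

Passing all sequence lines, joined together, through the same two functions gives a value that can be compared with the GC content field of the record.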
 
Protein sequence
MPADGHVSVA SDHFLDVSRS LSGRAWRERP GDAAVARSHQ QLHGLSEPLA RALASRGVGV 
DDAEHYLRPT LKALFPDPSS FTDMDRAAEI LIDALESGRS TMVFADYDVD GATSAAQLVR
WFRYMGVELP IYIPDRLTEG YGPSPAAFKT IRDTGADLVV TLDCGAAAYD AIASAATIGL
EVVVIDHHLM REDPPAAAAV VNPNRPGCQS GQGVLAAAGV TFVLLAALNR EARRRGLFTD
ERPQPDLRQW LDLVAMGEVC DVTQLVGFNR ALTTLGLRTM SSWGNPGLKA LFEVGKGSGP
ASVFHAGFIL GPRINAGGRI GRSDLGARLL STDDPEEARM LAEELDGLNT ERKAVEAGVV
EEAAAVLERG SNFNPDAPVI VVAGEGWHPG VIGIVAGRLR ERYRKPVVVI GIDRAANVGK
GSGRSQPGVN LGAAIQAAFE AGLLMAGGGH AMAAGLSIRP DSIPEFRAFL EERLAGEMEA
VGVEGVDIDA LVQPRAANRA LWTEFQQLAP FGPGNPEPMF ALAGVRPERM MAMKGGHVRV
DLVGPSGDRI KAISWRSVET PLGQRLMAGG GALHVVGRLK PDDYMGREGV QLEIEDAADP
RMRVT
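
The protein sequence can be related to the gene sequence by codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which start codons are permitted), so a standard codon table suffices for a sketch. The code below is an illustrative, dependency-free implementation, not the method used to generate this record; `translate` is a hypothetical helper, and only the first and last few codons of the gene are used as examples.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments match table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGCCCGCCG ACGGTCACGT TTCGGTCGCG"))  # MPADGHVSVA
print(translate("CGCATGCGCG TGACTTGA"))  # RMRVT
```

The two outputs match the first block and the final residues of the protein sequence above.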