Gene Caul_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2232 
Symbol 
ID5899687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2426630 
End bp2428507 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content70% 
IMG OID641562723 
Productnuclease 
Protein accessionYP_001683857 
Protein GI167646194 
COG category[K] Transcription 
COG ID[COG1475] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.622811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCT CCCACGCTGA CCCGCGCAAG CTGGTCATCT CACCGCTCAA CATGCACTTC 
GACGAGCCCG CGCCCGACGT CTCCGACATC ATCGGATCGG TGCGCCAGCA CGGCGTCCTG
GAAACCCTCC TGGTGCGCGA GACCTTCCAG GACGGCGTCC TGGTCCCCGA GCATTTCGAG
GTGGTGGCCG GGCGGCGGCG CTGCTTCGCC GCCCAGGCCG TTGCGGGCGA GGGGATCGAC
ATCGATCCGG TCCCTATCGG CATCCTCGAA CCGGGCGATG ACGCCGCTGC CCTGGAAATC
TCGCTGATCG AGAACATGGC CCGGTTGCCG CCGAACGAGG TGGCTTGCTG GGAGACCTTC
GTGAAGTTGA TCCGCGAAGG CCGCACGCCC GAATACCTGG CCGCCACCTA CGGCAAGACC
GAGGCCGTCA TTCACCGCGT GCTGGCCTTG GGCAATCTCC TGCCGCGCAT CCGCAAGCTC
TATCGCAAGG AGGCCATCGA CGTGGCCACG GTCCGTCAAC TGACCCTGGC CAGCAAGAGC
CAGCAGAAGG ACTGGCTCAA GCTCTATGAC GATCCGGGCC AGCGCGAACC GCGCGGCGCC
AGTCTCAAGG CCTGGCTGTT CGGCGGCGAA GCGATCCCGA CCAGCTTGGC GATCTTCGCC
TTGGAAGACT ATCCCGGCGC GATCATCGCC GACCTCTTCG GCGAGGAGGC CTATTTCACC
GACGCGGCGC TGTTCTGGAC CCATCAGAAC CAAGCCGTGG CGGCCAAGGT CGAGGCCTTG
CGCGGCGAGG GCTGGAGCGA CGTCGAGGTG CTGGAGGTCG GGCAGACCTT CTATGGCTAC
CAGCACGTGG CCGTCCCGAA GGCCAAGGGC GGCAAGGTGT TCATCGCCGT CAGCCGCCGG
GGCGAGGTGA CCGTGCATGA AGGCTGGCTG ACCACCAAGG ACGCCCGTCG CGCCGAGGCC
GCCGCCCGCG CCGCCGCGCA GGGCGGCGAC GGCGCGGGCG GCTTGGGCGC CGGTGGGGAA
GGGGGCGGCA AGGCCGAGCG CTCGGAGGTC ACCGCCAACC AGCAGGCCTA TATCGACCTG
CACCGGCGCG CCGCCGTTCG CGCGGTGCTC ACCGACCACC TGGACGTGGC CTTGCGCCTG
CTCGTCGCCC ACGCCGTGGC CGGGTCGCGC TACTGGCGCG TGGAGACCGA TCCGCTGGGC
GCCGGATCGC AGATGGCCGC CGACAGCCTC AAGACCAGCC CCGGCGAAAC CTTGTTCGCG
GCCAAGCAGA CCCAGGCCCT TCGGCTCCTG GGTCTGGTGG CCGCTGGCGA TGTGGTCGGG
CGCGGCCACG CGGGCGGCGC GGCAGCGGTG TTCGCCCGGC TGCTGTCGCT CACCGACGCC
GAGGTCATGG CCGTCGCGGC GGTGGTCATC GGCGAGACCC TGGCGGCCGG GAGCGCCGAG
GTCGAGGCGG CCGGAACCTA TCTCCAGGTC GACATGGGGG ACTTCTGGCG GCCCGACGAG
GCCTTCTTCG ACGGCATCAA GGACCGCGAG ACCGTCAACG CCATGCTCAA GGAGGTGGGC
GGCAAGAAGG TCGCCGACGG CAATGTCAGC GAGAAGGTCC GCACCCAGAA GGGCATCTTG
CGCGACTTCC TGGCCGGGAC CAACGACCGG CCCAAGGTGG AGCGCTGGAC GCCCCGCTGG
CTGACCTTCC CGGCCCAGGC CTATACCCGC AGGCCCTTCG CCACGGCCCA GCGGTCCAAG
GCCGTCGCGC CGCTGCTGCG CCGGGTGCGC CCGCCGCAGA GTGGAGCGGG AGCACCGGAG
GCCATTGCGC CCCACGAGGG CGCGGTCCAG ACCCCCGCGC TGGTTCAGGC GGGCGCGGAG
ACCTTCGCCG CCGAGTGA
 
Protein sequence
MRLSHADPRK LVISPLNMHF DEPAPDVSDI IGSVRQHGVL ETLLVRETFQ DGVLVPEHFE 
VVAGRRRCFA AQAVAGEGID IDPVPIGILE PGDDAAALEI SLIENMARLP PNEVACWETF
VKLIREGRTP EYLAATYGKT EAVIHRVLAL GNLLPRIRKL YRKEAIDVAT VRQLTLASKS
QQKDWLKLYD DPGQREPRGA SLKAWLFGGE AIPTSLAIFA LEDYPGAIIA DLFGEEAYFT
DAALFWTHQN QAVAAKVEAL RGEGWSDVEV LEVGQTFYGY QHVAVPKAKG GKVFIAVSRR
GEVTVHEGWL TTKDARRAEA AARAAAQGGD GAGGLGAGGE GGGKAERSEV TANQQAYIDL
HRRAAVRAVL TDHLDVALRL LVAHAVAGSR YWRVETDPLG AGSQMAADSL KTSPGETLFA
AKQTQALRLL GLVAAGDVVG RGHAGGAAAV FARLLSLTDA EVMAVAAVVI GETLAAGSAE
VEAAGTYLQV DMGDFWRPDE AFFDGIKDRE TVNAMLKEVG GKKVADGNVS EKVRTQKGIL
RDFLAGTNDR PKVERWTPRW LTFPAQAYTR RPFATAQRSK AVAPLLRRVR PPQSGAGAPE
AIAPHEGAVQ TPALVQAGAE TFAAE