Gene Caul_3665 details

Gene Information

Locus tag: Caul_3665
Symbol:
ID: 5901120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3955944
End bp: 3959102
Gene Length: 3159 bp
Protein Length: 1052 aa
Translation table: 11
GC content: 67%
IMG OID: 641564176
Product: endonuclease/exonuclease/phosphatase
Protein accession: YP_001685290
Protein GI: 167647627
COG category: [R] General function prediction only
COG ID: [COG2374] Predicted extracellular nuclease
TIGRFAM ID:
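
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3955944
END_BP = 3959102


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3159 1052
```

Under these assumptions the result agrees with the listed Gene Length (3159 bp) and Protein Length (1052 aa).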


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.27613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAGG TCTATTTCAA TCTCAGCGCG TCCGATTTCT TCCAGGACTG GTCGGATACG 
TCGCAGCTCA CCACCGCCAA CAATTGGGAC GGCGTGGCCA GCATCGTCGG CTATCGCGGC
GACGACATCG TCACCGCCAC GGCGGTTGAT CCGCGTACGG CCACGGCCGA CCTCAGTGGG
CCGATCAATG TCGCGGTCAA CGCCGGCGCG GCGCTGCTGG AAAGCGGCTC GGCCTCGGGC
GGTGTGGCCG AGTTCCAGAT CGCCGACGCG GTCGTCGGGC TGAACGGCTC GGGCACCGCC
GACGCGCCCA ACCTGGTGCT CTACCTGGAC GGCACGGGCC GCCAGAACCT GCATTTCTCG
GCCGATATCC GCGACCTCGA ACTGCGCGAC AACGCCGTCC AGCAGGTCGC CGTGCAATAT
CGCGTCGGCG GAACCGGCGT GTGGACCTCG ATGCCCGAGG GCTACGTCGC GGACGCGACC
ACGGCCACCA CCGCCACCCA GGTCACCCAC CTCGACTTCG ACCTTCCGGC CTCGGTCAAT
GGCCAGGCCC AGATCGAGAT CCGCATCCTG ACCACCAACG CCGCCGGCAA CGACGAGTGG
GTGGGCATCG ACAACATTCA TGTCACCAGC ACGGCTGGCG GCGGCGGCGA CACGGTCGCC
CCGACCCTGA CCATCAGCAA TCCGAGCGAC CCGGACAACG GCGAGACCGC CGTGGCGCCC
GACGCTCCGA TCGTCCTGCA CTTCACCGAA AGCGTGGTGG CGGGGACCGG CTCGATCACC
CTGACCAGCG GCGCGGACGT GCGCGTGATC GACGTGGCCA GCGGCCAGGT CCATTTCAGC
GGCGCGACGG TGACCATCGA TCCGTCGGCG GATCTTAACC CCGGCGCGAC CTATTCGCTG
ACCGTCGACA ACGGCGCGAT CAAGGACCTG GGCGGCAACG CCTTCGCCGG CGTGGCCTCG
GGCGCGTTCA GCTTCTCGAC CCTCGACCCG GACAAGGTCG TCGCGATCTA CGAGATCCAG
GGCAAGAGCC ACACCTCGGC CTATGACGGC GACAAGGTTC AGACCACGGG CGTGGTCACG
GCGGTCGACA GCAACGGCTT CTACATGCAG GACGCGACCG GCGACGGCGA CAGCGCCACG
TCGGACGGCA TCTTCGTGTT CACCAGCTCG GCCCCGGCGG TGTCGGTCGG CCAGTTGCTC
AAGGTCACTG GCACGGTCGA TGAATACCGG CCGGGCGACG CCACCAGCCT GACGGTCACC
GAGATCGTCT CCAGCGGCAT CCAGGTGCTG GGAACCGGCA CGGTCCACGC GACGGTGATC
GGCGAGCACG GCGTGCTGCC GCCCAACCAC ACCATCGAGG ACGACCACTT CACCAGCTTC
GACCCGGCCA CCGACGGCCT CGACTTCTAC GAGTCGCTCG AAGGCATGCT GGTCACCGTG
GAAGCCCCGA AGGTGGTCGA CAGCACCAAC GGGTTTGGCG AAACCTGGGT CGTCGCTTCG
GGCGGTCAAG GGGCCACCGG CCTCAGCGAC CGGGGCGCGA TGACGATCTC GGCCGGCGAC
TACAATCCCG AGCGCATCCA GATCGACGAC GACAGCGCCA TCAACAGCGG CTACCACCCG
GCCCACGTCC AGGGCGACGT GCTGGGCGAC GTCACCGGCG TCGTCAACTA CAGCTTCAAC
GCCTATGAAG TGCTGGTGAC CGGCCCGGTG ACGACCACCA CCTCGGTCGG CCTGGTCAAC
CGCGAGTTCA CCAACCTCAC CGCCGCCGCC GACAAGCTGA CGATCGCCAC CTACAACCTG
GAAAACCTGG ATCCCACCGA CTCGGCGGCC AAGTTCCAGT CCATCGCCTT CGACATCGCC
AACAACCTGC ACCATCCCGA CATCATCGGG GTGCAGGAGA TCCAGGACGC CGACGGCGCG
GGTTCGGGTT CGAACCTCAG CGGCGCCGCG ACCGCCCAGG TGCTGATCAA CGCCATCGTC
GCGGCCGGCG GCCCTCGCTA CACCTATGTC GAGGTCGCGC CGACCGTGGC CGGCTCGACC
GGCGGCGAGC CGGGCGGCAA CATCCGCAAC GGCTTCCTCT ACGACGCCAG CCGGGTGCAG
TACGTCGATG GCTCGGCCCG CCTGGTCCTC GACACCGACC TGACCAACGG CAACGCCTTC
AACAACAGCC GCAATCCCCT GGCCGCGGAC TTCGTCTTCA ACGACGAGAC CATCACGGCG
GTCAGCGTCC ACTCCACCTC GCGCGGCGGC AGCGGGCCGC TGTTCGGCGC CGACCAGCCG
GCCATCGTCG CCGGCGACGC CTCGCGCACC GCCCAGGCGG TCGAGCTGAC CAAGTTCGTC
AACACGCTCG AGGCCACCAA TCCCGATCAC CACGTCGCGG TGATGGGCGA CTTCAACGGC
TACTATTTCG AGACCGCGCT GGCCGGCCTC GAAGCCAACG CCAACCTCTA CAATCTCGCC
AAGTCGCTGC CGGTCGAGGA GCGCTATTCC TACAAGTTCG AGGGCTACGC GCAGCTGTTC
GACAACATCC TGGTGTCGAA GGACCTGCAG TCGGTGTCCG AATTCGACAT CGTGCACCTC
AATTCCGAAC AATTGCCCAG CGCCCAGGTC TCGAGCGACC ACGACGCCTC GATCACTCGC
ATCACCATCG CGGCGACCAC CAAGCTGGGC GGCGGGGCCG ACGAGGCCGA CTATCACACC
AGCCTCGCGG CGGTGAAGGT CGACGGCGGG CTGGGCGACG ACCACCTGGT CGGCGGCCGC
GGCTCCGACC TGCTGATCGG CGGCGACGGC GCCGACCGCA TCAGCGGCGG AGCTGGCAAC
GACATCATTC GCGGCGGCGG CGGCCACGAC CGGATGCAGG GCGGGGCCGG CGACGACACG
TTCGTCATCG GGGTCGGTGA TCTGGTCACC GGCGACAACC AGTCGGACCA CATCATCGAC
TTCCGGGGCG CGGGTTCGGC CAGCGGCGGC GAGCAGGACC TGCTGATCTT CACCGGCTTC
TCGGCCGGTT CGAAGCTGGT CTTGGATCAC GACATGGGCA ACACGCCGAC CCTGCAACTC
TACCAAGTCT ACAACGGTGC GAGCCTTGTC GGCTCGTTGC TGATACAGAC CAACACGGCC
GCCCACTTGA CCAGCGGCGA CTACCTCTTC CAAGCCTAA
 
Protein sequence
MSKVYFNLSA SDFFQDWSDT SQLTTANNWD GVASIVGYRG DDIVTATAVD PRTATADLSG 
PINVAVNAGA ALLESGSASG GVAEFQIADA VVGLNGSGTA DAPNLVLYLD GTGRQNLHFS
ADIRDLELRD NAVQQVAVQY RVGGTGVWTS MPEGYVADAT TATTATQVTH LDFDLPASVN
GQAQIEIRIL TTNAAGNDEW VGIDNIHVTS TAGGGGDTVA PTLTISNPSD PDNGETAVAP
DAPIVLHFTE SVVAGTGSIT LTSGADVRVI DVASGQVHFS GATVTIDPSA DLNPGATYSL
TVDNGAIKDL GGNAFAGVAS GAFSFSTLDP DKVVAIYEIQ GKSHTSAYDG DKVQTTGVVT
AVDSNGFYMQ DATGDGDSAT SDGIFVFTSS APAVSVGQLL KVTGTVDEYR PGDATSLTVT
EIVSSGIQVL GTGTVHATVI GEHGVLPPNH TIEDDHFTSF DPATDGLDFY ESLEGMLVTV
EAPKVVDSTN GFGETWVVAS GGQGATGLSD RGAMTISAGD YNPERIQIDD DSAINSGYHP
AHVQGDVLGD VTGVVNYSFN AYEVLVTGPV TTTTSVGLVN REFTNLTAAA DKLTIATYNL
ENLDPTDSAA KFQSIAFDIA NNLHHPDIIG VQEIQDADGA GSGSNLSGAA TAQVLINAIV
AAGGPRYTYV EVAPTVAGST GGEPGGNIRN GFLYDASRVQ YVDGSARLVL DTDLTNGNAF
NNSRNPLAAD FVFNDETITA VSVHSTSRGG SGPLFGADQP AIVAGDASRT AQAVELTKFV
NTLEATNPDH HVAVMGDFNG YYFETALAGL EANANLYNLA KSLPVEERYS YKFEGYAQLF
DNILVSKDLQ SVSEFDIVHL NSEQLPSAQV SSDHDASITR ITIAATTKLG GGADEADYHT
SLAAVKVDGG LGDDHLVGGR GSDLLIGGDG ADRISGGAGN DIIRGGGGHD RMQGGAGDDT
FVIGVGDLVT GDNQSDHIID FRGAGSASGG EQDLLIFTGF SAGSKLVLDH DMGNTPTLQL
YQVYNGASLV GSLLIQTNTA AHLTSGDYLF QA
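
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the NCBI ordering (bases in the order T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts. Because this gene begins with ATG, no special start-codon handling is included. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon.

    Whitespace is ignored; codons with ambiguous bases raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row and final nine bases of the gene sequence, as printed above.
first_row = "ATGTCCAAGG TCTATTTCAA TCTCAGCGCG TCCGATTTCT TCCAGGACTG GTCGGATACG"
print(translate(first_row))    # prints: MSKVYFNLSASDFFQDWSDT
print(translate("CAAGCCTAA"))  # prints: QA
```

Both results agree with the start (MSKVYFNLSA SDFFQDWSDT) and end (QA) of the protein sequence listed above.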