Gene Information |
Locus tag | Caul_3665 |
Symbol | |
ID | 5901120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3955944 |
End bp | 3959102 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564176 |
Product | endonuclease/exonuclease/phosphatase |
Protein accession | YP_001685290 |
Protein GI | 167647627 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
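The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based, inclusive coordinates
    - the final codon is a stop codon and is not translated
    """
    # Inclusive span covered by the coordinates
    span = end_bp - start_bp + 1
    # Number of whole codons and any leftover bases
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Caul_3665 record above
result = check_cds_lengths(
    start_bp=3955944,
    end_bp=3959102,
    gene_length_bp=3159,
    protein_length_aa=1052,
)
print(result)
```

Under these assumptions the three fields agree: the coordinates span 3159 bp, which is 1053 codons, giving 1052 amino acids plus one stop codon.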
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.27613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGG TCTATTTCAA TCTCAGCGCG TCCGATTTCT TCCAGGACTG GTCGGATACG TCGCAGCTCA CCACCGCCAA CAATTGGGAC GGCGTGGCCA GCATCGTCGG CTATCGCGGC GACGACATCG TCACCGCCAC GGCGGTTGAT CCGCGTACGG CCACGGCCGA CCTCAGTGGG CCGATCAATG TCGCGGTCAA CGCCGGCGCG GCGCTGCTGG AAAGCGGCTC GGCCTCGGGC GGTGTGGCCG AGTTCCAGAT CGCCGACGCG GTCGTCGGGC TGAACGGCTC GGGCACCGCC GACGCGCCCA ACCTGGTGCT CTACCTGGAC GGCACGGGCC GCCAGAACCT GCATTTCTCG GCCGATATCC GCGACCTCGA ACTGCGCGAC AACGCCGTCC AGCAGGTCGC CGTGCAATAT CGCGTCGGCG GAACCGGCGT GTGGACCTCG ATGCCCGAGG GCTACGTCGC GGACGCGACC ACGGCCACCA CCGCCACCCA GGTCACCCAC CTCGACTTCG ACCTTCCGGC CTCGGTCAAT GGCCAGGCCC AGATCGAGAT CCGCATCCTG ACCACCAACG CCGCCGGCAA CGACGAGTGG GTGGGCATCG ACAACATTCA TGTCACCAGC ACGGCTGGCG GCGGCGGCGA CACGGTCGCC CCGACCCTGA CCATCAGCAA TCCGAGCGAC CCGGACAACG GCGAGACCGC CGTGGCGCCC GACGCTCCGA TCGTCCTGCA CTTCACCGAA AGCGTGGTGG CGGGGACCGG CTCGATCACC CTGACCAGCG GCGCGGACGT GCGCGTGATC GACGTGGCCA GCGGCCAGGT CCATTTCAGC GGCGCGACGG TGACCATCGA TCCGTCGGCG GATCTTAACC CCGGCGCGAC CTATTCGCTG ACCGTCGACA ACGGCGCGAT CAAGGACCTG GGCGGCAACG CCTTCGCCGG CGTGGCCTCG GGCGCGTTCA GCTTCTCGAC CCTCGACCCG GACAAGGTCG TCGCGATCTA CGAGATCCAG GGCAAGAGCC ACACCTCGGC CTATGACGGC GACAAGGTTC AGACCACGGG CGTGGTCACG GCGGTCGACA GCAACGGCTT CTACATGCAG GACGCGACCG GCGACGGCGA CAGCGCCACG TCGGACGGCA TCTTCGTGTT CACCAGCTCG GCCCCGGCGG TGTCGGTCGG CCAGTTGCTC AAGGTCACTG GCACGGTCGA TGAATACCGG CCGGGCGACG CCACCAGCCT GACGGTCACC GAGATCGTCT CCAGCGGCAT CCAGGTGCTG GGAACCGGCA CGGTCCACGC GACGGTGATC GGCGAGCACG GCGTGCTGCC GCCCAACCAC ACCATCGAGG ACGACCACTT CACCAGCTTC GACCCGGCCA CCGACGGCCT CGACTTCTAC GAGTCGCTCG AAGGCATGCT GGTCACCGTG GAAGCCCCGA AGGTGGTCGA CAGCACCAAC GGGTTTGGCG AAACCTGGGT CGTCGCTTCG GGCGGTCAAG GGGCCACCGG CCTCAGCGAC CGGGGCGCGA TGACGATCTC GGCCGGCGAC TACAATCCCG AGCGCATCCA GATCGACGAC GACAGCGCCA TCAACAGCGG CTACCACCCG GCCCACGTCC AGGGCGACGT GCTGGGCGAC GTCACCGGCG TCGTCAACTA CAGCTTCAAC GCCTATGAAG TGCTGGTGAC CGGCCCGGTG ACGACCACCA CCTCGGTCGG CCTGGTCAAC CGCGAGTTCA CCAACCTCAC CGCCGCCGCC GACAAGCTGA CGATCGCCAC CTACAACCTG 
GAAAACCTGG ATCCCACCGA CTCGGCGGCC AAGTTCCAGT CCATCGCCTT CGACATCGCC AACAACCTGC ACCATCCCGA CATCATCGGG GTGCAGGAGA TCCAGGACGC CGACGGCGCG GGTTCGGGTT CGAACCTCAG CGGCGCCGCG ACCGCCCAGG TGCTGATCAA CGCCATCGTC GCGGCCGGCG GCCCTCGCTA CACCTATGTC GAGGTCGCGC CGACCGTGGC CGGCTCGACC GGCGGCGAGC CGGGCGGCAA CATCCGCAAC GGCTTCCTCT ACGACGCCAG CCGGGTGCAG TACGTCGATG GCTCGGCCCG CCTGGTCCTC GACACCGACC TGACCAACGG CAACGCCTTC AACAACAGCC GCAATCCCCT GGCCGCGGAC TTCGTCTTCA ACGACGAGAC CATCACGGCG GTCAGCGTCC ACTCCACCTC GCGCGGCGGC AGCGGGCCGC TGTTCGGCGC CGACCAGCCG GCCATCGTCG CCGGCGACGC CTCGCGCACC GCCCAGGCGG TCGAGCTGAC CAAGTTCGTC AACACGCTCG AGGCCACCAA TCCCGATCAC CACGTCGCGG TGATGGGCGA CTTCAACGGC TACTATTTCG AGACCGCGCT GGCCGGCCTC GAAGCCAACG CCAACCTCTA CAATCTCGCC AAGTCGCTGC CGGTCGAGGA GCGCTATTCC TACAAGTTCG AGGGCTACGC GCAGCTGTTC GACAACATCC TGGTGTCGAA GGACCTGCAG TCGGTGTCCG AATTCGACAT CGTGCACCTC AATTCCGAAC AATTGCCCAG CGCCCAGGTC TCGAGCGACC ACGACGCCTC GATCACTCGC ATCACCATCG CGGCGACCAC CAAGCTGGGC GGCGGGGCCG ACGAGGCCGA CTATCACACC AGCCTCGCGG CGGTGAAGGT CGACGGCGGG CTGGGCGACG ACCACCTGGT CGGCGGCCGC GGCTCCGACC TGCTGATCGG CGGCGACGGC GCCGACCGCA TCAGCGGCGG AGCTGGCAAC GACATCATTC GCGGCGGCGG CGGCCACGAC CGGATGCAGG GCGGGGCCGG CGACGACACG TTCGTCATCG GGGTCGGTGA TCTGGTCACC GGCGACAACC AGTCGGACCA CATCATCGAC TTCCGGGGCG CGGGTTCGGC CAGCGGCGGC GAGCAGGACC TGCTGATCTT CACCGGCTTC TCGGCCGGTT CGAAGCTGGT CTTGGATCAC GACATGGGCA ACACGCCGAC CCTGCAACTC TACCAAGTCT ACAACGGTGC GAGCCTTGTC GGCTCGTTGC TGATACAGAC CAACACGGCC GCCCACTTGA CCAGCGGCGA CTACCTCTTC CAAGCCTAA
|
Protein sequence | MSKVYFNLSA SDFFQDWSDT SQLTTANNWD GVASIVGYRG DDIVTATAVD PRTATADLSG PINVAVNAGA ALLESGSASG GVAEFQIADA VVGLNGSGTA DAPNLVLYLD GTGRQNLHFS ADIRDLELRD NAVQQVAVQY RVGGTGVWTS MPEGYVADAT TATTATQVTH LDFDLPASVN GQAQIEIRIL TTNAAGNDEW VGIDNIHVTS TAGGGGDTVA PTLTISNPSD PDNGETAVAP DAPIVLHFTE SVVAGTGSIT LTSGADVRVI DVASGQVHFS GATVTIDPSA DLNPGATYSL TVDNGAIKDL GGNAFAGVAS GAFSFSTLDP DKVVAIYEIQ GKSHTSAYDG DKVQTTGVVT AVDSNGFYMQ DATGDGDSAT SDGIFVFTSS APAVSVGQLL KVTGTVDEYR PGDATSLTVT EIVSSGIQVL GTGTVHATVI GEHGVLPPNH TIEDDHFTSF DPATDGLDFY ESLEGMLVTV EAPKVVDSTN GFGETWVVAS GGQGATGLSD RGAMTISAGD YNPERIQIDD DSAINSGYHP AHVQGDVLGD VTGVVNYSFN AYEVLVTGPV TTTTSVGLVN REFTNLTAAA DKLTIATYNL ENLDPTDSAA KFQSIAFDIA NNLHHPDIIG VQEIQDADGA GSGSNLSGAA TAQVLINAIV AAGGPRYTYV EVAPTVAGST GGEPGGNIRN GFLYDASRVQ YVDGSARLVL DTDLTNGNAF NNSRNPLAAD FVFNDETITA VSVHSTSRGG SGPLFGADQP AIVAGDASRT AQAVELTKFV NTLEATNPDH HVAVMGDFNG YYFETALAGL EANANLYNLA KSLPVEERYS YKFEGYAQLF DNILVSKDLQ SVSEFDIVHL NSEQLPSAQV SSDHDASITR ITIAATTKLG GGADEADYHT SLAAVKVDGG LGDDHLVGGR GSDLLIGGDG ADRISGGAGN DIIRGGGGHD RMQGGAGDDT FVIGVGDLVT GDNQSDHIID FRGAGSASGG EQDLLIFTGF SAGSKLVLDH DMGNTPTLQL YQVYNGASLV GSLLIQTNTA AHLTSGDYLF QA