Gene Information |
Locus tag | Caul_4884 |
Symbol | |
ID | 5902346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5279205 |
End bp | 5281388 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565404 |
Product | serralysin |
Protein accession | YP_001686502 |
Protein GI | 167648839 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTAC ACGACGAACA CCGCGATCAG GATCTTCATC GCGAAACCGA TTTCCCCACC TTCCAGGGGC CGACCAACAC GGTGTCGCTG GCCGTCCCCG TCGCGCCGGT CGACGCGTCG GTCGATGGCC AGGGCGGCGA GATCCGCGGC AAGCCGGTAT TCACGATCGA GCAGGCGACC GAGCAGCTCA ATCGCGGCGG GGCCGGGTGG ACGCCCGGGG TGGTCGACCA CGCCGTACCG CGCGACGGCG ACGTCTCGGT GCTGAACTTC GGCTTCCACA CCGCGCAGAG CATGTTCGCC GAGCCCTATG TCTATGAGGA AGGCGGGGAG CTGTACGGCC GGACCGAATA TTTCGGCTTC GCGGAGTTCA CCGAGCCCCA GAAGGCGGCC GCGCGCGAGG CCATGGCCAG CTGGGACGAC CTGATCGCCC CGACCCTGGT GGAAAGCGCC CCCGAGGTCG CCGACATCAC CTTCGCCAAC TACACCAACC GGCCTGGCAC CCAGGCCTAT GCCTACCTGC CCTACGACTA CACGCCCGAC AACGCCGACC TGATCGCCGG CGACGTCTGG GTCAGCGCCA ACCAGCCCAG CAACTTCCAG CTCGACGAGG GCCTCTACGG CATCCACACC CTGACCCACG AGAGCGGCCA CGCCCTGGGG CTGGAGCATC CGGGCGACTA CAACGCCGCG CCCGGCGTCA CCATCACCTA CGCCGACAAC GCCGAATATT ATCAGGACAG CCGCGCCTAC AGCATCATGT CCTACTTCGC CGCCTCGGAG ACCGGCGCGC GGCATTTCGA CTTCAACATC TCGACCACGG TCTACGCCTC GACGCCCCTG GTGCATGACA TCGCCGCCAT CCAGGCGATC TATGGCGCGG ACATGACCAC GCGGACGGGC GACACGACCT ACGGCTTCAA CAGCAACGCC GGCCGCGATT CCTACGACTT CACCAAGACC CCGGCCCCGG TGATGGCCAT CTGGGACGCC GGGGGGAACG ACACCCTGGA CGCCTCGGGC TATCACACCG ACCAGATCAT CGACCTGACG CCCGGTTCGC TGAGCAGCAT CGGCGGCGTG ACCTACGACA CCGCGCCCTC GTTCGAGCAG GTCAACGCCA ACCGGGCGGC GGCCGGCTAC GCCCCGGTGG CCCTGGCGAC CTACACCTCG AACATGGCCC TGCTGGCGTC CAATCCGGTG GTGGGACGCC TGACCGACAA TGTCGCCATC GCCTATGGCG CGACGATCGA GAACGGGATC GGCGGCAGCG GCTCCGACCG ACTGATCGGC AACGCCGTCG GCAACACCCT GACCGGCAAT GCCGGGAACG ACACGCTGGA CGGCAAGGCC GGCGCCGACA TCCTCTACGG CGGCGACGGT AACGACGTGC TGGACGGAGG CGCGGGCTTC GACCGGATGA TCGGCGGAGC CGGCGACGAC CGCTACATCA TCGACACCCT TCACGACGCG ATCCGGGAAA TGCCGAACGA AGGCGTGGAT ACAGTCTGGA GCTCGGTCAA CTACACGCTC GTGCCGCATG TCGAGAACCT GCGACTGACC GGCGAGGCGA TGATCGGGAC GGGCAATGGC CTGGCCAACG TCATCGACGG CAACGACATC TCCAACAAGC TGTCCGGCGA GGCCGGAAAC GACACCCTGT CGGGCTTTGG CGGCGTCGAC TATCTCGAAG GCGGGGCGGG CAATGACAGC CTGTTCGGCG GCGACGGCAC CGACTTCCTG CTGGGCGGCG CCGGCGCCGA CCGGCTCAAT GGCGGGGCGG GGTTCGACAT CCTGATCGGC 
GGGGCCGGAA ACGACGTCTA CGCCTTCGAC GTCAGCGCCT TCCCGGACAG CGACGGCCTG TCGATCGACT TCGTCAGCGA TTTCGAGCGC GGCGACCTGT TCGACTTCAG CGCCCTGGAC GCCAACGCCA CGCTCGAGGG CGATCAGGCC TTCTCGCTGG TGTCGCAATG GGCGGCCAAT GGGGTCGGCC AAGTGATGGT CAGCACCTAT TCCAATGTGG TGAAGGCCGT CGGCGCCCTA GGCGGCGTCG ACCTGGGCGC CCTTGGCTGG GGCCTGGGCG GCGGCGTGTC GATCGTCTGG GGCGACCTCG ACGGCGACGG CCAATCAGAC TTCGCCGTGC TGGTGGCCAG CAACGCGCTG TCGACCGGCG GCTGGCTGCT CTGA
|
Protein sequence | MRLHDEHRDQ DLHRETDFPT FQGPTNTVSL AVPVAPVDAS VDGQGGEIRG KPVFTIEQAT EQLNRGGAGW TPGVVDHAVP RDGDVSVLNF GFHTAQSMFA EPYVYEEGGE LYGRTEYFGF AEFTEPQKAA AREAMASWDD LIAPTLVESA PEVADITFAN YTNRPGTQAY AYLPYDYTPD NADLIAGDVW VSANQPSNFQ LDEGLYGIHT LTHESGHALG LEHPGDYNAA PGVTITYADN AEYYQDSRAY SIMSYFAASE TGARHFDFNI STTVYASTPL VHDIAAIQAI YGADMTTRTG DTTYGFNSNA GRDSYDFTKT PAPVMAIWDA GGNDTLDASG YHTDQIIDLT PGSLSSIGGV TYDTAPSFEQ VNANRAAAGY APVALATYTS NMALLASNPV VGRLTDNVAI AYGATIENGI GGSGSDRLIG NAVGNTLTGN AGNDTLDGKA GADILYGGDG NDVLDGGAGF DRMIGGAGDD RYIIDTLHDA IREMPNEGVD TVWSSVNYTL VPHVENLRLT GEAMIGTGNG LANVIDGNDI SNKLSGEAGN DTLSGFGGVD YLEGGAGNDS LFGGDGTDFL LGGAGADRLN GGAGFDILIG GAGNDVYAFD VSAFPDSDGL SIDFVSDFER GDLFDFSALD ANATLEGDQA FSLVSQWAAN GVGQVMVSTY SNVVKAVGAL GGVDLGALGW GLGGGVSIVW GDLDGDGQSD FAVLVASNAL STGGWLL
|
| |
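The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 5279205
END_BP = 5281388
REPORTED_GENE_LENGTH = 2184      # bp
REPORTED_PROTEIN_LENGTH = 727    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)
print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values match the reported 2184 bp and 727 aa, which is consistent with the gene sequence ending in the stop codon TGA and the record marking the gene as not spliced.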