Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3734 |
Symbol | |
ID | 5901196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4046903 |
End bp | 4049827 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564257 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001685359 |
Protein GI | 167647696 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.13373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCG TTTCCCGCCT GGCGCTCGTT TTCTCGGCCG GGCTGAGTCT TTCCGCCTGC GCGACCCTGT CACACTTGAA GCCCGGCGGC GGCGATCGTC CGGTCACCGC CAAGGCCCCG GCTCCCGCCC AGGCTCCCCG CGCGCCCGTC GCGTCCAACG TCCCGACGAT CGACGCGGTG AAGCCCGGCC AGTGGCCGCA GGCGGTGTCG GACGTCGCGC CCGACCCGCG CGTCCGCTTC GGCGTCCTGC CCAACGGCCT GCGCTACGCG ATCCAGAAGA ACGCCACCCC GCCCGGCCAG GCCGCCCTGC GCCTGTGGTT CGACGCCGGC TCGCTGGACG AGACCGACGA GCAGCAGGGC CTGGCTCACT TTCTGGAGCA CATGGCCTTC AATGGCTCGA AGAACGTGCC CGAAGGCGAG ATGACCAAGA TCCTCGAGCG CCACGGCCTG GCGTTCGGCG CCGACACCAA CGCCTCGACC AATTTCGGCG CCACGACCTA TCAGCTCGAC CTGCCCAAGA CCGACGACGA CACAGTCGAC AGCGCCATGA TGCTGCTGCG CGAAGCGGCG GGCGAGCTGA CCATCGCCCA GGACGCGGTG GACCGCGAGC GCGGGGTGGT GCTGTCGGAG GAGCGCACCC GCGACAGCCC CGGCTACCGG GTGTTCGTCA ACACCTTCGG CTTCCAGCTG GAGGGCCAGC GCCCGCCCAA GCGCCTGCCG ATCGGCAAGA CCGAGATCCT CAAGACCGCC CCCGCCCAGC GGATCCGCGA CTTCTACCAG GCCTGGTACC GGCCCGAGAA CGCGGTGTTC GTCGCGGTCG GCGACTTCGA CGTCGACGCC ATGGAGGCCC GGATCAAGGC CCGGTTCGGC GACTGGAAGG GCCAGGGTCA GCCCGGCGTG AAGCCGGATC TGGGTCCGGT GGCCAAGCGT GGCCTGACCG CCAAGGTGCT GGTCGAGCCG GGCGCGCAGA CCTCGGTGCA GATGTCGTGG ATCGCGCCGC CCGACCTGGA ACTGGAGACC CGGGCCAAGG ACGCCCAGGA ACTGGTCAAG GCCCTGGGTT TCGCCGTGCT GAACCGCCGG CTGCAGGTGC TGACCCGGTC CGACGCGCCG CCGTTCATCG CCGCCGTGGC CCTGCAGAAC GACCAGGAGC ACGCCGCCCA GATCACCACC CTGGCCGCCA CCGTCCAGCC TGGCGGCTGG AAAGAGGCGC TGACCGTCTT CGACCAGGAG CAGCGCCGCG TCGTGCAATA CGGCGTGCGC CAGGATGAGC TGGATCGCGA GATCGCCGCG ATGCGCGCGG GCTTCGTGGC CGCCGCGGCC GGAGAGGCCA CCCAGCGCAC CACCGCCCTG GCCGGCGCCA TCGTCGGCAC GCTGGACGAC AAGGAGATCG TCACCAGCCC GTCCCAGAAC CTGGCGGTGT TCGACGAGGC GACCAAGGGC CTGACCGCCG ACAAGGTCTC GGCCGTGCTG AAGGCCCAGT TCGTCGGCCA GGGCCCGCTG ATCACCATCC CCACCCCGAC GGCCATCGAG GGCGCGGAGA AGACCGTGAC GGAGGCCTAT CTGGCCTCGG GCAAGACGCC GGTCGCCGCG CCCGCCGCGC CGGGAACCCT GAACTGGCCC TATGCCAGCT TCGGCCCGAT CGGCAAGGTC GCCGAGCAGC GCGACGTCAC CGACCTGGAC ACCGTCTTCG TGCGCTTCGC CAACGGCGTG CGGCTGACCG TCAAGCCGAC CAAGTTCCGC GACGACCAGA TCCTGGTGAA GGCCCGGATC GGCCATGGCC TGCTCGACCT GCCGGCCAAC ACCCAGAGCC CGATGTGGGC CGGCTCGGCC TATATCGAGG GCGGCCTCAA GCAGATCAGC ACCCAGGACA TGGAACGCGT GCTGAACGGC AAGGTCTGGA ACGCCGGCCT GGGGGTCGAG GATGACGCCT TCAGCTTGTC GGGTCGCACC CGCCCAGAGG ACTTCGGCAC GGAACTCCAG GTCCTGGCCG CCTACGCCAC CGAAGGCGGC TGGCGCCCCG AGGCCTATAC GCGGATCAAG ACCTACTACG GCACGATCCA CGATCAGCTG GAGTCCACCC CCAGCGGCGT CATGGGACGC GACCTGGGCG GATTGCTGCA TGGCGGCGAT GGGCGCTGGA CCTTCCCGAC CCGCCAGCAG ATCGCCGCCG CCTCGCTCGA CCAACTGAAG GCCTCGGTCT CCGGACCGCT GACCAGCGAC TCCATCGAGG TGGTGATCGT CGGCGACATC ACGGTCGACA AGGCCATCGC CGGCGTGGCC GAGACCTTCG GGGCGCTGCC CACCCGCGCC GACACGCCCG TCCCGGCCGG CGCCGCCAAC GCGCCCTTCC CCGCCCCGTC GCCGACCCCG GTGGTGCGCA CCCACAAGGG CCGCGCCGAC CAGGGCCAGC TGTTCATGGC GTGGAAGACC GACGACCTGT TCGCCAACCT GCAGCGGGCC CGCGACACCC AGGTCCTGGC CCAGGTGATG CAACTGCGGC TGACCGACGA ACTGCGCGAG AAGCAGGGCG CCACCTATTC GCCGTCGGCC TCGGCCGCGG CCAGCGTGGC GTTCAACCAC TGGGGCTATC TGGCGGTCAG CGTCGAGACC CCGCCCGACA AGATTGACGG CGTCATGGCC AGCATCCGCC AGATCGCCGC CGACCTACGC GACAAGCCGA TCAGCGAGGA CGAGCTGGAC CGGGCCAAGA AGCCGCGCAT CGACCAGATC GAGAAGGCCC GCGAGACCAA CGAGTACTGG CTGGGAACCC TGTCGGGCGC CCAGACCGAT CCGCGCCTGA TCGACGCCAC CCGCTCGGTG ATCGCCGGCC TGCAACGCGT GTCGGCCGCC GACGTCCAGA AGGCGGCCAA GGATTTCCTC GGCGACGACA AGTCCTGGAC GATGATCGTG CGGCCGGAGA AATAA
|
Protein sequence | MIRVSRLALV FSAGLSLSAC ATLSHLKPGG GDRPVTAKAP APAQAPRAPV ASNVPTIDAV KPGQWPQAVS DVAPDPRVRF GVLPNGLRYA IQKNATPPGQ AALRLWFDAG SLDETDEQQG LAHFLEHMAF NGSKNVPEGE MTKILERHGL AFGADTNAST NFGATTYQLD LPKTDDDTVD SAMMLLREAA GELTIAQDAV DRERGVVLSE ERTRDSPGYR VFVNTFGFQL EGQRPPKRLP IGKTEILKTA PAQRIRDFYQ AWYRPENAVF VAVGDFDVDA MEARIKARFG DWKGQGQPGV KPDLGPVAKR GLTAKVLVEP GAQTSVQMSW IAPPDLELET RAKDAQELVK ALGFAVLNRR LQVLTRSDAP PFIAAVALQN DQEHAAQITT LAATVQPGGW KEALTVFDQE QRRVVQYGVR QDELDREIAA MRAGFVAAAA GEATQRTTAL AGAIVGTLDD KEIVTSPSQN LAVFDEATKG LTADKVSAVL KAQFVGQGPL ITIPTPTAIE GAEKTVTEAY LASGKTPVAA PAAPGTLNWP YASFGPIGKV AEQRDVTDLD TVFVRFANGV RLTVKPTKFR DDQILVKARI GHGLLDLPAN TQSPMWAGSA YIEGGLKQIS TQDMERVLNG KVWNAGLGVE DDAFSLSGRT RPEDFGTELQ VLAAYATEGG WRPEAYTRIK TYYGTIHDQL ESTPSGVMGR DLGGLLHGGD GRWTFPTRQQ IAAASLDQLK ASVSGPLTSD SIEVVIVGDI TVDKAIAGVA ETFGALPTRA DTPVPAGAAN APFPAPSPTP VVRTHKGRAD QGQLFMAWKT DDLFANLQRA RDTQVLAQVM QLRLTDELRE KQGATYSPSA SAAASVAFNH WGYLAVSVET PPDKIDGVMA SIRQIAADLR DKPISEDELD RAKKPRIDQI EKARETNEYW LGTLSGAQTD PRLIDATRSV IAGLQRVSAA DVQKAAKDFL GDDKSWTMIV RPEK
|
| |