Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4358 |
Symbol | |
ID | 5901819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4736427 |
End bp | 4737512 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564876 |
Product | proline iminopeptidase |
Protein accession | YP_001685976 |
Protein GI | 167648313 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0513684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.622811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTTT CCAGACTCCA GGCCACCAGC AAGGTCACGA GCGAGTGGAG GTACCCGCAA GCGGAGCCTA ACCGGACCGG AATGCTGAAG GTCGATCAGG CGCCTGACCA CACGCTCTAC TGGGAGGAAT ACGGCGCGCC AGACGGCGAG CCGGTGATGT TCCTGCATGG CGGACCGGGC GGAGCCTGCG CGCCGGTGAT GGCGCGGTTC TTCGACCCGG CGCGCTACCG CGTGATCCTG TTCGACCAGC GCGGCTGCGG CAAGAGCACG CCGACCGTGG CCTCGCATGG CCCGGCCGTC GCCCTGGTCC GCAACGACAC CGACCACCTA GTGGCCGACA TCAACCGCCT GCGCGAGGCG CTGAACATCA CCGGCAAGAT GCACGTGTTC GGCGGCAGCT GGGGCAGCAC CCTGGCGCTC GTCTACGCCA TCCGCCATCC CGAGCACGTC GCCTCGCTGA TCCTGCGCGG CATCTTCCTG GGGACGCGCG AGGACCTGCT TTACATGTAC CAGGGCAACG CGGCGGTGTT CGACAAGACG CCCTACGCCC TGAGCGAGCC GGGCGCCTAT GTGACCTATC CCGACGAGTG GAAGGCCTTC GTCGAGGTCA TACCGGCCGA CAGGCGCGGC GACATAATGG GCGCCTACAA GGCGATCTTC GACGGCAAGC CCGACGACGC GGCCGGGCGC GAGGCCCAAC TTCAAGCCGC CCTGGCCTGG TCGGTCTGGG AGGGCGCGAT CTCGAACATG ATCCCCGAGC AGGGCGATCC GGGGAAGTTC GGCGAGGCCG ATTTCGCCCT GTGCTTCGCC CAGATCGAGG CCCACTTCTT CGCCAACAAT CTGTTCCTGG AGCCGGACGA GATCACGCGC GACATCGCGC GGATCGCCAA GCTGCCGATC CACATCGTCC ACGGCCGCTT CGACCAGGTT TGCCCGCTGA CCCAGGCCTC GCGTCTGGTG GCGGCCCTAG CGGCGGTGGG CGCTACGCCG GCTAGCTATG TGCGCACCAA CGCCGGCCAC AGCGCCATGG AGGCTCAGAC GGTGCTGGCC CTGACGGCGA TCATGGACGG GTTGCCGAGG CTCTGA
|
Protein sequence | MDFSRLQATS KVTSEWRYPQ AEPNRTGMLK VDQAPDHTLY WEEYGAPDGE PVMFLHGGPG GACAPVMARF FDPARYRVIL FDQRGCGKST PTVASHGPAV ALVRNDTDHL VADINRLREA LNITGKMHVF GGSWGSTLAL VYAIRHPEHV ASLILRGIFL GTREDLLYMY QGNAAVFDKT PYALSEPGAY VTYPDEWKAF VEVIPADRRG DIMGAYKAIF DGKPDDAAGR EAQLQAALAW SVWEGAISNM IPEQGDPGKF GEADFALCFA QIEAHFFANN LFLEPDEITR DIARIAKLPI HIVHGRFDQV CPLTQASRLV AALAAVGATP ASYVRTNAGH SAMEAQTVLA LTAIMDGLPR L
|
| |