Gene Information |
Locus tag | Caul_1598 |
Symbol | |
ID | 5899053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1687785 |
End bp | 1688774 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562085 |
Product | proline iminopeptidase |
Protein accession | YP_001683225 |
Protein GI | 167645562 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
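
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. The function name `expected_lengths` is hypothetical. It assumes the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the terminal stop codon
    return gene_length, protein_length


# Start and end coordinates copied from the record for Caul_1598.
gene_len, protein_len = expected_lengths(1687785, 1688774)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the listed values of 990 bp and 329 aa. The strand does not affect this calculation, since it only changes the reading direction.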
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGTA ACGCTTCCGC CTCCGTCATG TCGTCCAGCG GCCGCCGCGG CCTGTTTCGC GACATCGAGC CCTTCTCGTT CGGCTGGCTG GGCACGGACG GCCCGCACGA AATCTACTAC GAGGAATGCG GCGCGCCGCG CGGCAAGCCG GCCGTGATCC TGCACGGCGG CCCGGGCGGG GCGGTCAATC CGACCATGCG GCGGTTCTTC GATCCCGGCA AATGGCGCAT GGCCCTGTTC GACCAGCGCG GCTGCGGCCG CTCGCGTCCC AACGCCAGTC TCGACGACAA CACCACCTGG AGCCTGATCG CCGACATCGA GCGGCTGCGC GAGCATCTGG GGATCGAGAA GTGGACCGTG TTCGGTGGTT CCTGGGGCTC GACCCTGGCC CTGGCCTACG CCCTCACCCA TCCCGACCGG GTCGAGGGGC TGGTGCTGCG CGGCGTCTTC CTGCTGACCC AGAAGGAGCT GCGCTGGTTC TACCAGGACG GCGCTTCCAT GCTGTTCCCC GACGCCTGGG AGCGGTTCCT GGCCCCGATC CCCGAGGATG AGCGGGGCGA CCTGGTGAGC GCCTATCACC GGCGCCTGAC CCACCCCGAC CGCCGGATCC AGGCCGAGGC GGCCGGCGCC TGGAGCCAGT GGGAGGGCGA CACCATTTCG CTGCGCGGTC CCGAAGCCCG CCCCCCGAAG TTCAACGAGG AAGACTTCGC CATCGCCTTC GCGCGGATCG AATGCCACTT CTTCGCCAAC CGGGGCTTCT TCGAGGAAGA CGGCTGGATC CTGAAGAACA TCGACAGGAT CCGCCACATC CCCGCCTGGA TCGTCCAGGG CCGCTTCGAC GTGGTCACCC CGCTGGACAG CGCCTGGTCG CTGCACAAGG CCTGGCCCGA GGCCAGGTTC GAGATCGTCT GGGACGCCGG GCACGCCTCG ACCGAGCCGG GGATCATCGA CGGACTGGTG CGGGCGACGG ACGCCGCGCT GGGGGGCTAG
|
Protein sequence | MDRNASASVM SSSGRRGLFR DIEPFSFGWL GTDGPHEIYY EECGAPRGKP AVILHGGPGG AVNPTMRRFF DPGKWRMALF DQRGCGRSRP NASLDDNTTW SLIADIERLR EHLGIEKWTV FGGSWGSTLA LAYALTHPDR VEGLVLRGVF LLTQKELRWF YQDGASMLFP DAWERFLAPI PEDERGDLVS AYHRRLTHPD RRIQAEAAGA WSQWEGDTIS LRGPEARPPK FNEEDFAIAF ARIECHFFAN RGFFEEDGWI LKNIDRIRHI PAWIVQGRFD VVTPLDSAWS LHKAWPEARF EIVWDAGHAS TEPGIIDGLV RATDAALGG
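
The gene and protein sequences above are printed in blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal Python sketch, not part of the original record, of computing GC content and translating the coding sequence. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAG) and uses the codon assignments shared by the standard code and translation table 11. It does not model alternative start codons.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: amino-acid assignments of the standard code, which
# translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGATCGTA ACGCTTCCGC CTCCGTCATG"
print(translate(first_blocks))  # MDRNASASVM
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way allows the GC content and protein sequence fields to be checked.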