Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3551 |
Symbol | pip |
ID | 5713782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3734063 |
End bp | 3735049 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269480 |
Product | proline iminopeptidase |
Protein accession | YP_001534885 |
Protein GI | 159046091 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGAG CGGCGAGCCA AAAACACGCA GCGGAGTATC TCTACCCGCC GCTCGATCCC TACGACCAGC GCGTGCTGCC GGTCTCCGGC GGGCACCGGA TCTATGTGGA GCAATGCGGC AATCCGCAAG GCATCCCCGT GGTGGTCCTG CATGGCGGCC CCGGCGGCGG CTGCAGCCCG GCCATGCGGC GCTATTTCGA CCCCGATACC TACCGGATCG TGCTCTTCGA CCAGCGTGGC TGCGGCCGCT CCCGCCCCCA TGCCTCGGTG GAGCAGAACA CCACCTGGGA CCTCGTGGAC GACATCGAGG CGATTCGCAC CACCCTGGAG ATCGACGCCT GGGACGTGTT CGGCGGCAGC TGGGGCGCGA CCCTCGCGCT GATCTACGGA CAGACCCATC CCGACCGCGT TACCCACTTG ATTTTGCGGG GCGTTTTCCT GATGACCGAC GCCGAGCTCG ACTGGTTCTA TGGCGGCGGC GCGGCGCAGT TCTGGCCCGA TGTGTGGAAA CGCTTCGTCA ACCTGATCCC CGAGGAAGAG CGCGGCGACC TGATCGCGGC CTATAACAAA CGGCTTTTCA GCGGTAACAT GATGGAAGAG ACCCGCTATG CCCGCGCCTG GTCGGCCTGG GAAAACGCGC TGGCCTCGAT CCATTCCGAG GGGCTGACCG GCGAGAGCCC GGCAGAATAC GCCCGCGCCT TCGCCCGGCT GGAGAACCAT TATTTCCTCA ACAAGGGGTT CCTCGACGAG GATGGCCAGA TCCTGCGCGA CCTGCCCCGG CTTGCGGATG TGCCGATTAC CATCGTGCAG GGGCGCTTCG ACATGATCTG CCCGCCCGCG GGCGCCTGGC AGATCGCCGA GGCGCTGCCG CAGACCGACC TGCGGATGAT CCCGCTTGCC GGGCACGCCT TGTCGGAATC CGGCATCAGC GCCGAGCTGG TGCGGGTGAT GGACCGGCTG CGCTATGGAC GGCGCCCGTC CAACTGA
|
Protein sequence | MTRAASQKHA AEYLYPPLDP YDQRVLPVSG GHRIYVEQCG NPQGIPVVVL HGGPGGGCSP AMRRYFDPDT YRIVLFDQRG CGRSRPHASV EQNTTWDLVD DIEAIRTTLE IDAWDVFGGS WGATLALIYG QTHPDRVTHL ILRGVFLMTD AELDWFYGGG AAQFWPDVWK RFVNLIPEEE RGDLIAAYNK RLFSGNMMEE TRYARAWSAW ENALASIHSE GLTGESPAEY ARAFARLENH YFLNKGFLDE DGQILRDLPR LADVPITIVQ GRFDMICPPA GAWQIAEALP QTDLRMIPLA GHALSESGIS AELVRVMDRL RYGRRPSN
|
| |