Gene Dshi_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3551 
Symbolpip 
ID5713782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3734063 
End bp3735049 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID641269480 
Productproline iminopeptidase 
Protein accessionYP_001534885 
Protein GI159046091 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGAG CGGCGAGCCA AAAACACGCA GCGGAGTATC TCTACCCGCC GCTCGATCCC 
TACGACCAGC GCGTGCTGCC GGTCTCCGGC GGGCACCGGA TCTATGTGGA GCAATGCGGC
AATCCGCAAG GCATCCCCGT GGTGGTCCTG CATGGCGGCC CCGGCGGCGG CTGCAGCCCG
GCCATGCGGC GCTATTTCGA CCCCGATACC TACCGGATCG TGCTCTTCGA CCAGCGTGGC
TGCGGCCGCT CCCGCCCCCA TGCCTCGGTG GAGCAGAACA CCACCTGGGA CCTCGTGGAC
GACATCGAGG CGATTCGCAC CACCCTGGAG ATCGACGCCT GGGACGTGTT CGGCGGCAGC
TGGGGCGCGA CCCTCGCGCT GATCTACGGA CAGACCCATC CCGACCGCGT TACCCACTTG
ATTTTGCGGG GCGTTTTCCT GATGACCGAC GCCGAGCTCG ACTGGTTCTA TGGCGGCGGC
GCGGCGCAGT TCTGGCCCGA TGTGTGGAAA CGCTTCGTCA ACCTGATCCC CGAGGAAGAG
CGCGGCGACC TGATCGCGGC CTATAACAAA CGGCTTTTCA GCGGTAACAT GATGGAAGAG
ACCCGCTATG CCCGCGCCTG GTCGGCCTGG GAAAACGCGC TGGCCTCGAT CCATTCCGAG
GGGCTGACCG GCGAGAGCCC GGCAGAATAC GCCCGCGCCT TCGCCCGGCT GGAGAACCAT
TATTTCCTCA ACAAGGGGTT CCTCGACGAG GATGGCCAGA TCCTGCGCGA CCTGCCCCGG
CTTGCGGATG TGCCGATTAC CATCGTGCAG GGGCGCTTCG ACATGATCTG CCCGCCCGCG
GGCGCCTGGC AGATCGCCGA GGCGCTGCCG CAGACCGACC TGCGGATGAT CCCGCTTGCC
GGGCACGCCT TGTCGGAATC CGGCATCAGC GCCGAGCTGG TGCGGGTGAT GGACCGGCTG
CGCTATGGAC GGCGCCCGTC CAACTGA
 
Protein sequence
MTRAASQKHA AEYLYPPLDP YDQRVLPVSG GHRIYVEQCG NPQGIPVVVL HGGPGGGCSP 
AMRRYFDPDT YRIVLFDQRG CGRSRPHASV EQNTTWDLVD DIEAIRTTLE IDAWDVFGGS
WGATLALIYG QTHPDRVTHL ILRGVFLMTD AELDWFYGGG AAQFWPDVWK RFVNLIPEEE
RGDLIAAYNK RLFSGNMMEE TRYARAWSAW ENALASIHSE GLTGESPAEY ARAFARLENH
YFLNKGFLDE DGQILRDLPR LADVPITIVQ GRFDMICPPA GAWQIAEALP QTDLRMIPLA
GHALSESGIS AELVRVMDRL RYGRRPSN