Gene Sare_4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4387 
Symbol 
ID5706095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4960664 
End bp4961644 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content69% 
IMG OID641273807 
Productproline iminopeptidase 
Protein accessionYP_001539157 
Protein GI159039904 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000828974 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGTT TGCATCCCGA GACCGAACCG TTCGCGCAGG GCATGCTCGA TGTCGGAGAC 
GGCCACCTCG TCTACTGGGA GAGCTGCGGC AACCCGCTCG GCAAGCCGGC ACTGGTGTTG
CACGGCGGCC CAGGCTCGGG AGCCGGCGCC TACTGGCGGC GGTTCTTCGA CCCGGCGGTC
TACCGGGTGG TCCTGTTCGA CCAGCGGGGA TGCGGACGCA GCACTCCGGA CGCGGGCGAC
GTCCGCACCG ACCTGTCGAC CAACACCATG CCTCATCTGC TGGCCGATAT CGAAAGACTG
CGCGTCCACC TGAAGATCGA CCGATGGTTG CTCCTCGGCG GGTCATGGGG CAGCGCGCTC
GGTCTCGGCT ACGCCCAACG GCACCCCGAC CGGGTCACCG AGATCGTGCT GTTCAGCGTC
GTCACCAGTA CTCCGGCCGA GCATCAGTGG CTCACCCGCG ACCTGGGACG GATCTTCCCC
GAGCAGTGGG AACGCTTCCG CGACGCGGTG CCTGCGGCCG AACGCGACGG CAACCTGCCC
GCCGCATACG CCGGGATGCT GGCCGACCCG GACGAGACCG TGCGGGACCG GGCCGCGCGC
GCCTGGTGCG CCTGGGAGGA CGCACTCGTC TCCAACCTGC CCGGCAGTCG GCCCGACCCC
CGGTACGAGC ACCCGGCGTT CCGGGTGACC TTCACACGCC TGGTCTCCCA CTATTGGGCG
CACGACGGCT GGTTCGCCGA CGGCGAGCTG ATGGCCGGCG CACACCGGCT CACCGGAATT
CCCGGCGTAC TCGTTCACGG CCGGCTCGAC CTCGGCAGCC CCGTCGACAT CCCCTGGCAG
CTGTCCAAAC TCTGGCCTGA CGCACGGCTG AAGCTGATCG ACGACGCCGG CCACGGCACC
GGGCACGGCA TCGGCGACGC GGTCATCGAC GCCCTGGACT GTATTGGGGC CACCTACCGC
AGCTGCGAAG AGAATCGATA G
 
Protein sequence
MSRLHPETEP FAQGMLDVGD GHLVYWESCG NPLGKPALVL HGGPGSGAGA YWRRFFDPAV 
YRVVLFDQRG CGRSTPDAGD VRTDLSTNTM PHLLADIERL RVHLKIDRWL LLGGSWGSAL
GLGYAQRHPD RVTEIVLFSV VTSTPAEHQW LTRDLGRIFP EQWERFRDAV PAAERDGNLP
AAYAGMLADP DETVRDRAAR AWCAWEDALV SNLPGSRPDP RYEHPAFRVT FTRLVSHYWA
HDGWFADGEL MAGAHRLTGI PGVLVHGRLD LGSPVDIPWQ LSKLWPDARL KLIDDAGHGT
GHGIGDAVID ALDCIGATYR SCEENR