Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4387 |
Symbol | |
ID | 5706095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4960664 |
End bp | 4961644 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273807 |
Product | proline iminopeptidase |
Protein accession | YP_001539157 |
Protein GI | 159039904 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000828974 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCGTT TGCATCCCGA GACCGAACCG TTCGCGCAGG GCATGCTCGA TGTCGGAGAC GGCCACCTCG TCTACTGGGA GAGCTGCGGC AACCCGCTCG GCAAGCCGGC ACTGGTGTTG CACGGCGGCC CAGGCTCGGG AGCCGGCGCC TACTGGCGGC GGTTCTTCGA CCCGGCGGTC TACCGGGTGG TCCTGTTCGA CCAGCGGGGA TGCGGACGCA GCACTCCGGA CGCGGGCGAC GTCCGCACCG ACCTGTCGAC CAACACCATG CCTCATCTGC TGGCCGATAT CGAAAGACTG CGCGTCCACC TGAAGATCGA CCGATGGTTG CTCCTCGGCG GGTCATGGGG CAGCGCGCTC GGTCTCGGCT ACGCCCAACG GCACCCCGAC CGGGTCACCG AGATCGTGCT GTTCAGCGTC GTCACCAGTA CTCCGGCCGA GCATCAGTGG CTCACCCGCG ACCTGGGACG GATCTTCCCC GAGCAGTGGG AACGCTTCCG CGACGCGGTG CCTGCGGCCG AACGCGACGG CAACCTGCCC GCCGCATACG CCGGGATGCT GGCCGACCCG GACGAGACCG TGCGGGACCG GGCCGCGCGC GCCTGGTGCG CCTGGGAGGA CGCACTCGTC TCCAACCTGC CCGGCAGTCG GCCCGACCCC CGGTACGAGC ACCCGGCGTT CCGGGTGACC TTCACACGCC TGGTCTCCCA CTATTGGGCG CACGACGGCT GGTTCGCCGA CGGCGAGCTG ATGGCCGGCG CACACCGGCT CACCGGAATT CCCGGCGTAC TCGTTCACGG CCGGCTCGAC CTCGGCAGCC CCGTCGACAT CCCCTGGCAG CTGTCCAAAC TCTGGCCTGA CGCACGGCTG AAGCTGATCG ACGACGCCGG CCACGGCACC GGGCACGGCA TCGGCGACGC GGTCATCGAC GCCCTGGACT GTATTGGGGC CACCTACCGC AGCTGCGAAG AGAATCGATA G
|
Protein sequence | MSRLHPETEP FAQGMLDVGD GHLVYWESCG NPLGKPALVL HGGPGSGAGA YWRRFFDPAV YRVVLFDQRG CGRSTPDAGD VRTDLSTNTM PHLLADIERL RVHLKIDRWL LLGGSWGSAL GLGYAQRHPD RVTEIVLFSV VTSTPAEHQW LTRDLGRIFP EQWERFRDAV PAAERDGNLP AAYAGMLADP DETVRDRAAR AWCAWEDALV SNLPGSRPDP RYEHPAFRVT FTRLVSHYWA HDGWFADGEL MAGAHRLTGI PGVLVHGRLD LGSPVDIPWQ LSKLWPDARL KLIDDAGHGT GHGIGDAVID ALDCIGATYR SCEENR
|
| |