Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4965 |
Symbol | |
ID | 8329163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 5925378 |
End bp | 5926304 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 644945405 |
Product | proline iminopeptidase |
Protein accession | YP_003102637 |
Protein GI | 256378977 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.328745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCACCCGG TGACCGAACC ACACACGAGC GGGCGGCTGG CGGTCGGCGA CGGCCACGAG CTGCACTGGC AGATCCACGG GAACCCGACC GGGAAACCGG TGGTGGTCCT GCACGGCGGG CCGGGGTCGG GCAGCCGCGC CAGGGCCACC AGGCTGCTCG ACCCGGCGGT GTACCGGGTG GTGCTCTTCG ACCAGCGCGG CTGCGGGCGC TCGACGCCGC ACGCGGGCGA GCCGGAGGTC GACCTGTCCA CCAACACCAC GGACCACCTG GTGGCGGACC TGGAACTGCT GCGCGCGTCC CTGGACGTCG AGCGCTGGCT GGTGCTCGGG GGGTCCTGGG GCGCGGTGCT CGGGCTGGTC TACGCGCAGC GGTACCCGGA GCGGGTGACC GGGCTGGTGC TCGCGGGCGT GGCGACCGGG CGGCGGGCGG AGACGGACCT GCTGACGCGT GGGCTCGGGG AGGTGTTCCC GCGGGCGTGG CAGGAGTTCA GCGACTTCGT CGGCGCGCCC GACGGCGACC TCTCGGCGGC CTACCTGGAG CGGCTGGTCG ACCCGGACCC ACTGGTGCAC CTCGCGGCGG CGGACGCCTG GTGCGCGTGG GAGGAGGCGA TGCTGCCGCA GACGCCGGGG TCGTTGGAGG ACGTGGTGGG GCGGGACCGG CTGGCGTTCG CGCGGCTGGT GGCGCACTAC TGGGCGCACG GGAGCTGGTT GCGGGAGAAC GAGGTCCTGG ACGGCTGCGA CCGGCTGGCC GGGGTGCCGG GGATCGTGGT GCAGGGCGAG CTTGACCTGA TCAACCTGGT CGGGACGCCG TGGCTGCTGG ACCGGGCCTG GACGGCCGGC GAGCTGGTGG TGGTGCGGGA GACCGCGCAC GGCGGGTCGG CGGCGCTGAG CGAGGCGTGG AAGGCGGCGG CGGACAGCTT CCGGTGA
|
Protein sequence | MHPVTEPHTS GRLAVGDGHE LHWQIHGNPT GKPVVVLHGG PGSGSRARAT RLLDPAVYRV VLFDQRGCGR STPHAGEPEV DLSTNTTDHL VADLELLRAS LDVERWLVLG GSWGAVLGLV YAQRYPERVT GLVLAGVATG RRAETDLLTR GLGEVFPRAW QEFSDFVGAP DGDLSAAYLE RLVDPDPLVH LAAADAWCAW EEAMLPQTPG SLEDVVGRDR LAFARLVAHY WAHGSWLREN EVLDGCDRLA GVPGIVVQGE LDLINLVGTP WLLDRAWTAG ELVVVRETAH GGSAALSEAW KAAADSFR
|
| |