Gene Information |
Locus tag | Ksed_01230 |
Symbol | |
ID | 8371636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | - |
Start bp | 128666 |
End bp | 129643 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644990433 |
Product | proline iminopeptidase |
Protein accession | YP_003147979 |
Protein GI | 256824019 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
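The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is encoded, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 128666, End bp 129643.
length_bp = gene_length(128666, 129643)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length (978 bp) and Protein Length (325 aa) fields of the record.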
| |
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCCT CCGAAGAACC TGTTCATGCC GGGATGCTCG ATGTAGGCCG TGGTCAGCTC ATCTACTGGG AGGAGTGGGG AACAGCTCAG GGGGTCCCGG CTGTGTACGT GCACGGCGGC CCCGGTGGGA CTCTCGGCAC CAGCGCCTAC CGAAGGCGGT TCGATCTCTC ACGGACTCGG GTCATCGGGT TCGAGCAGCG CGGGTGCGGC CGCTCGACAC CACACGCCAG TGACCCCTCG ACCTCGCTGC AGGACAACGA CACAGCTCAT CTGGTCGCGG ACATGGAGGT CCTGCGCGAG CATCTGGGGA TCGAGCGGTG GATCGTCAAC GGCGTCTCGT GGGGATCGAC CCTCGCCCTT GCCTATGCCG TCACGCATCC GGACAGGGTG CTGGGCGTCG TCCTGTTCGC CGTCACGACG ACCAGCCGCT CCGAAGTCGA CTGGATCACC GAGGGTGTCG GCACCGTCTT CCCCGCGGCA TGGGACCGTT TCGCTGCCCA CGCCGAACAG GCAGGGATCG GCTACCAACG TGGACAGGGC CGCATCATCG ACGCCTACGC CCACCTCATG GAGTCAGACG ACCCCCGCGT GCGCAACGCA GCTTCCCGGG AGTGGGCGCT GTGGGAGGAC ACCCACATCT CCATCGGTGC CGGTGGTTTC CGCCGCGACC CCCGGTGGGA CGACGAGGCG TTCCGCATCG CGTTCTTCCG GCTGACCGCC CACTACTGGT CCCACGACGG CTTCGTCCAG CCCCCGATCC TCGACCAGGG CGAGCGGCTG GCGGGGATCC CGGCAACGCT CATCCACGGC CGCCGCGACA TCTCCAGCCC CGCCATCACG CCGTGGCGCC TCCATCGAGC GTGGCCCGGA TCGCAGTTGA TCCTCGACGA AGGGGACGGC CACGGAGGCG CCACGATGGT CGAGCACTGG CGGGCAGCGA ACGAGGCACT CGTCGCCCAG CACTCACGGC CGCTCTAG
Protein sequence | MTASEEPVHA GMLDVGRGQL IYWEEWGTAQ GVPAVYVHGG PGGTLGTSAY RRRFDLSRTR VIGFEQRGCG RSTPHASDPS TSLQDNDTAH LVADMEVLRE HLGIERWIVN GVSWGSTLAL AYAVTHPDRV LGVVLFAVTT TSRSEVDWIT EGVGTVFPAA WDRFAAHAEQ AGIGYQRGQG RIIDAYAHLM ESDDPRVRNA ASREWALWED THISIGAGGF RRDPRWDDEA FRIAFFRLTA HYWSHDGFVQ PPILDQGERL AGIPATLIHG RRDISSPAIT PWRLHRAWPG SQLILDEGDG HGGATMVEHW RAANEALVAQ HSRPL
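Both sequences are displayed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The following is a minimal sketch of that cleanup plus two simple checks (GC percentage and presence of a terminal stop codon). The helper names are hypothetical, and the stop codon set is the standard one used by translation table 11.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks and the final block of the gene sequence from the record.
head = clean_sequence("ATGACAGCCT CCGAAGAACC")
tail = clean_sequence("CGCTCTAG")
print(len(head), gc_percent(head), ends_with_stop(tail))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field. The short fragments here only illustrate the calls.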