Gene Ksed_01230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_01230 
Symbol 
ID8371636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp128666 
End bp129643 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content68% 
IMG OID644990433 
Productproline iminopeptidase 
Protein accessionYP_003147979 
Protein GI256824019 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones73 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCT CCGAAGAACC TGTTCATGCC GGGATGCTCG ATGTAGGCCG TGGTCAGCTC 
ATCTACTGGG AGGAGTGGGG AACAGCTCAG GGGGTCCCGG CTGTGTACGT GCACGGCGGC
CCCGGTGGGA CTCTCGGCAC CAGCGCCTAC CGAAGGCGGT TCGATCTCTC ACGGACTCGG
GTCATCGGGT TCGAGCAGCG CGGGTGCGGC CGCTCGACAC CACACGCCAG TGACCCCTCG
ACCTCGCTGC AGGACAACGA CACAGCTCAT CTGGTCGCGG ACATGGAGGT CCTGCGCGAG
CATCTGGGGA TCGAGCGGTG GATCGTCAAC GGCGTCTCGT GGGGATCGAC CCTCGCCCTT
GCCTATGCCG TCACGCATCC GGACAGGGTG CTGGGCGTCG TCCTGTTCGC CGTCACGACG
ACCAGCCGCT CCGAAGTCGA CTGGATCACC GAGGGTGTCG GCACCGTCTT CCCCGCGGCA
TGGGACCGTT TCGCTGCCCA CGCCGAACAG GCAGGGATCG GCTACCAACG TGGACAGGGC
CGCATCATCG ACGCCTACGC CCACCTCATG GAGTCAGACG ACCCCCGCGT GCGCAACGCA
GCTTCCCGGG AGTGGGCGCT GTGGGAGGAC ACCCACATCT CCATCGGTGC CGGTGGTTTC
CGCCGCGACC CCCGGTGGGA CGACGAGGCG TTCCGCATCG CGTTCTTCCG GCTGACCGCC
CACTACTGGT CCCACGACGG CTTCGTCCAG CCCCCGATCC TCGACCAGGG CGAGCGGCTG
GCGGGGATCC CGGCAACGCT CATCCACGGC CGCCGCGACA TCTCCAGCCC CGCCATCACG
CCGTGGCGCC TCCATCGAGC GTGGCCCGGA TCGCAGTTGA TCCTCGACGA AGGGGACGGC
CACGGAGGCG CCACGATGGT CGAGCACTGG CGGGCAGCGA ACGAGGCACT CGTCGCCCAG
CACTCACGGC CGCTCTAG
 
Protein sequence
MTASEEPVHA GMLDVGRGQL IYWEEWGTAQ GVPAVYVHGG PGGTLGTSAY RRRFDLSRTR 
VIGFEQRGCG RSTPHASDPS TSLQDNDTAH LVADMEVLRE HLGIERWIVN GVSWGSTLAL
AYAVTHPDRV LGVVLFAVTT TSRSEVDWIT EGVGTVFPAA WDRFAAHAEQ AGIGYQRGQG
RIIDAYAHLM ESDDPRVRNA ASREWALWED THISIGAGGF RRDPRWDDEA FRIAFFRLTA
HYWSHDGFVQ PPILDQGERL AGIPATLIHG RRDISSPAIT PWRLHRAWPG SQLILDEGDG
HGGATMVEHW RAANEALVAQ HSRPL