Gene Caci_1352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1352 
Symbol 
ID8332690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1539956 
End bp1540894 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content69% 
IMG OID644954500 
Productproline iminopeptidase 
Protein accessionYP_003112116 
Protein GI256390552 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.800954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCCTT ACGCCACCGG ATTTCTCGAC GTCGGCGACG GGAATCGGAT TTACTACGAA 
GAACTCGGCA ACCCCGACGG CAAGCCGGCG GTGAACCTGC ACGGCGGTCC CGGCAGCGGC
TCGATGAAGC GCCCGACCAA GGCCTGGGAC CCCGAGAAGT GGCGCGTCAT CCGCTTCGAC
CAGCGCGGGT GCGGGCTCAG TACGCCCAGC GCCGCCGACC CCGCGACGGA CATGTCCGTC
AACACCACGC AGCACCTCAT CCGCGACATC GAGCTTCTGC GCGAGCATCT GGGTATTGAG
AAGTGGCTGG TGAAGGGCGG CTCGTGGGGT GCCGCCCTGG CCCTGCTCTA CGCGCAGGCG
CACCCCGAGC GCGTCACCGA GATGATCATC CCGGCGGTCA CCACGACCCG TCCGGAGGAG
ACCGACTGGC TGTACCACGG CGCCCGCCGC CTGTTCCCCG AAGCCTGGGA CCGCTTCCGC
AACCATGTGC CGGAGGACGA GCGCGACGGA AACCTGCTCC TGGCATACGG ACGCCTGGTA
GCGAACCCGG ACCGCGCGGT GCGCGAAGCC GCGGCGGCGG AGTGGATGAG GTGGGAGGAC
ACCTTGATCT CCCAGGAATC CAACGGCAAG CCCGGCTCCT ACAGCGCGGT GGTCGACGAC
GACCGGGTGG CCTTCGTCCG CATCTGCGCG CACTACTTCG GCAGCGACGC CTGGCTGGAG
CCGGACCAGG TTCTGCGCAA CGTCGACAAG CTGCGCGGCA TCCCGGCGGT CCTCGTCCAC
GGCCGCCACG ATCTGGGCAG CCCGGTCTAC ACCGCCTGGG AGCTGGCGCA GGCGTGGCCG
GACGCGAAGC TGGTGATCAT CGAGGACTCC GGGCACACCG GCAGCGAGGC GATGGGGCAG
GCGCTCAACG AGGCGGCGGA GGAGTTCTCG AAGCGATAG
 
Protein sequence
MGPYATGFLD VGDGNRIYYE ELGNPDGKPA VNLHGGPGSG SMKRPTKAWD PEKWRVIRFD 
QRGCGLSTPS AADPATDMSV NTTQHLIRDI ELLREHLGIE KWLVKGGSWG AALALLYAQA
HPERVTEMII PAVTTTRPEE TDWLYHGARR LFPEAWDRFR NHVPEDERDG NLLLAYGRLV
ANPDRAVREA AAAEWMRWED TLISQESNGK PGSYSAVVDD DRVAFVRICA HYFGSDAWLE
PDQVLRNVDK LRGIPAVLVH GRHDLGSPVY TAWELAQAWP DAKLVIIEDS GHTGSEAMGQ
ALNEAAEEFS KR