Gene Caci_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3212 
Symbol 
ID8334565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3540011 
End bp3543322 
Gene Length3312 bp 
Protein Length1103 aa 
Translation table11 
GC content68% 
IMG OID644956357 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_003113960 
Protein GI256392396 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0203581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.139028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCC GTGCCCGAAC GATGGCCGCG GCGGTGACCG CGCTGGTAGT CGGCGGGGCT 
CTCGCGGTGC CCGCGGTGAC GGCCTCGGCC GCCTCCGCGG GAGCGGCCAG GGCGAAGGCG
GCCACGATCT CCGCCTCCGG CGCCGCCGAC TACTGCCTGG GTCAGTGCAA TGACATCCTG
CCGCCCGGCG AGAACGGCAG CGCCACGACC GCGCAGATCC TGTGGTTCAA GGCCACCGGC
AACCGCCCGG CGAACACCGA CGACCAGCTC GGCAAGTACG CCGGCCTGGT CGACAGCTAC
ACCGGCCTGA CCAACGGGAA CCTGGGGAAC TTCTTCAACA GCTCCTCCTT CGGCGTCCCG
GCCAACCAGG TCGCCAGCAC CCTGAACCCG GGCGGCCGCA GCGACGTCAC CATCACCCGC
GACAAGCAGG ACATCCCGCA CATCTACGGG ACGACCCGCT CCGGCGGCGA GTACGGCGCG
GGCTACGCCG CGGGCCAGGA CCGGCTGTGG ATGATGGACG TCTTCCGGCA CGTCGGGCGC
GGTGAACTCT CCGGCTTCGC CGGCGGCGCG GCCGGCAACC GCCAGCTGGA GCAGCAGTTC
TACCTCAACG GCGCCTACAC CGAACCCGAT CTGCAGGCGC AGGTCGACCG GGTGAAGAAC
AGCGGGACGC GCGGCGCGCA GGCCTACCAG GACATGACGG ACTACCTGGC CGGGCTCAAC
CAGTACATCG CCGACGTGAA GGCCGGCGAC GACTTCCCCG GCGAGTACGA CCTCACCGGC
AACGCCAACA TCCTCACCGG CGACGGCATC CAGAACTTCC AGCCCACCGA CCTGGTCGCG
ATCGGCTCGG TCGTCGGCGC GCTGTTCGGT GCCGGCGGCG GCAACCAGGT CGCCTCCGCG
CTGGTCAAGG AAGCCGCCGA GGCCAAGTAC GGGACCGCGC AGGGCGACCA GATCTGGAAC
GCCTTCCGCG AGGAGAACGA CCCCGAGGCG AACCTGACGC TGCACAACGG GCAGTCGTTC
CCGTTCAACG GGAGCCCGGC GAACCCCTCC GGCGTGGCGA TGCCCGACGC GGGATCGGTG
ACGCCGCAAC AGGTCGTCTT CGACCCCACC GGTTCGGCGG CTTCAGGCGC GGCGGCTTCG
AATGCGACGG CCTCGAACGC CAAGACATCG ACGGTTCCGA CGACCGCGAC CGGCTCGGCC
GCGGCCAAGT CCGCGACGTC CGCCGCCAAC CTCAAGGCGA ATCCGGCGCT GGGCAAGGAA
ACCAACGCGA TGGCCAACGG CGTGTTGCCG GCCGGGCTGT TCAAGACCAA GCGCGGTATG
TCCAACGCGC TGGTGGTCTC GGGTCAATAC ACTGACACCG GGAACCCGGT CGCCGTCTTC
GGTCCGCAGA CCGGCTACTT CGCCCCGCAG CTGCTCATGC TGGAGGAGAT TCAGGCTCCT
GGTATCAGTG CCCGTGGAGC CGCCTTCGCC GGTCTGAACT TCTACGTCGA GCTCGGCCGC
GGCGCCGACT ACTCCTGGAG CGCCACCTCG GCGGGCCAGG ACATCATCGA CACCTACGCC
GTGACGCTGT GCAACACCGA CGGATCGCCG GCGACCAAGA ACTCCAACGC CTACCTGTAC
AACGGCGTCT GCACGCCGAT GCAGCAGATC GAGCGTGACG ACTCCTGGAG CCCGACCATC
GCCGACAGCA CACCGGCGGG GTCGTACAAA CTCATCGCGT TCCGGACCAA CTTCGGCATC
GTGCAATCGC GCGCCACGAT CGGCGGCAAA CCGGTCGCCT ACACCTCGTT GCGCTCCACG
TACCAACACG AGGTCGATAC CATCGTCGGC TTCCAGATGT TCAACGACCC GAGCGTGGTG
ACCGGTCCGG CCGGCTTCCA GCAGGCGGCG TCGAACATCA CCTACACGTT CAACTGGTTC
TACGTCGACT CCCAGCACAC GGCGTACTAC AACTCGGGTT TGAACCCGAC GCGCCCGGCC
AACGACGACC CGAACCTGCC GATCACCGCC GACGCCGCGC ACCAGTGGCT GAACTGGGAT
CCGAGCACGA ACACGGTGGC CAACACCGCG TTCTCGGCGC ATCCGAACTC CGTGGACCAG
GACTACTACG AGTCCTGGAA CAACAAGATC GCCCAGAACT ACACCGTCTC GGGCTTCGGC
GACGGCTCGA TCTACCGCAG CAATCTGCTC GACGAGCGCA TCAAGGGTCT GATCACCTCC
GGGACCAAGG TCACCCGCGC CTCGCTGACC AAGGCGATGG AGGACGCGGC CGTCACCGAC
CTGCGCGGTG AGGAGCTGCT GCCGAAACTG CTGCAGGTGA TCGGCACCCC GACCGATCCG
ACGCAGGCTG CCGCCGTGAA CGAGCTGAAA ACCTGGCTCG CCGACGGCAC CAAGCGCAAG
GAGAGCGCGG CCGGCAACAA GACCTACGCC GACTCCGACG CGATCCGCAT CATGGACGCC
TGGTGGCCGC TGCTGGTGCA GGCCGAGTTC CAGCCCGGTA TGGGCGCGAG CCTGTACTCG
GCGATGACCG GCGTGCTCTC GGTGGACGAC TCGCCGTACG GCGGCTCGGA AGCCGGCGTG
TCGCACAAGG GCTCCTCGTT CCAGTCCGGC TGGTACTCCT ACGTCGACAA GGATCTGCGC
TCCGTGCTCG GCCAGCCGGT TTCCGGCGGC CTGACGCAGA CCTTCTGCGG CGGCGGCAAC
CTGGCACAGT GCCGCACGGC GCTGCTGTCC GCGCTGTCCA CCGCGGCGGC GACCCCGGCG
ACCAGCGTCT ACCCGGCCGA CTCGGTCTGC TCCGCCGGCG ACCAGTGGTG CGCCGACTCC
ATCGAGCAGG ACCCGCTGGG CGGCATCACC GACGCGCAGA GCAACTGGCA GAACCGGCCG
ACGTTCCAGC AGGTCGTGCA GTACCCCTCG CACCGCGGTG TGAACGCCGC CGACCTGGCG
ACGCAGGGCG CTGCGACAGC CTCCAGCGCA CAGTCCGGCT ACCCGGCGCA GAACGCCGTC
ACCGGCAACG GCGCCAACCG CTGGGCCAGC AACTGGGACG ACAACGAGTG GCTGCAGGTG
GACCTCGGCT CGGTGAAGCA GGTCGGACGC GCGATCCTGA ACTGGGAGAC CGCTTACGGC
AAGGCGTACG ACATCCAGGT CTCCACCGAC GGCAAGACCT GGCGCACCGT CTACGCCACG
ACCACCGGCG ACGGCGGCCA GGACGTGGAC AGCTTCCCGG CGACCCAGGC GCGCTATGTG
CGGATGCAGG GAGTCCAACG CGCCACCGGT TGGGGTTACT CTCTGTACCA GTTCCAGGTC
TACGCCCAGT AG
 
Protein sequence
MHIRARTMAA AVTALVVGGA LAVPAVTASA ASAGAARAKA ATISASGAAD YCLGQCNDIL 
PPGENGSATT AQILWFKATG NRPANTDDQL GKYAGLVDSY TGLTNGNLGN FFNSSSFGVP
ANQVASTLNP GGRSDVTITR DKQDIPHIYG TTRSGGEYGA GYAAGQDRLW MMDVFRHVGR
GELSGFAGGA AGNRQLEQQF YLNGAYTEPD LQAQVDRVKN SGTRGAQAYQ DMTDYLAGLN
QYIADVKAGD DFPGEYDLTG NANILTGDGI QNFQPTDLVA IGSVVGALFG AGGGNQVASA
LVKEAAEAKY GTAQGDQIWN AFREENDPEA NLTLHNGQSF PFNGSPANPS GVAMPDAGSV
TPQQVVFDPT GSAASGAAAS NATASNAKTS TVPTTATGSA AAKSATSAAN LKANPALGKE
TNAMANGVLP AGLFKTKRGM SNALVVSGQY TDTGNPVAVF GPQTGYFAPQ LLMLEEIQAP
GISARGAAFA GLNFYVELGR GADYSWSATS AGQDIIDTYA VTLCNTDGSP ATKNSNAYLY
NGVCTPMQQI ERDDSWSPTI ADSTPAGSYK LIAFRTNFGI VQSRATIGGK PVAYTSLRST
YQHEVDTIVG FQMFNDPSVV TGPAGFQQAA SNITYTFNWF YVDSQHTAYY NSGLNPTRPA
NDDPNLPITA DAAHQWLNWD PSTNTVANTA FSAHPNSVDQ DYYESWNNKI AQNYTVSGFG
DGSIYRSNLL DERIKGLITS GTKVTRASLT KAMEDAAVTD LRGEELLPKL LQVIGTPTDP
TQAAAVNELK TWLADGTKRK ESAAGNKTYA DSDAIRIMDA WWPLLVQAEF QPGMGASLYS
AMTGVLSVDD SPYGGSEAGV SHKGSSFQSG WYSYVDKDLR SVLGQPVSGG LTQTFCGGGN
LAQCRTALLS ALSTAAATPA TSVYPADSVC SAGDQWCADS IEQDPLGGIT DAQSNWQNRP
TFQQVVQYPS HRGVNAADLA TQGAATASSA QSGYPAQNAV TGNGANRWAS NWDDNEWLQV
DLGSVKQVGR AILNWETAYG KAYDIQVSTD GKTWRTVYAT TTGDGGQDVD SFPATQARYV
RMQGVQRATG WGYSLYQFQV YAQ