Gene Sare_4884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4884 
Symbol 
ID5707536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5535979 
End bp5538699 
Gene Length2721 bp 
Protein Length906 aa 
Translation table11 
GC content69% 
IMG OID641274279 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_001539624 
Protein GI159040371 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCTG CACGCATTCT TTCGTCCCGA GTGGGCAGGA TCGCGCTCTG GGCCGTGGCA 
GTCCTCACCA CGCTGACCCT CGTGCTCACC CTCGCGGCCG TGTGGACCGT ACGACGAGCG
TTCCCTCAGC ACGACGGCGC ACTCCGGCTG CCGGGCCTCA CCGCGCCGGT CACCGTGCAT
CGCGACGACC ACGGAATCCC GCAGGTGTAC GCGACGACCG CCGAGGACCT GTTCCGTGCG
CAGGGCTACC TGCACGCGCA GGACCGGTTC TGGGAGATGG ACTTTCGCCG CCATGTGACC
GGAGGTCGGC TCGCCGAACT GTTCGGTGAG AGCCAACTGG AGACCGACAT CTACCTGCGG
ACGATGGGCT GGCGACGGGT CGCCGAGCAG GAGTGGGACA TCCTCGCCGC GGACACGAAG
CGCTACCTGC AGGTGTACGC CGACGGTGTG AACGCCTGGC TCGACGAGCA CGACGGGGGT
CGGGCGAGTC TGGAATACGC GGTACTCGGC CTACAGAACT CGGACTACGA GATCGAGGCG
TGGCACCCGG TGGACAGTCT CGCCTGGCTC AAGGCGATGG CGTGGGACCT GCGGGGCAAC
ATGAGAGACG AGATCACCCG AGCGGCGCTA CTCGCCGAGG GCCTGACCCG CCAGCAGGTC
GAGGACCTGT ACCCGGCGTA CCCCTTCGAT CGGAACACAC CGATTGTCAC CGAGGGTGAC
ATCGTGGCCG GTGTCTTCGA TCAGGAAGCC CCGGTGTCCC CAGGCCGTGA GGGCAACGCG
GATCCCGGCG CGACCGACAC CGCCGGAGCG GGCGGTGGTG CGGTCACCGC GCAGCCGATC
GACGCGGTCG AGTCAACCGG CGACGCGGCT GCCGCGCGGA CGGTGACGGC GCTGGCCGGC
GGTTTGTCCC GGCTGCCGAC CGTGCTGGGC GCCGGCCGAG GCATCGGCTC CAACTCGTGG
GTCATCGACG GGGACCTGAC CGACACCGGC AAGCCGATCC TCGCCAACGA CCCGCATCTT
GGTCCCACCA TGCCTGGTAT CTGGTACCAG AACGGTCTGC ACTGCGAGTG CGAGTTCAAC
GTCACCGGCT TCAGCTTCTC CGGCCTGCCC GGGGTGGTCA TCGGCCACAA CTCCCGCATC
GCGTGGGGCC TCACCAACCT GAACCCGGAC GTGACCGATC TGTATCTGGA GCGGGTGAAC
GGCGACCGGG TTCAGGTCGA CGGAGAATGG CGGCCGCTGG AGACCCGGAC GGAAACCATC
AGAGTCGCGG GCGGTGAGGA CGTGACCATC ACCGTGCGGG CCTCCGGGCA TGGACCGCTG
GTGTCCGACG CGTCGGCCGA ACTGCGCGAC ATCGGCCTCG CGCCGCCGGT CGATCCGGCG
GGTTCGCCGG CGCCGGTCGC CGCCACGCCA CAGCTCGCCC CGGAGCCGTC GACCAACTCC
GAGGACGACC GGCGAGACGG CTACGCCATC GCGCTGAGCT GGACCGCGTT GCGTCCGGGT
CGGACCGCCG ACGCCATCTT CGCGCTCAAC ACCGCGCCGG GCTGGACCGA GTTCCGTGCC
GCGGCGGCGA TGTTCGAGGT GCCCGCGCAG AACCTCATCT ACGCCGACAC CGACGGCACC
ATCGGCTACC AGGCCCCCGG CCGGGTTCCG GTGCGCGGCA AGGGTGACGG GCGGTGGATG
GCGCCCGGTT GGGACTCGGC GTACGACTGG CAGGGTTTCA TCCCCTTCGA GGAGCTGCCC
AGTGTCCTCG ACCCGCCGGC CGGCTACCTG GTGACCGCCA ACCAGGCCGT CATCGGCCCC
TCGTACCCGC ACATGCTCAC CACGGACTGG GCCTACGGCT ACCGCAGCCA GCGTATTCAC
GAGTTGATCG AGTCAGCCCG GGACGCTGGA AAGATCACCG TCGCGGATGT ACAAACTATG
CAGTTCGACA ACCGCAACGG CTTCGCGCCG ACGCTAGTCC CGGCGGTCGA GACGGCTCTC
GCCGCCGGGG ATCCGTCCAG CCTGGCCCGG TCCGCCGCCG ACCTGTGGAG GGATTGGGAC
TACCAGCAGC CCGCCGAGGG CGAGCCGGAC ACCGACGACG GGCGCAGCTC CGCCGCGGCG
GCGTACTACA ACGCCGTCTG GCGGCACCTG CTCCTGGAAA CCTTCGACGA GTTGCCGGAG
GAGCACCGGC TCGACGGCAG CGACCACTCG TACGAGGTGG TGCGGGGTCT GCTTGGCCTA
CCCGGATCGC CGTGGTGGGA CCGGACGGAG ACCGAGGTCG TCGAGGGCCG CGACGACATT
CTGTGGGCGG CGGCCGAAGC GGCAGCGGGT GAACTGGCCC GCGATCAGGG CGACCAGCCG
GCCGAATGGC GCTGGGGCCG CATGCACACG TTGACCGTAC GGAACCAGTC CTTCGGCACC
TCGGGCATGG GGCTCGTCGA GTGGCTCTTC AACGCCGACC CCGTCGCCGT GTCGGGTGGC
GGTGCGATCG TGAACGCCAC GGGATGGAAC GCCGCCGCCG GGTACGAGGT CAACGCTGTC
CCGTCCATGC GGATGATCGT CGATCTGGCC GACCTGGACG CCTCCCGTTG GATCCAGTTG
ACCGGTAACT CCGGGCATGC GTTCCACCGC AACTACGACG ACCAGCTCGA GTTGTGGCGC
ACCGGCGAAA CCCTGCCGAT GCGCTGGGAG CGAGCCACGA TCGAGGCCGG GGCGGCACAG
ACGCTCACCC TGAAGCCGTA A
 
Protein sequence
MSPARILSSR VGRIALWAVA VLTTLTLVLT LAAVWTVRRA FPQHDGALRL PGLTAPVTVH 
RDDHGIPQVY ATTAEDLFRA QGYLHAQDRF WEMDFRRHVT GGRLAELFGE SQLETDIYLR
TMGWRRVAEQ EWDILAADTK RYLQVYADGV NAWLDEHDGG RASLEYAVLG LQNSDYEIEA
WHPVDSLAWL KAMAWDLRGN MRDEITRAAL LAEGLTRQQV EDLYPAYPFD RNTPIVTEGD
IVAGVFDQEA PVSPGREGNA DPGATDTAGA GGGAVTAQPI DAVESTGDAA AARTVTALAG
GLSRLPTVLG AGRGIGSNSW VIDGDLTDTG KPILANDPHL GPTMPGIWYQ NGLHCECEFN
VTGFSFSGLP GVVIGHNSRI AWGLTNLNPD VTDLYLERVN GDRVQVDGEW RPLETRTETI
RVAGGEDVTI TVRASGHGPL VSDASAELRD IGLAPPVDPA GSPAPVAATP QLAPEPSTNS
EDDRRDGYAI ALSWTALRPG RTADAIFALN TAPGWTEFRA AAAMFEVPAQ NLIYADTDGT
IGYQAPGRVP VRGKGDGRWM APGWDSAYDW QGFIPFEELP SVLDPPAGYL VTANQAVIGP
SYPHMLTTDW AYGYRSQRIH ELIESARDAG KITVADVQTM QFDNRNGFAP TLVPAVETAL
AAGDPSSLAR SAADLWRDWD YQQPAEGEPD TDDGRSSAAA AYYNAVWRHL LLETFDELPE
EHRLDGSDHS YEVVRGLLGL PGSPWWDRTE TEVVEGRDDI LWAAAEAAAG ELARDQGDQP
AEWRWGRMHT LTVRNQSFGT SGMGLVEWLF NADPVAVSGG GAIVNATGWN AAAGYEVNAV
PSMRMIVDLA DLDASRWIQL TGNSGHAFHR NYDDQLELWR TGETLPMRWE RATIEAGAAQ
TLTLKP