Gene Sare_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2063 
Symbol 
ID5703274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2361397 
End bp2363736 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content69% 
IMG OID641271550 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_001536921 
Protein GI159037668 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.530766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0449106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC GCCGACTCGC CGCCGCAGCC ACCGTCGTCG GACTGGTCGC CGCCGGCCTC 
ACCGTCACCG GCGGTACCGC CGTGGCACAC GACCATGGCT ACTCGGCCTT GATCCAGCGC
GCGTCGTACG GCGTACCACA CATCACCGCC CGCGATTTCG CCAGCCTCGG ATTCGGTGCC
GGCTACGCCC AAGCCGAGGA CAACATCTGC CTGATCGCCG AGCGGATGGT GACCGTTCGC
GCCCAACGGT CCCGCTGGTT CGGCACGGAA GCCAGCAACG TCAGCAGCGA CATCTTCCAT
CAGAAGGCCA TCGACGATCA GGTCGCCGAG CGGCTGCTCG CCGGCCCCCG CGACGGCGTC
CGCGCACCAT CCGCACAGGT GCGTGACCAG ATCCGCGGCT TCGTTGCCGG GTACAACGCC
TACCTCCTCG ACGCGAAGAC GATTACCGAC CCGGCCTGCG CCGGGAAGGA ATGGGTACGC
CCGATCAGCG AGCTGGACAT GTGGCGGGCG TACTGGGCCG AGATGGTCCG GGCCAGCTCT
GGCGCCCTCG CCGACGGTAT CGTCGCCGCC ACCCCACCTA CCGCCAACGG AAGCAACGTG
CCGGCACGCG CCCCGCGGGC GGAAGCGGTC GTCACTGCCT TGGATGGTGC GCCGGCCGGG
TTGGGCAGCA ACGCGTATGG CCTGGGGCGG GACGGCACCG CCAGCGGCGC CGGCATATTG
TTGGCCAACC CGCACTTCCC GTGGGACGGT GCCGAGCGCT TCTACCGGAT GCATCTGAAG
GTGCCCGGCC GGTACGACGT CGAGGGCGCG GCACTGGTTG GTGACCCATT CATCCAGATC
GGCCACAACG GCCAGATTGC CTGGAGTCAC ACCGTCTCCA CCGCCCGCCG ATTTGTCTGG
CACCGGCTGG CCCTGGTGCC CGGCGATCCG ACCAGCTACC TCTACGACGG TCGAGCCCGA
CAGATGACCG CCCGTACGGT CACCGTCCAG ACTCCCGCCG GTCCGGTCAG CCGCACCCTC
TACGACACCC ACTTCGGCCC GGTCGTCGTG GTGCCCGGTC ACTTCGATTG GACCACCACC
ACCGCGTATG CGATCACCGA CGCCAACGCC ACCAACAACC GCGCGCTCGA TGGCTGGCTG
GCCATGGGAC GGGCCCGGTC CGTGAGCGAG TTGCGGGCGG TGTTGGACCG GCGGCAGTTC
CTGCCCTGGG TCAACGTCAT CGCCGCTGAC CGCGGCGGCC GGGCACTCTA CGCCGACCAC
TCCGTCGTGC CCCGGGTGAC CGACTCCCTG GCCGCCGCCT GCATCCCGGC CCCCTTCCAA
TCCCTGTACG CGAGCAGTGG TCAGGCCGTC CTCGACGGAT CCCGCTCATC GTGCGAGTTG
GGCCGTGATC CGGACGCAGT GGTACCAGGC ATTCTCGGCC CGGCCAACCT GCCCACCCTG
GTCCGCGGTG ACTATGTAAC CAACTCCAAC GACAGCTACT GGCTGGCCCA TCCGGAGCAG
CCGCTGGAGG GCTACCCGCG CGTCGTCGGC GATGAGCGGA CCCAGCGCAG CCTGCGGACT
CGGCTCGGCG TGCATCAGGT ACGGCAGCGC CTCGCCGGCG CCGATGGGCT TCCTGGGAAA
GGCTTCACCA CCAGCAACCT GTGGGAGGTG ATGCTCGGCA ACCGGGCCTA CGGTGGCGAA
CTGGTCCGCG ACGACCTGGT CCGGATCTGC GAGGCACAGC CGGCGGCGAC CACCTCCGAC
GGTGCCACCG TCGACCTGAC CGCCGCCTGT GCCGCCCTGC GCGGTTGGGA CCTTCGAACC
GAGCTGGACA GTCGGGGCGC ACACCTATTC ACCGAGTTCG CCCTGGCCGG CGGCCTGCGC
TTCGCCGACG CGTTCGACCC GACCGCACCG CTGACGACGC CGAGCCGGCT CGCTGTTGAC
GACCCGGGGA TACGCACCGC TCTCGCGGAC GCCGTCCAGA AGCTGGCCGA CATCCCGCTC
GATGCTCGCC TCGGAGACGT CCAGACCGAG CCGCGTGGTG CCGAGGACAT CCCGATCCAC
GGTGGTCGTC CCGAAGCCGG CGTCTTCAAC ATGATCATCG GTGGGTTCGA GTCCGGCGTC
GGCTATCCCA AGGTCCATCA CGGCACGTCG TTCCTGATGG CCGTCGAGCT GGGTGGCAAC
GGGCCGTCGG GCCGGCAGGT CCTGACCTAC TCGCAGTCGG CCAACCCGAA CTCGCCCTGG
TACGCCGATC AGACCCGGCT GTATTCAGGT AAGGGTTGGG ACACCATCAA ATTCACGCAG
CGACAGCTCC GAGCCGATCC GAACCTGACC ACGTATCGGG TGGGAAAGCA GCGCCGTTGA
 
Protein sequence
MSFRRLAAAA TVVGLVAAGL TVTGGTAVAH DHGYSALIQR ASYGVPHITA RDFASLGFGA 
GYAQAEDNIC LIAERMVTVR AQRSRWFGTE ASNVSSDIFH QKAIDDQVAE RLLAGPRDGV
RAPSAQVRDQ IRGFVAGYNA YLLDAKTITD PACAGKEWVR PISELDMWRA YWAEMVRASS
GALADGIVAA TPPTANGSNV PARAPRAEAV VTALDGAPAG LGSNAYGLGR DGTASGAGIL
LANPHFPWDG AERFYRMHLK VPGRYDVEGA ALVGDPFIQI GHNGQIAWSH TVSTARRFVW
HRLALVPGDP TSYLYDGRAR QMTARTVTVQ TPAGPVSRTL YDTHFGPVVV VPGHFDWTTT
TAYAITDANA TNNRALDGWL AMGRARSVSE LRAVLDRRQF LPWVNVIAAD RGGRALYADH
SVVPRVTDSL AAACIPAPFQ SLYASSGQAV LDGSRSSCEL GRDPDAVVPG ILGPANLPTL
VRGDYVTNSN DSYWLAHPEQ PLEGYPRVVG DERTQRSLRT RLGVHQVRQR LAGADGLPGK
GFTTSNLWEV MLGNRAYGGE LVRDDLVRIC EAQPAATTSD GATVDLTAAC AALRGWDLRT
ELDSRGAHLF TEFALAGGLR FADAFDPTAP LTTPSRLAVD DPGIRTALAD AVQKLADIPL
DARLGDVQTE PRGAEDIPIH GGRPEAGVFN MIIGGFESGV GYPKVHHGTS FLMAVELGGN
GPSGRQVLTY SQSANPNSPW YADQTRLYSG KGWDTIKFTQ RQLRADPNLT TYRVGKQRR