Gene Sare_2500 details


Gene Information

Locus tag: Sare_2500
Symbol:
ID: 5703950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2856961
End bp: 2858481
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 68%
IMG OID: 641271964
Product: peptidase M23B
Protein accession: YP_001537334
Protein GI: 159038081
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
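
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the last codon of the gene is an untranslated stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2856961, 2858481)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1521 506
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.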


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0374787
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAG CCGCACCTCG GGCAGGCCTC GCCGCGTTTC TCGCGGCCTG CCTGGTGGCC 
ACGGGTGTCC TCATGGGGGC GTTGCCCGCA CGGGCCGCAC CCGTCTTCAA GGCCCCGTTT
CCATGCGGTC AACAGTGGAC ATACAGCCAC CACAGCAGTG AGGTACGTCA GGCGCTGGAC
TTCGTCCGCG CCGACGGTGG CGTCACCGAC GGCACCCCCG TGCTCGCCTC GGCAGCCGGT
ACCGCCTACC GGCACTCGCA GCCCAGCGGC GCCGGCAACT ACGTCGTCAT CGATCACGGC
AGCGGGTGGC AGACCTACTA CTTCCACCTC TCCGCGTACT CGGTGGCAAC CGGCGCGCGC
GTCTCCCAGG GCCAGCAGAT CGGGCTGACC GGCAACACCG GCAACTCCTT CGGCGCACAC
ATCCACTACG AGCAGCTCTA CCACGGGGTG GGGCGGACCA TCGTGATCAA CGGTGTGTCG
CTCGCGCCGT ATCCCGGCTC GTACCACCAG CGCTACCTCA CCAGCGACAA CTGCGGCCGC
TTCGCCGAAC CCGTCGGAGG GCGGGCGTTC GGGTTCGGTA GTGGGCAGCA TTATGTGGCG
GTTGGTTCGG CGGGGACGTT GGCGAGTCTG TCGTGGTCGC CGGAGTCGGG GGTGGTGCAG
GCGGACTGGG GTGGTGGGGT GTTGACCGGT AGGGCGATGG GGTACGCGCA CGCGGGTCAG
CGGCATGTGT TCGCTCGGGG GCGGGATGAC ACGTTGCGGC ATTGGTGGTT CGGGCCGGGG
ATGAGCGTGC CGGGGTTGGA TGACTGGGAC ACGGTCGGGC GGGTGGTGTC GGATCCCACG
GGGTTTGCCT ACGGCAGTCA GCAGCATGTG TTCTACCGGA ACCCGGACGG GTGGTTGGAG
CATCGGTTCT ACGATCTCGA CACGGGTCGG GTTACGGGTG GGGTGTGGCC TGGTGGGAGG
TTCGTGGGGA ATCCGTTCGC GTTCGTGCAC CGCGATCAGC AGCACGTCTT CGGTCGTACG
GCCTCGGGTG GGTTGATCCA CTGGTACTGG TGGCCGGGGA TCAGTCCTGG TACGGACGAC
TGGGGTGTGC GCTCGGGTGT GGCCTCGGAT CCGACGGGTT TCTCCTACGG TGGCCAGCAC
CATGTGTTCT TCCGGGACAG TGACGGCGGT CTGGGGCACC GGTTCTTCGA CGACCTGTCG
GGTACCTTCG GCGGCGGTGT CTGGCCGGGT GCGGTGTTCG TGGGCAACCC GCACGCGTTC
GTGCACCGCG ATCAGCAGCA TGTCTTCGGT CGTACGGCCT CGGGTGACCT GGTGCACTGG
TACTGGTGGC CCGGGATCGA TCCGAGGGTG GACGACTGGG GTGCGCGTGG GGTGGTGACC
GGTGATCCCG CGGGCCTGTC CGCTGCGGGG CAGCATCACG TGTTCTACCG GCTCGGCGAC
GGCACCCTGG AACACCGCTT CGTCGACGAC ACCACCGGTC AGATCGTCAC CGACAACTGG
GGCGGATCAC TCGCACCATA A
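
The GC content field can be recomputed from the gene sequence as printed above. This is a minimal sketch with hypothetical helper names; it assumes the sequence is pasted as text in 10-base groups separated by spaces and line breaks, and that GC content means the fraction of G and C bases over the full length.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between 10-base groups."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Example on the first two groups of the gene sequence only.
fragment = clean_sequence("ATGACCAGAG CCGCACCTCG")
print(f"{gc_content(fragment):.0%}")
```

Applied to the full 1521 bp sequence, the rounded percentage can be compared with the GC content value reported in the Gene Information block.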
 
Protein sequence
MTRAAPRAGL AAFLAACLVA TGVLMGALPA RAAPVFKAPF PCGQQWTYSH HSSEVRQALD 
FVRADGGVTD GTPVLASAAG TAYRHSQPSG AGNYVVIDHG SGWQTYYFHL SAYSVATGAR
VSQGQQIGLT GNTGNSFGAH IHYEQLYHGV GRTIVINGVS LAPYPGSYHQ RYLTSDNCGR
FAEPVGGRAF GFGSGQHYVA VGSAGTLASL SWSPESGVVQ ADWGGGVLTG RAMGYAHAGQ
RHVFARGRDD TLRHWWFGPG MSVPGLDDWD TVGRVVSDPT GFAYGSQQHV FYRNPDGWLE
HRFYDLDTGR VTGGVWPGGR FVGNPFAFVH RDQQHVFGRT ASGGLIHWYW WPGISPGTDD
WGVRSGVASD PTGFSYGGQH HVFFRDSDGG LGHRFFDDLS GTFGGGVWPG AVFVGNPHAF
VHRDQQHVFG RTASGDLVHW YWWPGIDPRV DDWGARGVVT GDPAGLSAAG QHHVFYRLGD
GTLEHRFVDD TTGQIVTDNW GGSLAP
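
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), and it assumes the input contains only A, C, G, and T and starts in frame. The names `CODON_TABLE` and `translate` are chosen here for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGACCAGAG CCGCACCTCG G"))  # MTRAAPR
```

The output matches the first seven residues of the protein sequence listed above.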