Gene Sare_4738 details


Gene Information

Locus tag: Sare_4738
Symbol:
ID: 5704563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5361836
End bp: 5363002
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 69%
IMG OID: 641274136
Product: peptidase M23B
Protein accession: YP_001539482
Protein GI: 159040229
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
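
The coordinate and length fields above are linked by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5361836, 5363002)
print(length, protein_length(length))  # 1167 388
```

Under these assumptions the computed values agree with the Gene Length (1167 bp) and Protein Length (388 aa) fields.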


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000208513
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGAG AAGCTGAGCC GAAGCGCCGT CGACTGCGTG CGGGTGCGAT CGCTGGCGTG 
CTCACCGCGG CGTTGGCCCT GCTCTGCTGC GCCGGCGGTG CGGGAGCCTT CTTCCTCACC
GAACTCGGAG GCGGCGACGA CCGGCTACGC CTTGCCAACC AGAACTGCAC CGGCGACTTC
CGCGTCGAGA TCACCGGGGA GATGCCCCGG ATGTCGGAGT ACGGTGAGGT CCAGCTCCGC
AACGCCGCGC GGATCATCAA GGTCGGGCAG GAGCTGCAGA TACCACCCCG CGGCTGGGTA
ATCGCCGTGG CGACCGCCAT GCAGGAGTCC CGTCTACGCA ACCTCGCCAA CCCCACCGTG
GCCGGGTCGG AGCAGCTTCC GAACGAAGGC GTCGGCTCGG ACCACGACTC GGTGGGACTG
TTCCAGCAGC GGGCGAGCTG GGGCACGGTC GAGCAGCGGA TGACTCCGGA GTACGCGGCC
CGCAGGTTCT ACGAGAAGTT GCGTGGGGTG CTCAACTGGG AGCAGCTACC GCTGACCCGA
GCCGCGCAGG CCGTACAGAT CAGCGCCTTT CCGGATGCGT ACGCCAAGCA CGAGGCGCTG
GCGTCAACGA TCGTCAACGC GCTGGCCGGC GGCGCCGCCC GCACCGTGCC CCTCACCGAC
GGGCACGTCT GCGACGCGGC GGAGGATGGC CTGATCGCCG CCTCCGGCTG GACCGCCCCG
ATCCCCGGTG ACGTCGGCTC CGGATTCCGC ACCGAGAAGC GGCCGGCACA CCACGGGGTG
GACATCGCCG CACGGAAGGG TATCGATATT CGCGCCGCGT CCAGCGGTCG AGTCCTGGTC
GCCCGTTGCG ACCCCGATCG GGCCGGGCAG CTGAGCTGCG ATGTGGACGG TTGGCCGGGC
AAGGGTGGCT GCGGATGGTT CGTCGACATT CTCCACGCTG GGAAGATCAT CACCCGCTAT
TGCCACATGG CGCACAAACC TCAGGTCAGC GTGGGCCAGA CGGTGCGGGC CGGTGAGATC
ATCGGTGTGA TCGGCAGCAG CGGCAATTCG TCCGGACCGC ACCTGCACTT CGAGGTGCAC
ACCGACGGTG ACCGGAGCAG CGACGGCGCG ATCGACCCGG TACGGTTCAT GCGGGAGCAG
GGTGCACCGC TGCGAAGCGT GGAGTGA
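
The GC content field in the gene information can in principle be recomputed from the gene sequence above. A minimal sketch, assuming the sequence is supplied as a string in the same space-separated 10-base blocks; `gc_content` is a hypothetical helper name, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence: 12 of 20 bases are G or C.
print(gc_content("ATGAGCGGAG AAGCTGAGCC"))  # 60.0
```

Passing the complete gene sequence and rounding the result to a whole percent would give a figure comparable to the reported GC content.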
 
Protein sequence
MSGEAEPKRR RLRAGAIAGV LTAALALLCC AGGAGAFFLT ELGGGDDRLR LANQNCTGDF 
RVEITGEMPR MSEYGEVQLR NAARIIKVGQ ELQIPPRGWV IAVATAMQES RLRNLANPTV
AGSEQLPNEG VGSDHDSVGL FQQRASWGTV EQRMTPEYAA RRFYEKLRGV LNWEQLPLTR
AAQAVQISAF PDAYAKHEAL ASTIVNALAG GAARTVPLTD GHVCDAAEDG LIAASGWTAP
IPGDVGSGFR TEKRPAHHGV DIAARKGIDI RAASSGRVLV ARCDPDRAGQ LSCDVDGWPG
KGGCGWFVDI LHAGKIITRY CHMAHKPQVS VGQTVRAGEI IGVIGSSGNS SGPHLHFEVH
TDGDRSSDGA IDPVRFMREQ GAPLRSVE
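
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The following is a sketch of that mapping, not the database's own method: it builds the codon table from the standard NCBI amino-acid string for table 11 (bases ordered T, C, A, G), stops at the first stop codon, and does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGAGCGGAG AAGCTGAGCC GAAGCGCCGT CGACTGCGTG CGGGTGCGAT CGCTGGCGTG"
print(translate(first_line))  # MSGEAEPKRRRLRAGAIAGV
```

The output matches the first 20 residues of the protein sequence; translating the full gene sequence the same way allows the remaining residues to be compared.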