Gene Sare_3948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3948 
Symbol 
ID5708219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4490888 
End bp4492177 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content70% 
IMG OID641273373 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001538729 
Protein GI159039476 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.425388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.429143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACA GCGGTTTTCC CTGGCCCATC GAGACGACCC GACTGGACAA CGGCCTGCGC 
GTGGTGGTGA GCGAGGACCG CACCGCCCCG GCCGTGGCGG TGAACCTCTG GTACGACGTC
GGCTCCCGGC ACGAACCGGA GGGTCAGACC GGCTTCGCCC ACCTCTTCGA GCACCTGATG
TTCGAAGGCT CGGTCAACGT GGCGAAGACC GAGCACATGA AGCTGGTGCA GGGATGCGGT
GGGTCACTCA ACGCCACCAC CAACCCAGAC CGCACCAACT ACTTCGAGAC AGTCCCCGCC
GAGCACCTCG AACTGGCGCT CTGGCTCGAG GCCGACCGCA TGGGCGGGCT GGTGCCGGCG
TTGACTCAGG AGACGCTGGA CAACCAGCGG GACGTGGTCA AGAACGAGCG GCGGCAGCGC
TACGAGAACG TCCCGTACGG CGACGCGTGG CTGCGACTGC TGCCACTGCT CTACCCGCCC
CGCCACCCGT ACCACCACGC GACGATCGGC TCGATGGCCG ACCTGAACGC CGCTGACCTC
GCCACCTTCC AGGCCTTCCA CACCGCGTAC TACGCGCCGA ACAACGCGGT CCTGACGGTG
GTCGGCGACA CCTCCGCCGT CGAGGTGTTC GCCCTGGCAG AAAAGTACTT CGGCGCGATC
CCGCCCCGAT CGGACATCCC AGCCGCGCCG GACGGCCGGC ACGTCTCGAA CACCGATGCG
GCGACGACGG AGACGGTCGT CACCGACGTG CCCGCGCCCC GGGTGTACGT CGCGCACCGC
ACCCACCCGT TCGGCACCCC CGGCTACGAC GTGACCACCG TGCTCGCCAC CGTCCTCGGC
AGCGGGCGGG GCAGCCGGCT CTACCAACGG CTCGCCGACG GTGAGCGGAT CGCACAGCCG
GACCTGGTCG GCGCGTACGG AGTGGACCTG ACGTACGCCC CGGCGCCGTT GATCGCCACC
GCCACCGCCC GCCCCGGAGT GCCCGCCGAA CAGTTGGCCG CCGGGTTGGG CGAGGTCATG
GACGAACTGG CCACGGTGCC GGTCACCGCC GCCGAGTTGG ACCGGGCCAA GGCACTGCTC
AGCACCGCCT GGTGGCGGCA GATGTCCACG GTGGAGGGCC GTGCCGACAC CCTCGGCCGG
TATGCGACAC AGTTCGGCGA CCCGCGGCGG GCGGCCGAAC GGCTGCCGGC GCGGCTGGCG
GTGACCGCCG AGCAGATCGC GGCGGTGGCC GCCGAGGTGC TCGTCACCAC CGACCGGGTG
ATCCTGACCT ACCTGCCCGA GGAGAAATGA
 
Protein sequence
MPDSGFPWPI ETTRLDNGLR VVVSEDRTAP AVAVNLWYDV GSRHEPEGQT GFAHLFEHLM 
FEGSVNVAKT EHMKLVQGCG GSLNATTNPD RTNYFETVPA EHLELALWLE ADRMGGLVPA
LTQETLDNQR DVVKNERRQR YENVPYGDAW LRLLPLLYPP RHPYHHATIG SMADLNAADL
ATFQAFHTAY YAPNNAVLTV VGDTSAVEVF ALAEKYFGAI PPRSDIPAAP DGRHVSNTDA
ATTETVVTDV PAPRVYVAHR THPFGTPGYD VTTVLATVLG SGRGSRLYQR LADGERIAQP
DLVGAYGVDL TYAPAPLIAT ATARPGVPAE QLAAGLGEVM DELATVPVTA AELDRAKALL
STAWWRQMST VEGRADTLGR YATQFGDPRR AAERLPARLA VTAEQIAAVA AEVLVTTDRV
ILTYLPEEK