Gene Sare_3037 details


Gene Information

Locus tag: Sare_3037
Symbol:
ID: 5707239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3446002
End bp: 3447180
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 67%
IMG OID: 641272482
Product: peptidase M7 snapalysin
Protein accession: YP_001537850
Protein GI: 159038597
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5640] Secreted trypsin-like serine protease
TIGRFAM ID:
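
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminating stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3446002
end_bp = 3447180

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1179
print(protein_length)  # 392
```

Both results agree with the Gene Length and Protein Length fields listed in the record.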


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.872515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAGAA GGCAACTGTT GCGCGCGTTC GGCGCCGTGC TGGCGGCGGT GCTGGCAGCG 
GCCGGGGTGC AGATCGCCAC CGGTGCGCCG GCCACAGCGG CTCGCACGGT CTACTACGAC
GCGAGCCGGG CCGGCGAGTT CCGTACGAAC TTCGACCAAG CGGCCCAGAT TTGGAACAGT
TCGGTCAGCA ATGTACGGCT GGCTCCACGC AGTCCAGGCA ACGTCACCAT CTACGTCGAC
GGCGGCTGGC CGCGGGCGCA GGTCACCGGG CTCGGCTCTG GCCGGATCTG GATGGGATGG
ACCGCGGTCA ACCAGGGATA CGACCGTACC CGCATCGCCA GCCATGAGTT CGGCCACATC
CTCGGCCTAC CCGATCGGCG TACCGGGCTC TGCTCCGACC TGATGTCGGG CAGCAGTGCG
GCGGTCTCGT GCGACAACGC GTACCCCAGC AGCGCCGAGG CGTCCCGGGT CGACTCGCTG
TTCGCCGGCA GTCGCACGAC AAGTGTCACC GGTACGTTCA CCTGGGCTGA TGCCGATATC
ACGCCGTTCG TGGTCGGCGG CCGGCCGGCG ACCGAGAACT ACCCGTGGCT GGTCTACACC
TCTGGCTGCA CCGGTACGTT GATCAAGTCG GACTGGGTCG TCACGGCGCG GCACTGCCCG
ACACCGTCGT CGGTCCGTGT GGGTAGCGTC AACCGCACCA GCGGTGGCAC GGTCGTCGGG
GTCCGCCGCG CCGTCAGCAA CCCCACAATC GATGTCAAGC TGCTGCAACT GTCCAATGCG
GTCTCGTACG CCCCGGCCCC GATCCCGATG ACGTCCGGAG AGGTCGGTAC CGCTACCCGG
ATCATCGGCT GGGGTCTGAC CTGTCCGTTC CGGGGCTGCG GTTCGGCGCC GACGGTCGCA
CACGAGCTGG ACACGTCGAT CCTGTCGGAC AGTCGCTGCA TCGGCATCAA CGGCCCGTAC
GAGATCTGCA CCGACAACAC GAACGGTGAC TCGGGCGCCT GCTACGGCGA CTCGGGGGGC
CCGCAGGTTC GTCAGATCGG TGGGGTGTGG TATCTGGTCG GTGCCACCAG CCGGTCGGGC
AACAACCACC CGATCTGTGC CACCGGTCCA TCGATTTACG GTGACCTGAC GTCGATCCGT
TCCTGGATCG ACACCCGGGT CGGCGGCCTT CCCGCCTGA
 
Protein sequence
MVRRQLLRAF GAVLAAVLAA AGVQIATGAP ATAARTVYYD ASRAGEFRTN FDQAAQIWNS 
SVSNVRLAPR SPGNVTIYVD GGWPRAQVTG LGSGRIWMGW TAVNQGYDRT RIASHEFGHI
LGLPDRRTGL CSDLMSGSSA AVSCDNAYPS SAEASRVDSL FAGSRTTSVT GTFTWADADI
TPFVVGGRPA TENYPWLVYT SGCTGTLIKS DWVVTARHCP TPSSVRVGSV NRTSGGTVVG
VRRAVSNPTI DVKLLQLSNA VSYAPAPIPM TSGEVGTATR IIGWGLTCPF RGCGSAPTVA
HELDTSILSD SRCIGINGPY EICTDNTNGD SGACYGDSGG PQVRQIGGVW YLVGATSRSG
NNHPICATGP SIYGDLTSIR SWIDTRVGGL PA
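
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block can be cleaned, translated and scored for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for all non-start positions. Only the first row of the gene sequence is used here as input.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon.

    Caveat: alternative start codons (for example GTG or TTG) are not
    converted to M here. This gene starts with ATG, so that does not matter.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence, copied from the record.
first_row = "ATGGTCAGAA GGCAACTGTT GCGCGCGTTC GGCGCCGTGC TGGCGGCGGT GCTGGCAGCG"

print(translate(first_row))  # MVRRQLLRAFGAVLAAVLAA
print(round(gc_fraction(first_row), 2))
```

The translated first row matches the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.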