Gene Sare_3266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3266 
Symbol 
ID5707553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3757996 
End bp3759174 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content69% 
IMG OID641272693 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001538060 
Protein GI159038807 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0356463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCGT CCCCGGAACG CATCAGCCGA TTCGCCGCCC GCCGCCGGGC GCGGCTGGAG 
GGCCTCCCGA AGCTGGCACA CACCGACGCT GGCGGCGGCT CCCGCGCGTG GTTCGTCGCC
GACGAGTTGC TGGTCGTCGA CGACAGTCGC CGGGAGGTTG AGCGCTATCT CGGACGTGCC
CGGGCCGCAC AGCCCGGTGC CGGCGACGAG GAACTGATGC CGGGCCTGCG TCGCTACCGG
GCGCCGGGAC TCGACGTGCC GACTGCGGTC CGCGCGCTGC GATCGGATCG TCCGGCTGGC
CGCCAGACGG TCTGCCCCAA CCATGTCTTC CTGTCCAGTC CGTTCAATCA GGGAGGTCCG
TTCGGGCCGC CGGCTCCCAC CGCCGCAGCG ACGCTCAAGA CGCCCGCCGA AACCGACCGG
GTCGCGGTTT CCATCGTCGA CACCGGATTC TGGACCGACA CCCCCCTTCC GGTCGACTAT
CTCGCATCGG ACGGCGTCGA GGTGGAAACG GAAACCGACG TCGACAACGA CGGGTTGCTC
GACGGTGATG TCGGCCACGC CAACTTTATC GGTGGCGTGA TCGCGAACCA CACGGACCGG
GCAGTGCTGC GGGTTGTCCG GACGTTGGAC ACCTTCGGCG TCTGCACCGA GGACGAGCTC
ATCGCCTCCC TTGGACGGGT GCACCCGGAC ACCAAGGTGA TCAACCTGTC GCTCGGCGGT
TACACCGCCG ACGGAACCGC GCCGCTCGGC GTACAGGCCG CGTTGCAGCA GGCCCTGTCC
GGGCTCGACC GAGTGGTGGT CGCGGCCGCT GGCAATGACG GCAACCGCAG TGACCCGTTC
TGGCCCGCGG CGTTCGCCGG TGCCGGCGAG TCGTGGAGTG GACAGGTGGT GGCCGTCGCC
GCCCACGATG GCGTCGACCT GTGCTCCTGG AGCAACGCCG GATCGTGGGT CAGCCTTGTC
GCACCTGGTC AGGACGTCCG AAGCACCTAT ATCGACCACG CGCTGTTTCC GGAGGGATGG
GCGCAATGGA GCGGAACCTC GTTCGCGGCG CCGCGAGTGG CTGCCGAGAT CACGGCGCGG
ATCGACGCAC AGGTTGGTGC GGTAGCTGCC ACCAACCAGT TCATGGCCGA CGTGGCAGCG
GCCAACCAGC AGTTCGGAGG TCACCTTGGC TTGATCTGA
 
Protein sequence
MPPSPERISR FAARRRARLE GLPKLAHTDA GGGSRAWFVA DELLVVDDSR REVERYLGRA 
RAAQPGAGDE ELMPGLRRYR APGLDVPTAV RALRSDRPAG RQTVCPNHVF LSSPFNQGGP
FGPPAPTAAA TLKTPAETDR VAVSIVDTGF WTDTPLPVDY LASDGVEVET ETDVDNDGLL
DGDVGHANFI GGVIANHTDR AVLRVVRTLD TFGVCTEDEL IASLGRVHPD TKVINLSLGG
YTADGTAPLG VQAALQQALS GLDRVVVAAA GNDGNRSDPF WPAAFAGAGE SWSGQVVAVA
AHDGVDLCSW SNAGSWVSLV APGQDVRSTY IDHALFPEGW AQWSGTSFAA PRVAAEITAR
IDAQVGAVAA TNQFMADVAA ANQQFGGHLG LI