Gene Sare_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2354 
Symbol 
ID5706938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2708937 
End bp2710097 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content73% 
IMG OID641271832 
Productpeptidase M50 
Protein accessionYP_001537203 
Protein GI159037950 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.176917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCCA AGCGGCAAGC CCCACGCCCG CCTACCCGGC ACTCCGGTGT GACCGTCGGT 
CGGGTGGTCG GGGTGCCGCT GCGCCTGGAC TGGTCGATGC TGCTGCTCGC CCTGGCCGTC
GCCGTGATGT ACGCCGAATT CGCCCGCCAC CAGCTCGCCC TCTCGCCGGC CGGTGGCTAC
GTGATCGGCC TCGGCTTCGT GGTTTCGCTG CTCGGGTCGG TGCTCCTGCA CGAACTCGGG
CACGCCCTCA CCGCCCGCCG GTACGGCATC GGGGTCCGCG GCATCACCCT GGAGCTGCTC
GGCGGCTACA CCGAGATGGA CCGCGACGCC CCGACTCCCC GCGTCGACCT GCTGGTGTCG
CTGGCCGGGC CGGCCGTCTC CGCGGTACTG GGCGGGGCAG CGGTCGCCGT CACGATGGCG
CTGCCGGACC GTACGGTGGG TCACCAGCTC GCCTTCCAGC TCGCGGTGAG TAACGTCGTT
GTCGCAGCGT TCAACGTGCT ACCCGGGCTG CCGCTCGATG GTGGCCGCGC GCTGCGAGCC
GCCCTCTGGG CCGCCACCCG GGACCGGCAC CGGGCCACCG AGGTGGCTGG CTGGGTCGGC
CGTGTCGTTG CCATCGGTAC CGTCGGGGCG GCAGTCGTCC TTGCCCTCAC CCGTCCCCCG
ACACCTCCGG TACTGCTCGC GCTACCACTG ATGCTGCTGG TCGCGTTCAC CCTCTGGCGG
GGCGCCGGGC AGTCGATCCG GCTGGCCCGG GTCACCCGCC GGTTCCCGCT GATCGATCTC
TCGCGGTTGG CCCGTCCGGT GTGCGCCGTC CCGGCCGGAA CCCCTCTCGC CGAGGCGCAG
CGCCGCGCTG CCGGGACCGA CCCTCCGGCC GCGCTGCTGG TCACCGACTC CGCGGGTGGC
CCGCACGCCC TGGTCAATCC GGTCGAGGTG GCGGCGGTAG CGGTGGACCG TCGACCCTGG
GTGCCGGTGG ACGCGGTGTC CCGGCCACTG GCCGAGGTGC CGGCCGTGTC GGTCGGCCTC
GACGGCGAGC AGGTGATGGA GACGGTGCGG CGCCACCCGG GCGCACAGTA CGTGGTGACC
TCAGGCGAAG ATGTCGTCGG CATCCTGTAC CTCGCGGATC TGGCTCAGCT ACTCGAACCT
CACCGGAAGA TGAACACGTG A
 
Protein sequence
MESKRQAPRP PTRHSGVTVG RVVGVPLRLD WSMLLLALAV AVMYAEFARH QLALSPAGGY 
VIGLGFVVSL LGSVLLHELG HALTARRYGI GVRGITLELL GGYTEMDRDA PTPRVDLLVS
LAGPAVSAVL GGAAVAVTMA LPDRTVGHQL AFQLAVSNVV VAAFNVLPGL PLDGGRALRA
ALWAATRDRH RATEVAGWVG RVVAIGTVGA AVVLALTRPP TPPVLLALPL MLLVAFTLWR
GAGQSIRLAR VTRRFPLIDL SRLARPVCAV PAGTPLAEAQ RRAAGTDPPA ALLVTDSAGG
PHALVNPVEV AAVAVDRRPW VPVDAVSRPL AEVPAVSVGL DGEQVMETVR RHPGAQYVVT
SGEDVVGILY LADLAQLLEP HRKMNT