Gene Sare_1340 details


Gene Information

Locus tag: Sare_1340
Symbol:
ID: 5703871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1547225
End bp: 1548613
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 70%
IMG OID: 641270851
Product: peptidase M16 domain-containing protein
Protein accession: YP_001536232
Protein GI: 159036979
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
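
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1547225
END_BP = 1548613
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Report whether the record's fields agree with the computed values.
print("gene length consistent:", computed_gene_length == GENE_LENGTH_BP)
print("protein length consistent:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from Start bp to End bp covers 1389 positions, which corresponds to 463 codons, one of which is the stop codon.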


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000321073
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTCGGG CCGCCTCCGC ACCGTCCACA CCGGAACGAC GGGGGGTGGC GGCGTACCGG 
AGCACCCCCA GCGGTGCTGT CGGCCGCGCC GTCACCCGCA CGCTCAGCGA CGATCCGCTG
GGTGGCGCTG TTCGACGGAC CGTGCTGCCC AGCGGGCTGC GGGTGCTCAC CGAGACGATC
CCGGCGATGC GCAGCGTCTC GTTCGGCATC TGGGTGTCGG TCGGTTCCCG GGACGAGACC
GGACCCCAGT CCGGTGCCGC CCACTTCCTC GAACACCTGC TGTTCAAGGG AACGCACCGG
CGGGCGGCCC TGGAGATCTC CTCGGCGATC GAGGCGGTGG GTGGTGAGAC CAACGCCTTC
ACCACCAAGG AGTACACCTG CTACTACGCG CGGGTGCTGG ACGAGGACCT GCCCCTCGCC
ATCGACGTGA TGTGCGACCT GGTCGCCGAT TCGGTGCTCA CCCCGGACGA TGTGGAGATC
GAGCGGGGAG TGATCCTCGA GGAGATCGCC ATGCACGACG ACGAGCCCGG CGACGAGGTG
CACGACCTCT TCGCCCGGGC CGTCTACGGC GAGCACCCGC TGGGCCGGCT GATCTCGGGG
ACGGAACAGA CGGTCACGCC GATGACCCGA CGGCAGATCC AGAGCTTCTA CCGGCGGCAC
TACACCCCGC CGCGCATCGT CATCGCCGCC GCGGGGAACC TTGACCACGC CAGCGTGGTC
ACGATGGTCC GCCAGGCGCT GCGCGGCACA CCACTGGACA CCGACCCGGC GACGCCGGCG
CCGCACCGGG CCGCCACCCC GGCGGTCCGC ACGCGGCCCG CCACCACGCT GGTCACGCCG
AAGGAGACCG AGCAGGCGCA CGTCGTACTC GGCTGCACCG GCATCGACTG GCACGACGAC
CGTCGGTTCG CCCTCGGGGT GCTCAACAAC ATCCTCGGCG GCGGCATGTC CAGCCGGCTC
TTCCAGGAGA TCCGCGAGCA GCGCGGGCTC GCCTACTCGG TCTACTCCTA CGCCAGCCAG
CACGCCGACA GCGGCCTGTT CGGCATCTAC GCCGGTTGTG CGCCGGGTCG GGTCAACGAG
GTGTTGGATC TGATCCGCGC GGAGTTGACC CGGGTGGCCG TGGACGGGCT CACGGAGGCC
GAGGTGGCCC GGGGCAAGGG CATGAGCAAG GGATCGTTCG TGCTGGGTTT GGAGGACAGC
GGCTCCCGGA TGAGCCGGCT GGCCAAGGGG GAATTGCTGT ACGGCGACCT GTTGCCGGTC
GACGCGTTGC TGGCCCGGGT CGACGCGGTC ACCGTCGACG ACGTGAACAC GCTGGCCACG
GAGTTGCTGA GCCGTTCGCT GTCGTTGGCG GTGGTGGGTC CCTTCGGCGA GTCCGACTTC
ACCGCCTGA
 
Protein sequence
MTRAASAPST PERRGVAAYR STPSGAVGRA VTRTLSDDPL GGAVRRTVLP SGLRVLTETI 
PAMRSVSFGI WVSVGSRDET GPQSGAAHFL EHLLFKGTHR RAALEISSAI EAVGGETNAF
TTKEYTCYYA RVLDEDLPLA IDVMCDLVAD SVLTPDDVEI ERGVILEEIA MHDDEPGDEV
HDLFARAVYG EHPLGRLISG TEQTVTPMTR RQIQSFYRRH YTPPRIVIAA AGNLDHASVV
TMVRQALRGT PLDTDPATPA PHRAATPAVR TRPATTLVTP KETEQAHVVL GCTGIDWHDD
RRFALGVLNN ILGGGMSSRL FQEIREQRGL AYSVYSYASQ HADSGLFGIY AGCAPGRVNE
VLDLIRAELT RVAVDGLTEA EVARGKGMSK GSFVLGLEDS GSRMSRLAKG ELLYGDLLPV
DALLARVDAV TVDDVNTLAT ELLSRSLSLA VVGPFGESDF TA