Gene Sare_3354 details


Gene Information

Locus tag: Sare_3354
Symbol:
ID: 5705860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3872287
End bp: 3873927
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 74%
IMG OID: 641272780
Product: X-Pro dipeptidyl-peptidase domain-containing protein
Protein accession: YP_001538147
Protein GI: 159038894
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
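
The coordinate and length fields in this record are related by simple arithmetic. The sketch below shows the relationship, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3872287, 3873927)
print(length)                  # 1641
print(protein_length(length))  # 546
```

Both values agree with the Gene Length and Protein Length fields listed for Sare_3354.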


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.417425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0112773
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCGCGG CGCTGGCGAC CCGGGTGGCC ACGGCCGCGT TGCGCCTGCC GCCGTCACGT 
ACCCGCCGGG TCACCCTGAC CCGCGACATC CTGGTCCGGA CCCGCGACGG CGTGTCGCTG
CGCACCGACC ACCACGCCCC GGACCGGCCG GCCGCACCCA CGGTGCTCAT CCGCACCCCG
TACGGGCGGG GTGGGCCGAT GCGCCTGCTC GGCCGGCTCG CCGCCGAGCG GGGCTACCAC
GTGGTGATCC AGTCCTGCCG GGGTACCGGT GGGTCCGGCG GGCTGTTCGA CCCGCTGGTG
CACGAACGCG ACGACGGCCT GGACACCCTC GACTGGCTGC GCCGCCAGTC CTGGTGGAAC
GGCACATTCG GCATGTTCGG GGCCAGCTAC CAGGGCTTCG TCCAGTGGGC CGTCGCCGCT
GACGCCGGGG CCGACCTTCG CGCGATGGTC GCGGTGGTGA CCGCCTCCGG CACCCGCGAC
TCGACGTATC CGGGCGAGTC CTTCGCCCTG GACACCGTGC TCACCTGGGC CGAACTGCTC
CAGGCGCAGA CCGTCGGGTG GCTCGCCCGG CAGTGGGAAC TCAAGCGTGG CCAGCCCCGG
CTCGCCGCCG GGTTGTCCCA CCTGCCGCTG GCCGAGGCGG ACCGGGTGGC CACCGGCGTC
ACCGTGCCCT TCTTCCAGGA ATGGTTACGC CACCACACCC CGGACGCGGC GTACTGGCGG
AGGCGGGTCT TCGGTGACCG GCTTGCGGAG GTCCACGCCC CCGTTTCCAT GATCAGCGGC
TGGCACGACA TCTTTCTCCC CGCCCAGTTG CGGGACTTCG CGGCCCTGCG TGCCGCCGGT
GCCGCGCCCC GGCTCACCGT CGGGCCGTGG ACGCACGGCA GCCCCGGGCT GTTCGTCGCC
GCGCTCCGCG ACGGACTGGA CTGGATCGAC CAACATCTGG GCGGGTACCC GGGGCGTCAC
CGCGCCCCGG TCCGCGTGCA CGTCGGCGGG GCCGGCGGCG GCTGGCGAGA TCTGCCAGAC
TGGCCGCCAC CAGGCACGCC GACCGCCTGG CACCTGCACC CACACGGTGC GCTGCGGGCC
ACGCCGCCGC CGGTGTCGAC CCCAGACGGT TTCTGGTACG ACCCGGCCGA TCCCACCCCC
TCGGTGGGCG GCCCGCTGCT GGTGGCCCAA CAGGCCGGCA AGGTGGACAA CCGGCCCGTC
GAGGCCCGCT CCGACGTGCT GACCTGGACC AGCGCGGCGT TGACCGCGGC AGTGGAAGTC
ATCGGACCGG TCCAGGCCGA GATCTTCGTC CGCAGCGAGC TACCTCACCT GGACGTTTTC
GTGCGGCTGT GCGACGTGGA CCGCCGGGGT CGCTCCTGGA ACGTCTGTGA CGGGCTGGTC
CGGGTCAGGC CGCCCGCCTT CTCGCCCGAC CAGACGAGCG CGGTCCGCGT CGCGGTGCCG
TTGTGGCCGG TGGCCCACCG GTTCGCCGCC GGTCACCGAC TGCGGGTGCA GATCTCCGGC
GGGGCCCACC CCCGGTACGC GCGTAACCCC GGCACCGGCG AACCGCTCGG CACCGCGGTC
ACCCTGCGCG CCGGATGGCG GGAGATCCTG CACGATCCGC AGCACCCGTC CGCGCTGGTG
CTACCCACTG TCGAGGGTTG A
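
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before any computation on it. As a minimal sketch, a GC fraction like the one reported in the GC content field could be computed as follows. The helper name `gc_fraction` is hypothetical, and the example call uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content reported in the record.
first_line = "GTGCTCGCGG CGCTGGCGAC CCGGGTGGCC ACGGCCGCGT TGCGCCTGCC GCCGTCACGT"
print(f"{gc_fraction(first_line):.0%}")
```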
 
Protein sequence
MLAALATRVA TAALRLPPSR TRRVTLTRDI LVRTRDGVSL RTDHHAPDRP AAPTVLIRTP 
YGRGGPMRLL GRLAAERGYH VVIQSCRGTG GSGGLFDPLV HERDDGLDTL DWLRRQSWWN
GTFGMFGASY QGFVQWAVAA DAGADLRAMV AVVTASGTRD STYPGESFAL DTVLTWAELL
QAQTVGWLAR QWELKRGQPR LAAGLSHLPL AEADRVATGV TVPFFQEWLR HHTPDAAYWR
RRVFGDRLAE VHAPVSMISG WHDIFLPAQL RDFAALRAAG AAPRLTVGPW THGSPGLFVA
ALRDGLDWID QHLGGYPGRH RAPVRVHVGG AGGGWRDLPD WPPPGTPTAW HLHPHGALRA
TPPPVSTPDG FWYDPADPTP SVGGPLLVAQ QAGKVDNRPV EARSDVLTWT SAALTAAVEV
IGPVQAEIFV RSELPHLDVF VRLCDVDRRG RSWNVCDGLV RVRPPAFSPD QTSAVRVAVP
LWPVAHRFAA GHRLRVQISG GAHPRYARNP GTGEPLGTAV TLRAGWREIL HDPQHPSALV
LPTVEG
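
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration and not the pipeline that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and that an alternative start codon such as GTG is read as methionine (M) when it is the first codon. The set `ALT_STARTS` lists only three start codons commonly used in bacteria, and the names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons treated as initiator methionine (not exhaustive).
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("GTGCTCGCGG CGCTGGCGAC CCGGGTGGCC"))  # MLAALATRVA
print(translate("CCCACTGTCGAGGGTTGA"))                # PTVEG
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final five residues before the TGA stop codon.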