Gene Sare_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0430 
Symbol 
ID5708407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp491656 
End bp494934 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content71% 
IMG OID641269955 
Productendonuclease/exonuclease/phosphatase 
Protein accessionYP_001535350 
Protein GI159036097 
COG category[R] General function prediction only 
COG ID[COG2374] Predicted extracellular nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.784359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00134706 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCCCGC GCCGCGCGCT CGCCGCGCTC GCCACCGTCA CCACTGCCGC GAGCCTCACG 
GCCGTCGCGG TCACCCCCAA CGCGGCAAGC GCCGCACCAA CCGACCTCTT CATCTCGGAG
TACGTCGAAG GCTCGTCCAA CAACAAGGCG ATCGAACTCT TCAACGGCAC CGGCGCGCCG
GTCGACCTGG CTGCCGGCGG CTACCAACTG CTGATCTACT TCAACGGCGC CACCACGCCG
ACCACCTTCT CCCTCACCGG CACGGTCACG GCGGATGACG TCTTCGTCTT CGCCCATTCC
TCGGCGAACG CGGCGATCCT CGCCCAGGCC GACCAGGTCA CCGGTGCCGG GCTGTTCAAC
GGCGACGACG CGATCGTGCT CCGCCGCGGT GGCACCGTGC TCGACGCGAT CGGCCAGGTG
GGCACCGATC CGGGCGCCGA ATGGGGCACC GGCCTGACCA GCACCGCCAA CAACACCCTC
CGCCGAGTGG GTGGCGTCAC GTCGGGTGAC ACCGAGCCCG GTGACGACTT CGACCCGGCC
ATTCAGTGGG CCGGGTTCGC CACTGACACA GTGGACGGAC TCGGCGCGCA CAGCCTCGAC
GGCGGCGGCC CGGTGGACGC ACCAGCCACC GTCGTCTGTG GTGATGCCCT GGTCACACCG
GCGGGTACCG CAGCGTCCCG GGAGGTCACC GCGACCGACC CGGACGACGA GATCGTCGAC
CTGGCTGTCA CGTCCGTGAC CCCGGCGCCG GATACCGGGA CGATCAGCCG GACGGCCGTC
ACCCCCGCCG GAACGGTCGG TGGCACCGCC CGGGCCACGG TCAGCGCGAG TGCCGACCTG
GCCGCCGGGG CCTACTCGGT GCTGGTGACC GCGACGGACG CCGACGGCAC CACCGCGACC
TGCACCCTGC CCGTGCAGGT CACCCGGGAG CTGACGGTCG GCGAGGTACA GGGCCAGACG
ACCGACGCCG AGGCCGGCGC CGCCGACCGC TCGCCGCTCG CGCCGGCCAG CGGCAACGGC
ACCAGCAGCC TGCGGTACGA GGTCCGTGGT GTCATCACCC AGCGCACCCT GGCCCGCGAT
TCGTCCGGTC GGGACCAGCA CGGCTTCTTC CTCCAGAGTC GAGCCGACGC GACCGACGGC
GACCCCACCA GCTCCGACGG GATCTTCGTC TTCATGGGCT CGTACACGTC ACTCATCGGC
GGTTACGTGC CGACCGTCGG CGACGAGGTG GTGCTCCAAG CCCGGGTCTC CGAGTACTAC
AACATGACGC AGCTCTCCGG CGCCTCGCTG GTCCGCCGGA TCGCCACCGG CCTGGACGTG
GAACAGGTGG TCACCGTGAC CGACGCGGTG CCACCGGCCG ACCTGGCCGA CGCGCAGCGC
TTCTGGGAGC GACACGAGGG GGCCCGGTTG CGGGTACGCG CCGGCAGCAC GGCGGTGAGC
GGGCGCGACG TCTTCGCCGC CACGGCCGAT GCCGAGACCT GGCTGATCGA CCGGGACGAC
CCACTGCTCG ACCGGGACGA ACCGGACACC CGTCGCGTGT TTCGGGATGC CCACCCGCTG
GACAATGACC CGAGCCGCGT CTTCGACGAC GGCAACGGCC AGCGGGTCAT GCTGGGCAGC
CTGGGTGTCA AGGCAGCCGC CGGGGACAAC ACGGCGCTAC TTCCCCCGGC ACGCACCTTC
GACGCCCTGA CCGACGACGC GGTGGGCGGC CTCTACTATT CGTTCCGGAA GTACGGCGTC
CAAGTCGAGT CCGCCGCCTT CGCCGCCGGA ACCGACCCGT CGACGAACAA CCCGCCGCAG
CCGGCCCGGC GATCGACGGA GTACGCGGTC GCCGCCTACA ACGTCGAGAA CCTGTACGAC
TTCCGCGACG ACCCGTTCGA CGGCTGCGAC TTCGCGGGAA ACGACGGCTG CCCCGGCGTA
CGGCCGCCGT TCAACTACGT GCCGGGCAGC GAGCAGGAGT ACCAGGACCA GCTCACCGCC
CTCGCCGACC AGATCACCAA CGACCTGCAC TCCCCTGACC TGATCCTGGT GCAGGAGGCG
GAGGACCAGG ACATCTGCAC GGTCGAGGGC GCCGAGCTGG TCTGCGGTGA CACGAACGAC
GCCGACGGCG CTCCGGACTC ACTCCAGGAG CTCGCCCTGA CCATCACCGG CAACGGCGGC
CCGGCCTACG CGGCCGCGTA CGACCGCACC GGTGCGGACA ACCGGGGCAT CACCTCGGCC
TTCCTCTACC GCACCGACCG GGTGGCGCTG GCCGAGGCAA CGGCCGACGA TCCATTACTC
GGCTCGTCAC CGACCGTCCA GTACCGCGCA CCCGGGCTGC CGTCCAACGC CGACGTGCAG
AACCCCAAGG CGCTCAACGC GGTCCTTCCG CCCGATGTGG ATACCAGCAC CGGGCAGGAT
GGCGACAACG TCTTCACCCG CGCGCCGCAG CTCGGCCGGT TCACGATGGC CGCCGCCCCC
GGCTCCCGCG AGGGATTCAC GCTCTGGGCA GCCAGCAACC ACTATTCGTC CGGCCCGGAC
CGCCGGGTGG GGCAACGACG GGAGCAGGCG GCGTACGGTG CCGCGATCGT GTCCGCGATC
GAGGCGTCGG ACCCGGACGC CCGGGTGGTG TTCGGTGGGG ACCTGAACGT CTTCCCCCGC
CCCGACGATC CCATCGCGAC GGCCGCGGAC CCGACTCCGT CCGACCAACT CGGTCCGCTG
TACGAGGCGG GGCTGCGGAA CCTCTGGGAT GATCTGCTGG CCGCGGCGCC GTCGTCCGCG
TACTCGTACA GCTACGCGGG CCAGGCACAG ACGTTGGATC ACCTGTTCGT GACGGAGGCG
CTGCACGATG ACCTCGTGCA GATGCGAGCC GCGCACATCA ACGCCGACTG GCCGGCGGAG
TACGCGGGTG ACGGATCGCG CGGCTCCAGT GACCACGATC CGCAGGTGGC CCGGTTCCGG
TCGCGCGCGA CGCTGACCGT TGCCGACACG TCGGTCGTCG AGGGCGACCG GGGCCGCGCC
GAACTCGCCT TCGCCGTCAC CGTCTCGCGA CCGCTGTCCG AGCCCACCCT GGTGTGTGCC
CTGACCTTCG GCAAGACCGC CCGGCCCGCC ATCGACTACC GGTCGTACGC CGGTTGCCAG
ACGCTCGCCG CCGGGCAGAC GACCCTGACG TTCCCGGTAT CCGTGCGCGG GGACCGGAGG
CAGGAGGCCG ACGAGAAGCT GGCGTTGCTG GTGGCCGGCG GTCCGGGGCT CCGCCTCGCC
GATCCGCTGG GCACCGGGAC CATCGTCGAC GACGACTGA
 
Protein sequence
MRPRRALAAL ATVTTAASLT AVAVTPNAAS AAPTDLFISE YVEGSSNNKA IELFNGTGAP 
VDLAAGGYQL LIYFNGATTP TTFSLTGTVT ADDVFVFAHS SANAAILAQA DQVTGAGLFN
GDDAIVLRRG GTVLDAIGQV GTDPGAEWGT GLTSTANNTL RRVGGVTSGD TEPGDDFDPA
IQWAGFATDT VDGLGAHSLD GGGPVDAPAT VVCGDALVTP AGTAASREVT ATDPDDEIVD
LAVTSVTPAP DTGTISRTAV TPAGTVGGTA RATVSASADL AAGAYSVLVT ATDADGTTAT
CTLPVQVTRE LTVGEVQGQT TDAEAGAADR SPLAPASGNG TSSLRYEVRG VITQRTLARD
SSGRDQHGFF LQSRADATDG DPTSSDGIFV FMGSYTSLIG GYVPTVGDEV VLQARVSEYY
NMTQLSGASL VRRIATGLDV EQVVTVTDAV PPADLADAQR FWERHEGARL RVRAGSTAVS
GRDVFAATAD AETWLIDRDD PLLDRDEPDT RRVFRDAHPL DNDPSRVFDD GNGQRVMLGS
LGVKAAAGDN TALLPPARTF DALTDDAVGG LYYSFRKYGV QVESAAFAAG TDPSTNNPPQ
PARRSTEYAV AAYNVENLYD FRDDPFDGCD FAGNDGCPGV RPPFNYVPGS EQEYQDQLTA
LADQITNDLH SPDLILVQEA EDQDICTVEG AELVCGDTND ADGAPDSLQE LALTITGNGG
PAYAAAYDRT GADNRGITSA FLYRTDRVAL AEATADDPLL GSSPTVQYRA PGLPSNADVQ
NPKALNAVLP PDVDTSTGQD GDNVFTRAPQ LGRFTMAAAP GSREGFTLWA ASNHYSSGPD
RRVGQRREQA AYGAAIVSAI EASDPDARVV FGGDLNVFPR PDDPIATAAD PTPSDQLGPL
YEAGLRNLWD DLLAAAPSSA YSYSYAGQAQ TLDHLFVTEA LHDDLVQMRA AHINADWPAE
YAGDGSRGSS DHDPQVARFR SRATLTVADT SVVEGDRGRA ELAFAVTVSR PLSEPTLVCA
LTFGKTARPA IDYRSYAGCQ TLAAGQTTLT FPVSVRGDRR QEADEKLALL VAGGPGLRLA
DPLGTGTIVD DD