Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0430 |
Symbol | |
ID | 5708407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 491656 |
End bp | 494934 |
Gene Length | 3279 bp |
Protein Length | 1092 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641269955 |
Product | endonuclease/exonuclease/phosphatase |
Protein accession | YP_001535350 |
Protein GI | 159036097 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.784359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00134706 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCCGC GCCGCGCGCT CGCCGCGCTC GCCACCGTCA CCACTGCCGC GAGCCTCACG GCCGTCGCGG TCACCCCCAA CGCGGCAAGC GCCGCACCAA CCGACCTCTT CATCTCGGAG TACGTCGAAG GCTCGTCCAA CAACAAGGCG ATCGAACTCT TCAACGGCAC CGGCGCGCCG GTCGACCTGG CTGCCGGCGG CTACCAACTG CTGATCTACT TCAACGGCGC CACCACGCCG ACCACCTTCT CCCTCACCGG CACGGTCACG GCGGATGACG TCTTCGTCTT CGCCCATTCC TCGGCGAACG CGGCGATCCT CGCCCAGGCC GACCAGGTCA CCGGTGCCGG GCTGTTCAAC GGCGACGACG CGATCGTGCT CCGCCGCGGT GGCACCGTGC TCGACGCGAT CGGCCAGGTG GGCACCGATC CGGGCGCCGA ATGGGGCACC GGCCTGACCA GCACCGCCAA CAACACCCTC CGCCGAGTGG GTGGCGTCAC GTCGGGTGAC ACCGAGCCCG GTGACGACTT CGACCCGGCC ATTCAGTGGG CCGGGTTCGC CACTGACACA GTGGACGGAC TCGGCGCGCA CAGCCTCGAC GGCGGCGGCC CGGTGGACGC ACCAGCCACC GTCGTCTGTG GTGATGCCCT GGTCACACCG GCGGGTACCG CAGCGTCCCG GGAGGTCACC GCGACCGACC CGGACGACGA GATCGTCGAC CTGGCTGTCA CGTCCGTGAC CCCGGCGCCG GATACCGGGA CGATCAGCCG GACGGCCGTC ACCCCCGCCG GAACGGTCGG TGGCACCGCC CGGGCCACGG TCAGCGCGAG TGCCGACCTG GCCGCCGGGG CCTACTCGGT GCTGGTGACC GCGACGGACG CCGACGGCAC CACCGCGACC TGCACCCTGC CCGTGCAGGT CACCCGGGAG CTGACGGTCG GCGAGGTACA GGGCCAGACG ACCGACGCCG AGGCCGGCGC CGCCGACCGC TCGCCGCTCG CGCCGGCCAG CGGCAACGGC ACCAGCAGCC TGCGGTACGA GGTCCGTGGT GTCATCACCC AGCGCACCCT GGCCCGCGAT TCGTCCGGTC GGGACCAGCA CGGCTTCTTC CTCCAGAGTC GAGCCGACGC GACCGACGGC GACCCCACCA GCTCCGACGG GATCTTCGTC TTCATGGGCT CGTACACGTC ACTCATCGGC GGTTACGTGC CGACCGTCGG CGACGAGGTG GTGCTCCAAG CCCGGGTCTC CGAGTACTAC AACATGACGC AGCTCTCCGG CGCCTCGCTG GTCCGCCGGA TCGCCACCGG CCTGGACGTG GAACAGGTGG TCACCGTGAC CGACGCGGTG CCACCGGCCG ACCTGGCCGA CGCGCAGCGC TTCTGGGAGC GACACGAGGG GGCCCGGTTG CGGGTACGCG CCGGCAGCAC GGCGGTGAGC GGGCGCGACG TCTTCGCCGC CACGGCCGAT GCCGAGACCT GGCTGATCGA CCGGGACGAC CCACTGCTCG ACCGGGACGA ACCGGACACC CGTCGCGTGT TTCGGGATGC CCACCCGCTG GACAATGACC CGAGCCGCGT CTTCGACGAC GGCAACGGCC AGCGGGTCAT GCTGGGCAGC CTGGGTGTCA AGGCAGCCGC CGGGGACAAC ACGGCGCTAC TTCCCCCGGC ACGCACCTTC GACGCCCTGA CCGACGACGC GGTGGGCGGC CTCTACTATT CGTTCCGGAA GTACGGCGTC CAAGTCGAGT CCGCCGCCTT CGCCGCCGGA ACCGACCCGT CGACGAACAA CCCGCCGCAG CCGGCCCGGC GATCGACGGA GTACGCGGTC GCCGCCTACA ACGTCGAGAA CCTGTACGAC TTCCGCGACG ACCCGTTCGA CGGCTGCGAC TTCGCGGGAA ACGACGGCTG CCCCGGCGTA CGGCCGCCGT TCAACTACGT GCCGGGCAGC GAGCAGGAGT ACCAGGACCA GCTCACCGCC CTCGCCGACC AGATCACCAA CGACCTGCAC TCCCCTGACC TGATCCTGGT GCAGGAGGCG GAGGACCAGG ACATCTGCAC GGTCGAGGGC GCCGAGCTGG TCTGCGGTGA CACGAACGAC GCCGACGGCG CTCCGGACTC ACTCCAGGAG CTCGCCCTGA CCATCACCGG CAACGGCGGC CCGGCCTACG CGGCCGCGTA CGACCGCACC GGTGCGGACA ACCGGGGCAT CACCTCGGCC TTCCTCTACC GCACCGACCG GGTGGCGCTG GCCGAGGCAA CGGCCGACGA TCCATTACTC GGCTCGTCAC CGACCGTCCA GTACCGCGCA CCCGGGCTGC CGTCCAACGC CGACGTGCAG AACCCCAAGG CGCTCAACGC GGTCCTTCCG CCCGATGTGG ATACCAGCAC CGGGCAGGAT GGCGACAACG TCTTCACCCG CGCGCCGCAG CTCGGCCGGT TCACGATGGC CGCCGCCCCC GGCTCCCGCG AGGGATTCAC GCTCTGGGCA GCCAGCAACC ACTATTCGTC CGGCCCGGAC CGCCGGGTGG GGCAACGACG GGAGCAGGCG GCGTACGGTG CCGCGATCGT GTCCGCGATC GAGGCGTCGG ACCCGGACGC CCGGGTGGTG TTCGGTGGGG ACCTGAACGT CTTCCCCCGC CCCGACGATC CCATCGCGAC GGCCGCGGAC CCGACTCCGT CCGACCAACT CGGTCCGCTG TACGAGGCGG GGCTGCGGAA CCTCTGGGAT GATCTGCTGG CCGCGGCGCC GTCGTCCGCG TACTCGTACA GCTACGCGGG CCAGGCACAG ACGTTGGATC ACCTGTTCGT GACGGAGGCG CTGCACGATG ACCTCGTGCA GATGCGAGCC GCGCACATCA ACGCCGACTG GCCGGCGGAG TACGCGGGTG ACGGATCGCG CGGCTCCAGT GACCACGATC CGCAGGTGGC CCGGTTCCGG TCGCGCGCGA CGCTGACCGT TGCCGACACG TCGGTCGTCG AGGGCGACCG GGGCCGCGCC GAACTCGCCT TCGCCGTCAC CGTCTCGCGA CCGCTGTCCG AGCCCACCCT GGTGTGTGCC CTGACCTTCG GCAAGACCGC CCGGCCCGCC ATCGACTACC GGTCGTACGC CGGTTGCCAG ACGCTCGCCG CCGGGCAGAC GACCCTGACG TTCCCGGTAT CCGTGCGCGG GGACCGGAGG CAGGAGGCCG ACGAGAAGCT GGCGTTGCTG GTGGCCGGCG GTCCGGGGCT CCGCCTCGCC GATCCGCTGG GCACCGGGAC CATCGTCGAC GACGACTGA
|
Protein sequence | MRPRRALAAL ATVTTAASLT AVAVTPNAAS AAPTDLFISE YVEGSSNNKA IELFNGTGAP VDLAAGGYQL LIYFNGATTP TTFSLTGTVT ADDVFVFAHS SANAAILAQA DQVTGAGLFN GDDAIVLRRG GTVLDAIGQV GTDPGAEWGT GLTSTANNTL RRVGGVTSGD TEPGDDFDPA IQWAGFATDT VDGLGAHSLD GGGPVDAPAT VVCGDALVTP AGTAASREVT ATDPDDEIVD LAVTSVTPAP DTGTISRTAV TPAGTVGGTA RATVSASADL AAGAYSVLVT ATDADGTTAT CTLPVQVTRE LTVGEVQGQT TDAEAGAADR SPLAPASGNG TSSLRYEVRG VITQRTLARD SSGRDQHGFF LQSRADATDG DPTSSDGIFV FMGSYTSLIG GYVPTVGDEV VLQARVSEYY NMTQLSGASL VRRIATGLDV EQVVTVTDAV PPADLADAQR FWERHEGARL RVRAGSTAVS GRDVFAATAD AETWLIDRDD PLLDRDEPDT RRVFRDAHPL DNDPSRVFDD GNGQRVMLGS LGVKAAAGDN TALLPPARTF DALTDDAVGG LYYSFRKYGV QVESAAFAAG TDPSTNNPPQ PARRSTEYAV AAYNVENLYD FRDDPFDGCD FAGNDGCPGV RPPFNYVPGS EQEYQDQLTA LADQITNDLH SPDLILVQEA EDQDICTVEG AELVCGDTND ADGAPDSLQE LALTITGNGG PAYAAAYDRT GADNRGITSA FLYRTDRVAL AEATADDPLL GSSPTVQYRA PGLPSNADVQ NPKALNAVLP PDVDTSTGQD GDNVFTRAPQ LGRFTMAAAP GSREGFTLWA ASNHYSSGPD RRVGQRREQA AYGAAIVSAI EASDPDARVV FGGDLNVFPR PDDPIATAAD PTPSDQLGPL YEAGLRNLWD DLLAAAPSSA YSYSYAGQAQ TLDHLFVTEA LHDDLVQMRA AHINADWPAE YAGDGSRGSS DHDPQVARFR SRATLTVADT SVVEGDRGRA ELAFAVTVSR PLSEPTLVCA LTFGKTARPA IDYRSYAGCQ TLAAGQTTLT FPVSVRGDRR QEADEKLALL VAGGPGLRLA DPLGTGTIVD DD
|
| |