Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1391 |
Symbol | |
ID | 5703750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1606639 |
End bp | 1608405 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270901 |
Product | RNA binding metal dependent phosphohydrolase |
Protein accession | YP_001536282 |
Protein GI | 159037029 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0559611 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGGT TCGACGCGGT CCTTCTCGTG GCCGTGCTGC TGCTCGCCCT CATCGTGCTC GGCGCGGTGC TGGTTGGTGT CCGAGCCGTG CGCGGCATCG CCGGCGCGCC CCGGCCCGAG GACCCGGCCT TCATCGCCGA GAAAGACCGC CAGGAACAGT CCCTCGCCGC CCTGCGGTCC GCCGCCGACG AGGCGAACAG CACAGTCGAC GCGGCGAAGT CCGCCGCCGC CGCGGCCCGC ACCGAGGCCG CTGCCGCCCG AGCTGAGGCG AAGGCCGCCC GCGCCGAGGC CCGGCGGGTG CTCGACGGCG CCCGCGCCGA AGCGGACACC ATCCTGGAAC GGGTACACAA GCAGGCCGAG GCCGACGCCG AACAGTTGCG AACCGCCGCC CGGCGCAGCG GGGAGCGGGA GGCAGCTGTT CTCGCCACGA CCACCCGGGA ACAGGCGGCC GAGGTGGAGC GGCGTGCCGC CCGGATGGAC GATCGGGAGC GGCTGCACAG CGAGGAGGTG GAGCGGCTCG CCGAGCGGGA TCGTCAGCTC AGCGCCGCCA GCGCCGCCCT GGCCGCCCGT GAGTCGACTC TCGTCGACCG GGACCGGGAG TTGGCGCAGG CGGAGGATCG GCGCCGCCGC GAGTTGGAGC GGGTCGCGGG GATCACCGCG GAGGCCGCCC GTGGCGAACT GGTCGAGGCG ATCGAGGCGC AGGCCAAGCG GGAGGCCGCC CTGCTGGTAC GCGAGATCGA GTCGGAGGCG CGCAACACGG GCGAGGAGCG TGCCCGGCAC ATCGTGGTTG ACGCGATCCA GCGGGTGGCC AGCGAGCAGA CCGCGGAGAG TGTGGTCAGC GTGCTGCACC TGCCGGGTGA CGAGATGAAG GGTCGGATCA TCGGCCGGGA GGGGCGCAAC ATCCGCGCCT TCGAATCCGT GACCGGCGTC AACCTGATCA TCGACGACAC CCCGGAGGCG GTGCTGCTGT CCTGCTTCGA CCCGGTACGT CGGGAAGTCG GCCGACTCAC CCTGGAAAAG CTCGTCCTGG ACGGCCGTAT CCATCCACAC CGGATCGAGG AGGTGCACGA CCTGGCCCGG CAGGAGGTGG TGCAGCTCTG CCAGCGTGCC GCCGAGGACG CCCTCGTCGA GGTCGGCATC ACCGAGATTC ACCCCGAGTT GGTCAGCCTG CTGGGCCGGC TGCGCTACCG CACCTCGTAC GGGCAGAACG TGCTCAAGCA CCTCGTCGAG ACCGCCCATA TCGCCGGGAT CATGGCGGCC GAACTGCGGT TGGACGTACC GACGATCAAG CGGTGCGCCT TCCTGCACGA CATCGGTAAG GCGCTCACCC ACGAGGTCGA GGGCAGTCAT GCCATCGTCG GCGCCGACGT CGCCCGCAAG TACGGCGAGA GCGAGGACGT CGTGCACGCC ATCGAGGCGC ACCACAACGA GGTGCCGCCG CAGACCATCG AGGCGGTGCT GACCCAGGCC TCGGACGCCT GCTCCGGCGG TCGGCCGGGG GCCCGTCGGG AGAGCCTGGA GGCGTACGTG CGGCGGCTGG AGCGGATCGA GGAGATCGCC GCGGGCAAGC TCGGCGTGGA GCGGGTCTTC GCGATGCAGG CGGGCCGGGA GGTCCGGGTG ATGGTCCGGC CGGAGGACGT GGACGACATC AGCGCCTCCG TGCTGGCCCG TGACGTGGCC AAGCAGATCG AGGAGGAGCT GACCTATCCG GGGCAGATCC GGGTAACCGT GGTCCGCGAA TCCCGGGTCA CCGAGATCGC CCGCTGA
|
Protein sequence | MSGFDAVLLV AVLLLALIVL GAVLVGVRAV RGIAGAPRPE DPAFIAEKDR QEQSLAALRS AADEANSTVD AAKSAAAAAR TEAAAARAEA KAARAEARRV LDGARAEADT ILERVHKQAE ADAEQLRTAA RRSGEREAAV LATTTREQAA EVERRAARMD DRERLHSEEV ERLAERDRQL SAASAALAAR ESTLVDRDRE LAQAEDRRRR ELERVAGITA EAARGELVEA IEAQAKREAA LLVREIESEA RNTGEERARH IVVDAIQRVA SEQTAESVVS VLHLPGDEMK GRIIGREGRN IRAFESVTGV NLIIDDTPEA VLLSCFDPVR REVGRLTLEK LVLDGRIHPH RIEEVHDLAR QEVVQLCQRA AEDALVEVGI TEIHPELVSL LGRLRYRTSY GQNVLKHLVE TAHIAGIMAA ELRLDVPTIK RCAFLHDIGK ALTHEVEGSH AIVGADVARK YGESEDVVHA IEAHHNEVPP QTIEAVLTQA SDACSGGRPG ARRESLEAYV RRLERIEEIA AGKLGVERVF AMQAGREVRV MVRPEDVDDI SASVLARDVA KQIEEELTYP GQIRVTVVRE SRVTEIAR
|
| |