Gene Sare_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1391 
Symbol 
ID5703750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1606639 
End bp1608405 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content72% 
IMG OID641270901 
ProductRNA binding metal dependent phosphohydrolase 
Protein accessionYP_001536282 
Protein GI159037029 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0559611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGT TCGACGCGGT CCTTCTCGTG GCCGTGCTGC TGCTCGCCCT CATCGTGCTC 
GGCGCGGTGC TGGTTGGTGT CCGAGCCGTG CGCGGCATCG CCGGCGCGCC CCGGCCCGAG
GACCCGGCCT TCATCGCCGA GAAAGACCGC CAGGAACAGT CCCTCGCCGC CCTGCGGTCC
GCCGCCGACG AGGCGAACAG CACAGTCGAC GCGGCGAAGT CCGCCGCCGC CGCGGCCCGC
ACCGAGGCCG CTGCCGCCCG AGCTGAGGCG AAGGCCGCCC GCGCCGAGGC CCGGCGGGTG
CTCGACGGCG CCCGCGCCGA AGCGGACACC ATCCTGGAAC GGGTACACAA GCAGGCCGAG
GCCGACGCCG AACAGTTGCG AACCGCCGCC CGGCGCAGCG GGGAGCGGGA GGCAGCTGTT
CTCGCCACGA CCACCCGGGA ACAGGCGGCC GAGGTGGAGC GGCGTGCCGC CCGGATGGAC
GATCGGGAGC GGCTGCACAG CGAGGAGGTG GAGCGGCTCG CCGAGCGGGA TCGTCAGCTC
AGCGCCGCCA GCGCCGCCCT GGCCGCCCGT GAGTCGACTC TCGTCGACCG GGACCGGGAG
TTGGCGCAGG CGGAGGATCG GCGCCGCCGC GAGTTGGAGC GGGTCGCGGG GATCACCGCG
GAGGCCGCCC GTGGCGAACT GGTCGAGGCG ATCGAGGCGC AGGCCAAGCG GGAGGCCGCC
CTGCTGGTAC GCGAGATCGA GTCGGAGGCG CGCAACACGG GCGAGGAGCG TGCCCGGCAC
ATCGTGGTTG ACGCGATCCA GCGGGTGGCC AGCGAGCAGA CCGCGGAGAG TGTGGTCAGC
GTGCTGCACC TGCCGGGTGA CGAGATGAAG GGTCGGATCA TCGGCCGGGA GGGGCGCAAC
ATCCGCGCCT TCGAATCCGT GACCGGCGTC AACCTGATCA TCGACGACAC CCCGGAGGCG
GTGCTGCTGT CCTGCTTCGA CCCGGTACGT CGGGAAGTCG GCCGACTCAC CCTGGAAAAG
CTCGTCCTGG ACGGCCGTAT CCATCCACAC CGGATCGAGG AGGTGCACGA CCTGGCCCGG
CAGGAGGTGG TGCAGCTCTG CCAGCGTGCC GCCGAGGACG CCCTCGTCGA GGTCGGCATC
ACCGAGATTC ACCCCGAGTT GGTCAGCCTG CTGGGCCGGC TGCGCTACCG CACCTCGTAC
GGGCAGAACG TGCTCAAGCA CCTCGTCGAG ACCGCCCATA TCGCCGGGAT CATGGCGGCC
GAACTGCGGT TGGACGTACC GACGATCAAG CGGTGCGCCT TCCTGCACGA CATCGGTAAG
GCGCTCACCC ACGAGGTCGA GGGCAGTCAT GCCATCGTCG GCGCCGACGT CGCCCGCAAG
TACGGCGAGA GCGAGGACGT CGTGCACGCC ATCGAGGCGC ACCACAACGA GGTGCCGCCG
CAGACCATCG AGGCGGTGCT GACCCAGGCC TCGGACGCCT GCTCCGGCGG TCGGCCGGGG
GCCCGTCGGG AGAGCCTGGA GGCGTACGTG CGGCGGCTGG AGCGGATCGA GGAGATCGCC
GCGGGCAAGC TCGGCGTGGA GCGGGTCTTC GCGATGCAGG CGGGCCGGGA GGTCCGGGTG
ATGGTCCGGC CGGAGGACGT GGACGACATC AGCGCCTCCG TGCTGGCCCG TGACGTGGCC
AAGCAGATCG AGGAGGAGCT GACCTATCCG GGGCAGATCC GGGTAACCGT GGTCCGCGAA
TCCCGGGTCA CCGAGATCGC CCGCTGA
 
Protein sequence
MSGFDAVLLV AVLLLALIVL GAVLVGVRAV RGIAGAPRPE DPAFIAEKDR QEQSLAALRS 
AADEANSTVD AAKSAAAAAR TEAAAARAEA KAARAEARRV LDGARAEADT ILERVHKQAE
ADAEQLRTAA RRSGEREAAV LATTTREQAA EVERRAARMD DRERLHSEEV ERLAERDRQL
SAASAALAAR ESTLVDRDRE LAQAEDRRRR ELERVAGITA EAARGELVEA IEAQAKREAA
LLVREIESEA RNTGEERARH IVVDAIQRVA SEQTAESVVS VLHLPGDEMK GRIIGREGRN
IRAFESVTGV NLIIDDTPEA VLLSCFDPVR REVGRLTLEK LVLDGRIHPH RIEEVHDLAR
QEVVQLCQRA AEDALVEVGI TEIHPELVSL LGRLRYRTSY GQNVLKHLVE TAHIAGIMAA
ELRLDVPTIK RCAFLHDIGK ALTHEVEGSH AIVGADVARK YGESEDVVHA IEAHHNEVPP
QTIEAVLTQA SDACSGGRPG ARRESLEAYV RRLERIEEIA AGKLGVERVF AMQAGREVRV
MVRPEDVDDI SASVLARDVA KQIEEELTYP GQIRVTVVRE SRVTEIAR