Gene Sare_1115 details

Gene Information

Locus tag: Sare_1115
Symbol:
ID: 5706058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1258694
End bp: 1260274
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 72%
IMG OID: 641270630
Product: metallophosphoesterase
Protein accession: YP_001536014
Protein GI: 159036761
COG category: [R] General function prediction only
COG ID: [COG1408] Predicted phosphohydrolases
TIGRFAM ID:
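
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(1258694, 1260274)
print(gene_len, protein_len)  # 1581 526
```

Under these assumptions the computed values agree with the listed Gene Length (1581 bp) and Protein Length (526 aa).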


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.190762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00561367
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGGTC AGGAGAACGA GCAGCAGGAC AGCCAGGGTG GCCAACCGTC GGAGACCGGC 
CGGACCCGAC GGGTTCCGCT TCGCCGCCGG GCACCCGACG GCCGCCCCGC CCGCTGGCGG
GCGCTGCGAG CCGTCGGGAT GACGCTGGCC GTGCTCGCCG TGACCCTGGC TGGGGTCCTG
ATCGGCACGC TTGCCGGTGG CCGGGTCAGC ACCGACATCG GGCCGTTTCA GGCGAACCTG
TCCCTGGCTC CGGCGCTGCA CGGCGGCACC ACCATCGACG TACCCCCCCT CGGCGCGCTG
CTGCTCGACA GCCACGACGG GCCCACCCAG CTGACGGTGC AGTTCGGCGC CCTCGACCAG
GGTCGCACCG AGGCCCTCCT ACACGACCCG GCCAGCCTCA GCCGGGCGAG CCAGACCGCC
GTCGACGACG TCCGTGCGGG CGTCCTGCAG CTCGGTGTCC GCACGATCGC CGCCACGGTT
CTGGTCACCC TGGTGCTGGC CCTGCTGGTG TTCCGCGACA CCCGCCGCGC GGCCTGGGCG
GGGGTGCTCG CACTGGTGAT CGCCGCGGGC AGCCTGGGCA CCGCGGCGGC CACTCTGCGG
CCCCAGGCGA TCGAGGAACC GCGCTACGAG GGGCTCCTGG TCAACGCGCC GGCACTCGTC
GGAGACGCGC GCCGGATCGC CAACGACTAC ACCCGCTACG CCGAGCAGCT CCAACGCATC
GTCGGCAACG TCAGCCAGCT CTACACCACC GTCTCGGCGC TGCCGGTGTT CGAGCCGGAG
CCCGGCACCA CGCGCGTACT ACACATCTCC GACATGCACC TCAACCCGGC TGCCTGGCAG
GTCATCCGGA CCGTGGTGGA GCAGTTCGGG ATCGACGTCG TGGTCGACAC CGGCGACATC
ACCGACTGGG GCAGCGAACC GGAGGCGAAC TACGTCGCCT CAATCGGGCT CCTCCAGAAG
CCCTACGTCT TCATCCGCGG CAACCACGAC TCGGGGAGCA CCGCCGCGGC CGTGGCCCAG
CAGCGCAACG CCATCGTGCT GGACAACACG ACCACCACCG TCGCCGGGCT GACCATCGCC
GGAATCGGTG ATCCGCGCTT CACGCCGGAC AAGAGCACCT CGCCGGCGGG CAGCGGCCTG
ACCCAGGAGA CCGCCGACCA ACTCATCGAC GTCGGAGACC AGTTGGCGGC CACGGCCCGC
ACCTCACCCC GGCCGGTGGA CCTGGCGCTG GTGCACGACC CCGCGTCGGC GGGGCCGCTC
GCCGGCGTCA CCCCGCTGGT GCTCGCTGGG CACACGCACA ACCGGGAGGT GCACCGGTTG
CCCCAGGAGC CCGACCAGTC CCCGACGCTG CTGATGGTGC AGGGCTCGAC CGGCGGCGCC
GGCCTGCGGG GCCTGGAGGG CGAGCAACCC ACCCCACTGT CGATGACCGT CCTCTACTTC
GACGAGGAGA AGCTGCTCCA GGCGTACGAC GACATCACCG TGGGTGGCAC CGGCCAGGCT
CAGGTGAACC TCGAACGACA CATCGTGGAG GACCCGAAGG CCGGCGAGCC CGCCCCGGTC
ACCCCCACAC CGACCCGCTG A
 
Protein sequence
MDGQENEQQD SQGGQPSETG RTRRVPLRRR APDGRPARWR ALRAVGMTLA VLAVTLAGVL 
IGTLAGGRVS TDIGPFQANL SLAPALHGGT TIDVPPLGAL LLDSHDGPTQ LTVQFGALDQ
GRTEALLHDP ASLSRASQTA VDDVRAGVLQ LGVRTIAATV LVTLVLALLV FRDTRRAAWA
GVLALVIAAG SLGTAAATLR PQAIEEPRYE GLLVNAPALV GDARRIANDY TRYAEQLQRI
VGNVSQLYTT VSALPVFEPE PGTTRVLHIS DMHLNPAAWQ VIRTVVEQFG IDVVVDTGDI
TDWGSEPEAN YVASIGLLQK PYVFIRGNHD SGSTAAAVAQ QRNAIVLDNT TTTVAGLTIA
GIGDPRFTPD KSTSPAGSGL TQETADQLID VGDQLAATAR TSPRPVDLAL VHDPASAGPL
AGVTPLVLAG HTHNREVHRL PQEPDQSPTL LMVQGSTGGA GLRGLEGEQP TPLSMTVLYF
DEEKLLQAYD DITVGGTGQA QVNLERHIVE DPKAGEPAPV TPTPTR
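
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such text might be joined into one contiguous string and how a GC fraction could be computed from it. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(block_text):
    """Join a block-formatted sequence into one uppercase string.

    Removes all whitespace (spaces between 10-character blocks and
    line breaks), leaving only the sequence letters.
    """
    return "".join(block_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGACGGTC AGGAGAACGA")
print(len(fragment), gc_fraction(fragment))  # 20 0.55
```

Applying the same two functions to the complete gene sequence would give a value that can be compared with the GC content field of the record.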