Gene Sare_1746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1746 
Symbol 
ID5705379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2017994 
End bp2019775 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content75% 
IMG OID641271249 
ProductHAD superfamily hydrolase 
Protein accessionYP_001536624 
Protein GI159037371 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0241] Histidinol phosphatase and related phosphatases
[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0409111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCACAGG TGGAGCCGGA TCAGCGCAGG TCGACCCGGG CAACAGCCCG CCGGGGCGGC 
TCGGTTTCCC GCCTGTTCGA CGCGGTGCTG CTGGACCGGG ATGGCACCCT CATCGAGGAC
GTGCCGTACA ACGGGAACCC GGAGCGGGTA CGGCCGATGC CGGGGGCGCG GGCGGCGCTG
GACGCGCTGC GGTCGGCGGG CCTGCGGCTG GCGGTGGTGA CGAACCAGTC CGGGCTGGCC
AAGGGGCTGT TCACCGAGGC GCAGCTGCGG GCCGTACACG CGCGGGTCGA GCAGGTGCTC
GGCCCGTTCG ACGCCTGGCT GGTCTGTCCG CACGACGACG ACGACCGGTG CAGCTGTCGC
AAACCAGCGC CGGAACTGAT CCACGCCGCC GCCCGCCGGC TCGGCACCAC GCCGCAGCGG
TGCGTTCTGG TCGGTGACAT CGGTCGCGAT GTCACCGCTG CACTGGCCGC CGGCGCCCAG
GCGGTCCTGG TGCCCACGCC ACTGACCCGC CCACCCGAAA CCAGCGCCGC GCCGTGGGTC
GCGGCGGATC TACCGGCCGC GGCAGCCGAG ATTCTCCGTC GGCAGGCCGC CATCGATCCG
GCCACCCACC GACGCGCCCT GCCGTCGGTG GGCCTTCCGT CGCCGGCCAC TGTGGCGTCG
GCGGGCGCCC CGTCCCCGGC CACTGTGGCG TCGGCGGTCC GTCGCTCCCG CCCCAGCCGG
CGTGCCGGGA CCGTGCTCGT CGTCCGTTCC GACTCGGCCG GCGACGTGCT TGTCACGGGC
CCGGGGATCC GTGCCGTCGC CGCCCACGCG CGCCGGGTCG TCCTGCTGTG CGGACCGCGC
GGTCGTGCCG CCGCCGACCT CCTACCTGGC GTCGACACCG TCATCGAGCA CCCACTGCCG
TGGATCGACC CCGCACCCGC ACCGGTTACC CCGCACGACA TCGCCACCCT CACCACCGCC
CTCGCTGCCG TCGACGCCGA CGAGGCGGTG ATCTTCACCA GCTACCACCA GTCCCCGCTC
CCCTTGGCCC TGCTGCTGCG TGCCGTCGAC GTCGAGCGCA TCTGCGCGAT CAGCGACGAC
TACCCCGGCA GCCTGCTCGA CGTCCGCCAC CACGTCCCGA CCGGCACCCC CGAGCCCGAA
CGTGCCCTCT CGCTCGCCGC CGCCGCCGGC TACCCACTAC CGTCCGACGA CGAACCGGTC
CTGCGGCTGC GGCCGGTGCC ACCGCCACCT GCGCGGGTGG GCGCGCCGGG CTACGTGGTG
CTGCACCCCG GCTCGGCGGC TCAGTCCCGG GGGTTGCCCC CCGACCTGGC AGCGGAGATC
GTCCGGACCC TGGTCGGCGC GGGCCACCGG GTCGTGGTCA CCGGCGGTCC GGACGAGGTG
GCGTTGACCG CGCGGGTGGC CGGTGGGATC GCCGTTGATC TCGGTGGTGG GACCGGACTG
GCCGACCTGG CCGCGACCGT CGCCGGTGCC GCCGCGGTGG TCGTCGGTAA CACCGGTCCC
GCCCACCTCG CCGCCGCGTA CGGCGTTCCG GTGGTCAGCC TCTTCGCCCC GACGGTCCCG
TTCGGGCAGT GGGGGCCGTG GCGGGTACCG ACCGTCCGGC TCGGCGATCC GGACGCCCCC
TGCCGCGGCA CCCGTGCCGC CACCTGCCCG GTACCCGGCC ACCCCTGCCT GAGCCGGATC
AGGCCGGAGG AGGTGTTGGC CGCGCTGATC CTGCTCGGCG TGCCCCTGTC CCGGCCACCG
ACGACGGCCG TGGCCACCGC CCTCGCCCGG AGCGGCCGAT GA
 
Protein sequence
MPQVEPDQRR STRATARRGG SVSRLFDAVL LDRDGTLIED VPYNGNPERV RPMPGARAAL 
DALRSAGLRL AVVTNQSGLA KGLFTEAQLR AVHARVEQVL GPFDAWLVCP HDDDDRCSCR
KPAPELIHAA ARRLGTTPQR CVLVGDIGRD VTAALAAGAQ AVLVPTPLTR PPETSAAPWV
AADLPAAAAE ILRRQAAIDP ATHRRALPSV GLPSPATVAS AGAPSPATVA SAVRRSRPSR
RAGTVLVVRS DSAGDVLVTG PGIRAVAAHA RRVVLLCGPR GRAAADLLPG VDTVIEHPLP
WIDPAPAPVT PHDIATLTTA LAAVDADEAV IFTSYHQSPL PLALLLRAVD VERICAISDD
YPGSLLDVRH HVPTGTPEPE RALSLAAAAG YPLPSDDEPV LRLRPVPPPP ARVGAPGYVV
LHPGSAAQSR GLPPDLAAEI VRTLVGAGHR VVVTGGPDEV ALTARVAGGI AVDLGGGTGL
ADLAATVAGA AAVVVGNTGP AHLAAAYGVP VVSLFAPTVP FGQWGPWRVP TVRLGDPDAP
CRGTRAATCP VPGHPCLSRI RPEEVLAALI LLGVPLSRPP TTAVATALAR SGR