Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1746 |
Symbol | |
ID | 5705379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2017994 |
End bp | 2019775 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641271249 |
Product | HAD superfamily hydrolase |
Protein accession | YP_001536624 |
Protein GI | 159037371 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01656] histidinol-phosphate phosphatase family domain [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0409111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCACAGG TGGAGCCGGA TCAGCGCAGG TCGACCCGGG CAACAGCCCG CCGGGGCGGC TCGGTTTCCC GCCTGTTCGA CGCGGTGCTG CTGGACCGGG ATGGCACCCT CATCGAGGAC GTGCCGTACA ACGGGAACCC GGAGCGGGTA CGGCCGATGC CGGGGGCGCG GGCGGCGCTG GACGCGCTGC GGTCGGCGGG CCTGCGGCTG GCGGTGGTGA CGAACCAGTC CGGGCTGGCC AAGGGGCTGT TCACCGAGGC GCAGCTGCGG GCCGTACACG CGCGGGTCGA GCAGGTGCTC GGCCCGTTCG ACGCCTGGCT GGTCTGTCCG CACGACGACG ACGACCGGTG CAGCTGTCGC AAACCAGCGC CGGAACTGAT CCACGCCGCC GCCCGCCGGC TCGGCACCAC GCCGCAGCGG TGCGTTCTGG TCGGTGACAT CGGTCGCGAT GTCACCGCTG CACTGGCCGC CGGCGCCCAG GCGGTCCTGG TGCCCACGCC ACTGACCCGC CCACCCGAAA CCAGCGCCGC GCCGTGGGTC GCGGCGGATC TACCGGCCGC GGCAGCCGAG ATTCTCCGTC GGCAGGCCGC CATCGATCCG GCCACCCACC GACGCGCCCT GCCGTCGGTG GGCCTTCCGT CGCCGGCCAC TGTGGCGTCG GCGGGCGCCC CGTCCCCGGC CACTGTGGCG TCGGCGGTCC GTCGCTCCCG CCCCAGCCGG CGTGCCGGGA CCGTGCTCGT CGTCCGTTCC GACTCGGCCG GCGACGTGCT TGTCACGGGC CCGGGGATCC GTGCCGTCGC CGCCCACGCG CGCCGGGTCG TCCTGCTGTG CGGACCGCGC GGTCGTGCCG CCGCCGACCT CCTACCTGGC GTCGACACCG TCATCGAGCA CCCACTGCCG TGGATCGACC CCGCACCCGC ACCGGTTACC CCGCACGACA TCGCCACCCT CACCACCGCC CTCGCTGCCG TCGACGCCGA CGAGGCGGTG ATCTTCACCA GCTACCACCA GTCCCCGCTC CCCTTGGCCC TGCTGCTGCG TGCCGTCGAC GTCGAGCGCA TCTGCGCGAT CAGCGACGAC TACCCCGGCA GCCTGCTCGA CGTCCGCCAC CACGTCCCGA CCGGCACCCC CGAGCCCGAA CGTGCCCTCT CGCTCGCCGC CGCCGCCGGC TACCCACTAC CGTCCGACGA CGAACCGGTC CTGCGGCTGC GGCCGGTGCC ACCGCCACCT GCGCGGGTGG GCGCGCCGGG CTACGTGGTG CTGCACCCCG GCTCGGCGGC TCAGTCCCGG GGGTTGCCCC CCGACCTGGC AGCGGAGATC GTCCGGACCC TGGTCGGCGC GGGCCACCGG GTCGTGGTCA CCGGCGGTCC GGACGAGGTG GCGTTGACCG CGCGGGTGGC CGGTGGGATC GCCGTTGATC TCGGTGGTGG GACCGGACTG GCCGACCTGG CCGCGACCGT CGCCGGTGCC GCCGCGGTGG TCGTCGGTAA CACCGGTCCC GCCCACCTCG CCGCCGCGTA CGGCGTTCCG GTGGTCAGCC TCTTCGCCCC GACGGTCCCG TTCGGGCAGT GGGGGCCGTG GCGGGTACCG ACCGTCCGGC TCGGCGATCC GGACGCCCCC TGCCGCGGCA CCCGTGCCGC CACCTGCCCG GTACCCGGCC ACCCCTGCCT GAGCCGGATC AGGCCGGAGG AGGTGTTGGC CGCGCTGATC CTGCTCGGCG TGCCCCTGTC CCGGCCACCG ACGACGGCCG TGGCCACCGC CCTCGCCCGG AGCGGCCGAT GA
|
Protein sequence | MPQVEPDQRR STRATARRGG SVSRLFDAVL LDRDGTLIED VPYNGNPERV RPMPGARAAL DALRSAGLRL AVVTNQSGLA KGLFTEAQLR AVHARVEQVL GPFDAWLVCP HDDDDRCSCR KPAPELIHAA ARRLGTTPQR CVLVGDIGRD VTAALAAGAQ AVLVPTPLTR PPETSAAPWV AADLPAAAAE ILRRQAAIDP ATHRRALPSV GLPSPATVAS AGAPSPATVA SAVRRSRPSR RAGTVLVVRS DSAGDVLVTG PGIRAVAAHA RRVVLLCGPR GRAAADLLPG VDTVIEHPLP WIDPAPAPVT PHDIATLTTA LAAVDADEAV IFTSYHQSPL PLALLLRAVD VERICAISDD YPGSLLDVRH HVPTGTPEPE RALSLAAAAG YPLPSDDEPV LRLRPVPPPP ARVGAPGYVV LHPGSAAQSR GLPPDLAAEI VRTLVGAGHR VVVTGGPDEV ALTARVAGGI AVDLGGGTGL ADLAATVAGA AAVVVGNTGP AHLAAAYGVP VVSLFAPTVP FGQWGPWRVP TVRLGDPDAP CRGTRAATCP VPGHPCLSRI RPEEVLAALI LLGVPLSRPP TTAVATALAR SGR
|
| |