Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4101 |
Symbol | |
ID | 5706528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4660480 |
End bp | 4661298 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641273527 |
Product | histidinol-phosphate phosphatase, putative |
Protein accession | YP_001538882 |
Protein GI | 159039629 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.332683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGGTT ACGCCGCCGA CCTCGCCCTC GCCCACCACC TCGCCGACGC CGCCGACGCG GTCTCCGTCG CGCGATTCCG CGCCCTGGAC CTCCGCGTGG ACACGAAGCC CGACCTCAGT CCGGTCTCCG ACGCGGACAC CGCGGTCGAG CAGGAGATCC GGGCCCTGCT CGCCACCCAC CGCCCGGACG ACGGCCTGCT CGGCGAGGAG TACGGCGAGC AGCCCGCGAC CGGTTCCGGC CGGCGGCGCT GGGTGGTCGA CCCGATCGAC GGGACGAAGA ACTTCATTCG CGGCGTGCCG GTGTGGGCCA CACTCATCGC CCTGCTTGAG GGGGACCGGC CGGTCGCCGG TCTGGTCTCC GCACCGGCCC TGGGCCGTCG CTGGTGGGCG GCGGTCGGCG AGGGGGCGTA CGCGGGGCCG GACCTACCGT CCGGTACGCC GATCCGGGTG TCGGCGGTAA CCGATCTGAG CGACGCCAGT TTCTGCTACT CCTCGCTCGG CGGCTGGGAG GACAACGGTC GGTTGGGAGC TGTCCTGCAG ATCATGCGGG ACGCCTGGCG CAGCCGGGCG TACGGCGACT TCTACGGTTA CATGCTGCTG GCCGAGGGCG CGCTGGACAT CATGGTGGAG CCGGAACTGT CATTGTGGGA CATCGCGGCG CTGGTGCCGA TCGTCACCGA GGCGGGGGGA ATGCTCACCG ACCTGGCTGG CCGGCCCGCC CCTGGAGACA CCAGCTCCGG CGGTACCAGC GCGGTTGCCA CGAACGGTCC GCTGCACGCC GGTATCCTCA CCCGCCTGAG CGGAGCCCCT GCACGCTGA
|
Protein sequence | MKGYAADLAL AHHLADAADA VSVARFRALD LRVDTKPDLS PVSDADTAVE QEIRALLATH RPDDGLLGEE YGEQPATGSG RRRWVVDPID GTKNFIRGVP VWATLIALLE GDRPVAGLVS APALGRRWWA AVGEGAYAGP DLPSGTPIRV SAVTDLSDAS FCYSSLGGWE DNGRLGAVLQ IMRDAWRSRA YGDFYGYMLL AEGALDIMVE PELSLWDIAA LVPIVTEAGG MLTDLAGRPA PGDTSSGGTS AVATNGPLHA GILTRLSGAP AR
|
| |