Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0226 |
Symbol | |
ID | 5705988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 257648 |
End bp | 258718 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641269755 |
Product | hypothetical protein |
Protein accession | YP_001535152 |
Protein GI | 159035899 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1387] Histidinol phosphatase and related hydrolases of the PHP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.171941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.010649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCC GGGATCCCAT CGCCGACCTA CGCCGGATAG CTTTCCTGCT GGAGCGGGCC AACGAGGCCA CCTACCGGGT GCGTGCCTTC CGGTCGGCGG CGAAAGCCGT GGCCGCGCTG CCGTCCGCGG AGGTCGCCGA GCTGGCCCGC GCCGGCAAGC TGACCACGCT GTCCGGGGTG GGTGAGGTGA CCGCCCGCTG CGTCGCCGAG TCGCTGGCCG GTGAGGAGCC GGTCTATCTG CGTCGCCTGG CGGCGACCGA GGGCGCCGAC CTGGACGCCG AGGCCACCGC GCTGCGGACG GCGTTGCGCG GCGACTGCCA CACCCACTCC GACTGGTCCG ACGGCGGCTC CTCGATCGAG GAGATGGCGT TGGCGGCGGT CGAGTTGGGC CACGAGTACC TGGTGATCAC CGACCACTCA CCTCGGCTGA AGGTGGCGCA GGGGCTGACC GCCGACCGGC TGCGCCGTCA GCTGGACCAG GTGGCGAGTC TGAACGAGGC GCTACCGGAG GGGTTCCGGA TCCTCACCGG CGTCGAGGTG GACATCCTCG CCGACGGCTC CCTGGACCAG GACGAGGAGC TGCTCGCCCG GCTCGACGTG GTGGTGGGAT CGGTGCACAG TGGCCTGTCC GACGAGCGGG GGAGGATGAC CCACCGGATG CTCGCCGCGA TCGCGAATCC GCACCTGGAC ATCCTCGGAC ACTGTACGGG CCGGATGGTG TCCAGCCGAC CGGCGGGCGT GACCGGCCCC GGCGACCGGG GACACCGCCG GCGCACCCGG GGGGAGAGTG ACTTCGACGC GGACGCTGTC TTCGCGGCCT GCGCGGAACA CGGTGTCGCT GTCGAGGTCA ACTCCCGGCC GGAGCGGCAG GATCCGCCGA AGCGGCTGAT CCGGCGGGCG CTCGAGGCCG GCTGCCAGTT CGCGATCAAT ACCGACGCCC ATGCTCCCGG TCAACTCGAC TGGCAGCGGT TCGGCTGCGA ACGCGCCGCC CGCTGCGGTG TCCCCGCCGA TCGGGTGGTC AACACCTGGC CGGCGGAACG GCTGGTGGTG TGGGCCGGGA GCCGCTCCTG A
|
Protein sequence | MTARDPIADL RRIAFLLERA NEATYRVRAF RSAAKAVAAL PSAEVAELAR AGKLTTLSGV GEVTARCVAE SLAGEEPVYL RRLAATEGAD LDAEATALRT ALRGDCHTHS DWSDGGSSIE EMALAAVELG HEYLVITDHS PRLKVAQGLT ADRLRRQLDQ VASLNEALPE GFRILTGVEV DILADGSLDQ DEELLARLDV VVGSVHSGLS DERGRMTHRM LAAIANPHLD ILGHCTGRMV SSRPAGVTGP GDRGHRRRTR GESDFDADAV FAACAEHGVA VEVNSRPERQ DPPKRLIRRA LEAGCQFAIN TDAHAPGQLD WQRFGCERAA RCGVPADRVV NTWPAERLVV WAGSRS
|
| |