Gene Information |
Locus tag | Sare_3535 |
Symbol | |
ID | 5704603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4075585 |
End bp | 4076562 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272962 |
Product | ribokinase-like domain-containing protein |
Protein accession | YP_001538328 |
Protein GI | 159039075 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
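The coordinate and length fields above are consistent with each other, which a short sketch can confirm. The variable names are illustrative. The check assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 4075585
end_bp = 4076562

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length (978 bp) and Protein Length (325 aa) fields.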
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.32323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000219352 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGATCG CCGTGACCGG CTCGATCGCG ACTGACCACC TGATGAGCTT CCCCGGCCGA TTCGCCGAGC AGTTCATCGC CGACCAGCTA GACAAGGTGT CGCTCTCCTT CCTGGTGGAC GACCTTGTGC TTCGACGGGG CGGGGTGGCC GCCAACATCG CCTTCGGCAT GGGCCAGCTC GGCCTGCGCC CGGTCCTGGT GGGCGCGGTG GGCGCCGACT TCGCCGACTA TCGCTCCTGG CTGGAGCGGC ACGGCGTCGA CTGCGAGTCG GTGCATGTCA GCGAGATCGC CCACACCGCC CGCTTCGTCT GCACCACCGA TACCGAGATG TGTCAGATCG CCTCCTTCTA CGCGGGCGCG ATGAGCGAGG CGCGCAACAT CGAGTTGGAG CCGATTTCCC GCCGTGCCGG CGGCCTCGAC CTGGTGCTGG TCGGCGCCAA CGACCCGGAG GCGATGCTGC GCCACTCCAC CGAGTGCCGG GAGCGGGGGT ACGCGTTCGC TGCTGATCCC TCCCAGCAGC TCGCCCGGAT GCCCGGCGAG GAGGTCGTCA CGCTGATCGA GGGCGCCGAC TACCTGATGA CCAACGAGTA CGAGAAGTCG CTGCTCCAGA GCAAGGCCAG CCTCAGCGAC GAGCAGCTGC TCGACTTGGT CAAGGTGCGG GTAACCACCT TGGGTAAGCG GGGTGTGGAG ATCGCCGGAC GGGGATTCGA CCCGATTCAC GTACCGATCG CCCGGGAGAT CCGCGCCGTC GACCCGACCG GAGTCGGCGA CGGGTTCCGG GCGGGCTTCT TCACCGCGCT CTCCTGGGGC CTCGGACTGG AGCGGGCTGC CCAGGTCGGC TCGCTACTGG CCACCCACGC GCTGGAGACA GTCGGTACCC AGGAATACCA GATCCGCAAC GACCTGTTCG TCAAGCGACT CGGCGAGTCG TACGGTGACG CGGCCGCCGA CGACATCCGC CCACACCTGA TTCCGTGA
|
Protein sequence | MKIAVTGSIA TDHLMSFPGR FAEQFIADQL DKVSLSFLVD DLVLRRGGVA ANIAFGMGQL GLRPVLVGAV GADFADYRSW LERHGVDCES VHVSEIAHTA RFVCTTDTEM CQIASFYAGA MSEARNIELE PISRRAGGLD LVLVGANDPE AMLRHSTECR ERGYAFAADP SQQLARMPGE EVVTLIEGAD YLMTNEYEKS LLQSKASLSD EQLLDLVKVR VTTLGKRGVE IAGRGFDPIH VPIAREIRAV DPTGVGDGFR AGFFTALSWG LGLERAAQVG SLLATHALET VGTQEYQIRN DLFVKRLGES YGDAAADDIR PHLIP
|
| |
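The relationship between the two sequence fields can be sketched as follows. This is a minimal illustration, not the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. Only the first 30 bases from the Gene sequence field are used here; the full sequence can be pasted in the same way.

```python
# Codons in NCBI order (bases T, C, A, G) with their amino acids.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGAAGATCG CCGTGACCGG CTCGATCGCG"
print(translate(first_blocks))    # first ten residues of the protein
print(gc_fraction(first_blocks))  # GC fraction of this fragment only
```

The translated fragment corresponds to the first block of the Protein sequence field (MKIAVTGSIA). The GC fraction of a 30-base fragment will not in general equal the whole-gene value reported in the GC content field.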