Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1092 |
Symbol | |
ID | 5707013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1229508 |
End bp | 1230683 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270607 |
Product | ROK family protein |
Protein accession | YP_001535991 |
Protein GI | 159036738 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0163084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.226847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CCCGGCTCCC CGGCACCCCC CGCCTGTTGC GGGCGCTCAA CGACCGCGCG GCGCTGGAGC TGTTGTTGGA GCGGGGACCG CTGACCCGGG CGCGGCTGGG CGAGCTGACC GGTCTCTCCA AAGTCACCGC CTCGCAGTTG GTCGAGCGGC TTGAGGAGCG TGGGCTGGTC ACCCGGGTTG GTGAGCAGGC GGGCGGCCGG GGCCCGAACG CCCAGCTCTA CGCGGTCCGA CCGGGCAGCG CCCACGTGGT CGGGGTGGAC TTCGGGGCCG AACGGGTAGT CGCTGCCTGC GCGGACATCA CCGGGGCGGT GGTCGGCCGG GTGGAGCAGT CGACCCGTGA CACCGACGAC CCGGTCGGCG TGGTGCACAG TGCCGTCGCC CTGGCCGCGA GCAGTGCCCA GGTCGAACTG TCGACCGTAC GCCGGATCGT GCTGGGCGCT CCCGGCCTTG TTGATCCGGC CAGTGGTGAC ATCACCTTCG CGTTCAACCT GCCGCGGTGG CACGCCGGCC TGCTCGGCGC GCTTCGTGAT GATCTCCACA TCCCGGTGGT GTTCGAGAAC GACGTGAACC TGGTGGCGAT GGCCGAGGCG CGGTCGGGCG CCGCGCAGGG CGTGCCCGAC TTCGTGCTGG TTTGGGTGGA CGCCGGTATC GGTCTGGCGA TCGTCTTCGG CGGCCGGTTG CATCATGGCA GCACCGGCGC CGCCGGGGAG ATTGGCTGGC TGCCGATGCC CGGTGCGCCG ATCCCGCGTG CCGCTTCGCA CCGAGCAAAG CCCGCGTTTC AGCAACTCGT CGGCGGGGAG GCAGTCCGCG CGCTGGCCAG TGAACGCGGG TATCCGGATG AGACGGCGGC CGGTGGGGTG GCAGCGGCCG TCGCCGACGG CGCGACCGGT GGCCCGATGC TCGACGAGTT GGCCCGTCGG CTCGCGCTCG GCGTGGCGAG CACCTGCGTG GTGCTGGATC CACCGCTGGT GGTGTTGGCC GGCGCGGTCG GCCGGGCCGG CGGTGCGGCG CTGGCCGACC GAGTGCAGCA CGAGGTGGCG GCGATCGCCC CGGTCCGGCC CCGGGTGGTG CCGACCGGGC TGACCGAGGA GCCGATCCTG CGCGGCGCGC TGCACACCGC CCTGGAGGCT GTCCGGGACG AGGTGTTCGA CTCCACAACC GGCTGA
|
Protein sequence | MTTTRLPGTP RLLRALNDRA ALELLLERGP LTRARLGELT GLSKVTASQL VERLEERGLV TRVGEQAGGR GPNAQLYAVR PGSAHVVGVD FGAERVVAAC ADITGAVVGR VEQSTRDTDD PVGVVHSAVA LAASSAQVEL STVRRIVLGA PGLVDPASGD ITFAFNLPRW HAGLLGALRD DLHIPVVFEN DVNLVAMAEA RSGAAQGVPD FVLVWVDAGI GLAIVFGGRL HHGSTGAAGE IGWLPMPGAP IPRAASHRAK PAFQQLVGGE AVRALASERG YPDETAAGGV AAAVADGATG GPMLDELARR LALGVASTCV VLDPPLVVLA GAVGRAGGAA LADRVQHEVA AIAPVRPRVV PTGLTEEPIL RGALHTALEA VRDEVFDSTT G
|
| |