Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3506 |
Symbol | |
ID | 5703315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4045043 |
End bp | 4045990 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641272933 |
Product | ROK family glucokinase |
Protein accession | YP_001538299 |
Protein GI | 159039046 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00502776 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCTGA CCATCGGAGT GGACGTCGGT GGCACGAAGG TCGCGGCCGG CGTCGTGGAC GACACGGGCA CGGTGCTCGT GCAGACCCGA CGGGACACTC CCGCGGACGA TGTCGGCAAG ACCTGCGACG TCATCGTCGA GGTGATCCGG GAACTGGCCG CTGGCCGTGC GATCGAGGGG GTCGGCATCG GCGCGGCCGG GTGGATTGAC GCCAGCCGAT CAACCGTGCT CTTCGCCCCG AACCTTGCCT GGCGTGACGA GCCGCTGCGC GAGTTCGTCA GTGCAGCCAC CGACCTGCCG GTGATCGTGG AGAACGACGC CAACGTGGCG GCCTGGGGGG AGTTCCGCTA CGGAGCGGCC CGTGACGCCG ACGACTCGAT GGTCATGTTC ACCATCGGCA CCGGGGTCGG TGGCGGCATC GTGCTTGGCG GCGAGTTGGT TCGCGGCGCG CATGGTATCG CCGCTGAACT GGGACACATG CTCAGTGTGC CGGACGGGCA CCAGTGCGGC TGCGGCCGGC TGGGCTGCAT CGAGCAGTAC GCCAGCGGGA GCGCCCTGGT GCGGTTCGCC CAGGCTGCCG CTCGCCAGGA ACCAAACCGC GCCGCCGCCC TGCTGGGGCA GGCCGGTGGC GACGTCGACG CGATCACCGG CCGAATGGTC ACCGCCGCTG CGCGGGACGG CGACCCGGTC TCCACCGAGG CTTTCGCCCA GGTCGGCCAC TGGCTCGGCA GCGGTCTCGC CGACATGGCG CAGATCCTCG ATCCGCAGGT GTTGGTGGTC GGCGGTGGCG TCGTCGAAGC CGGTGAACTG CTGCTGGGCC CGACCCGCTG CTCCTTCACC GAGGCGCTCG CGCAGCGTTG TCGGCTGCCG GTGGCGCAGA TCAGCCCCGC CAAGCTCGGC AACGACGCTG GTCTCATCGG CGCCGCCGAC CTCGCCCGCC GGGTCTAG
|
Protein sequence | MTLTIGVDVG GTKVAAGVVD DTGTVLVQTR RDTPADDVGK TCDVIVEVIR ELAAGRAIEG VGIGAAGWID ASRSTVLFAP NLAWRDEPLR EFVSAATDLP VIVENDANVA AWGEFRYGAA RDADDSMVMF TIGTGVGGGI VLGGELVRGA HGIAAELGHM LSVPDGHQCG CGRLGCIEQY ASGSALVRFA QAAARQEPNR AAALLGQAGG DVDAITGRMV TAAARDGDPV STEAFAQVGH WLGSGLADMA QILDPQVLVV GGGVVEAGEL LLGPTRCSFT EALAQRCRLP VAQISPAKLG NDAGLIGAAD LARRV
|
| |