Gene Information |
Locus tag | Sare_4777 |
Symbol | |
ID | 5704444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5406888 |
End bp | 5408033 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641274175 |
Product | ROK family protein |
Protein accession | YP_001539521 |
Protein GI | 159040268 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
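The coordinate and length fields in the table above can be cross-checked against one another. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The `gc_fraction` helper simply counts G and C among the A/C/G/T characters of a sequence string and ignores the spaces used to group bases.

```python
# Values copied from the Gene Information table above.
START_BP = 5406888
END_BP = 5408033
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of translated codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # Both checks hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Passing the full Gene sequence string from the Sequence section to `gc_fraction` would give a value to compare with the listed GC content, which the record reports rounded to a whole percent.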
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000235996 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGGAGCCAC CGATCTCCGC TACGCTGCGC GACCAGACCC TCGATCTGCT CTCCAGCGGC GCGGCCACAT CCCGCGCCGA CCTCGTCGAG GCGTTGCAGG TCGCCCCGTC AACCGTCACC GCCGTGGTGC GCCGGCTGCT GGAGGAGGGC GTCCTCGCGG AGGAGGGCAT GGGTCGCTCC ACCGGTGGGC GACGCCCGCG GATCCTGCGG CTGCGAGAGA CCAAGGGAAT CCTCGCCGTC GCAGAACTCG GCGGCCGGCA CGCCCGGGTC GGGTTGTGCA CACCCGGCGG CGAGCTGCAC ACCACCGAGG AGGTGGCGAT CGACATCGCC GCCGGGCCCG ACGAGGTCTT CGCGGTCGTC GGAGCCACCT TCGCGCGGCT CCAGACGGCG ACCGCACCCG GTCAGGTGCT GCTCGGGGTC GGCGTGGCCC TCCCCGGACC GGTGGGGTTC CCCGGAGGGC GGTTGGTGGG CCCGGCCCGG ATGCCCGGCT GGAGCGGCGT CGACGCTGGC GCCCACCTCA CCGACCGCTT CCAGGTGCCG GTGATCGTCG AGAACGACGC CAAGGCGGCG GCGATGGGCG AGTACGTCAC CCGAGGCCCG GAAGTCGGCG ACATGATCTA CGTCAAGGCC GGCACCGGCA TCGGCGCCTG CCTGGTCAGC GGCGGACAGG TCCATCGTGG CGGGCGCGGC CTCAGCGGCG ACGTCACCCA CGTGCGGGTG GCCGACAGCG GCGAGCGGCA CTGCTCCTGC GGCAGCCGGG GCTGCCTGGA GACCGTTGCC AGCGGTGCCG CCCTGGCCCG TGAGTTGGCC GAGCAGGGTT CCTCGGCGGC CACCGTCCGG GAGATCATCA CGGCGGTCGG CGACGCCGAC CCGACGGTCG TGACCATGGT GCGCCACGCC GGTGGGCTGC TCGGCGTGGC GCTTTCCGGT CTGGTCAACT TCCTCAACCC CGACGCCGTC GTCATCGGCG GTGCGCTGTC CAGCCTCGAC GTCTACGTGG CCGCGACCCG CGGCATGCTC TACGAACGCT GCCTACCGTC CATGACCCAG TCCCTGACCA TCGAAGCCAG CGGCGCGGGC TCGGACGCGG CCCTCATCGG CCTCGGGCAC CTGCTGCGCA CGACCGTCGA CGTCCGACCC GCCTGA
|
Protein sequence | MEPPISATLR DQTLDLLSSG AATSRADLVE ALQVAPSTVT AVVRRLLEEG VLAEEGMGRS TGGRRPRILR LRETKGILAV AELGGRHARV GLCTPGGELH TTEEVAIDIA AGPDEVFAVV GATFARLQTA TAPGQVLLGV GVALPGPVGF PGGRLVGPAR MPGWSGVDAG AHLTDRFQVP VIVENDAKAA AMGEYVTRGP EVGDMIYVKA GTGIGACLVS GGQVHRGGRG LSGDVTHVRV ADSGERHCSC GSRGCLETVA SGAALARELA EQGSSAATVR EIITAVGDAD PTVVTMVRHA GGLLGVALSG LVNFLNPDAV VIGGALSSLD VYVAATRGML YERCLPSMTQ SLTIEASGAG SDAALIGLGH LLRTTVDVRP A