Gene Sare_3506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3506 
Symbol 
ID5703315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4045043 
End bp4045990 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content71% 
IMG OID641272933 
ProductROK family glucokinase 
Protein accessionYP_001538299 
Protein GI159039046 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00502776 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGCTGA CCATCGGAGT GGACGTCGGT GGCACGAAGG TCGCGGCCGG CGTCGTGGAC 
GACACGGGCA CGGTGCTCGT GCAGACCCGA CGGGACACTC CCGCGGACGA TGTCGGCAAG
ACCTGCGACG TCATCGTCGA GGTGATCCGG GAACTGGCCG CTGGCCGTGC GATCGAGGGG
GTCGGCATCG GCGCGGCCGG GTGGATTGAC GCCAGCCGAT CAACCGTGCT CTTCGCCCCG
AACCTTGCCT GGCGTGACGA GCCGCTGCGC GAGTTCGTCA GTGCAGCCAC CGACCTGCCG
GTGATCGTGG AGAACGACGC CAACGTGGCG GCCTGGGGGG AGTTCCGCTA CGGAGCGGCC
CGTGACGCCG ACGACTCGAT GGTCATGTTC ACCATCGGCA CCGGGGTCGG TGGCGGCATC
GTGCTTGGCG GCGAGTTGGT TCGCGGCGCG CATGGTATCG CCGCTGAACT GGGACACATG
CTCAGTGTGC CGGACGGGCA CCAGTGCGGC TGCGGCCGGC TGGGCTGCAT CGAGCAGTAC
GCCAGCGGGA GCGCCCTGGT GCGGTTCGCC CAGGCTGCCG CTCGCCAGGA ACCAAACCGC
GCCGCCGCCC TGCTGGGGCA GGCCGGTGGC GACGTCGACG CGATCACCGG CCGAATGGTC
ACCGCCGCTG CGCGGGACGG CGACCCGGTC TCCACCGAGG CTTTCGCCCA GGTCGGCCAC
TGGCTCGGCA GCGGTCTCGC CGACATGGCG CAGATCCTCG ATCCGCAGGT GTTGGTGGTC
GGCGGTGGCG TCGTCGAAGC CGGTGAACTG CTGCTGGGCC CGACCCGCTG CTCCTTCACC
GAGGCGCTCG CGCAGCGTTG TCGGCTGCCG GTGGCGCAGA TCAGCCCCGC CAAGCTCGGC
AACGACGCTG GTCTCATCGG CGCCGCCGAC CTCGCCCGCC GGGTCTAG
 
Protein sequence
MTLTIGVDVG GTKVAAGVVD DTGTVLVQTR RDTPADDVGK TCDVIVEVIR ELAAGRAIEG 
VGIGAAGWID ASRSTVLFAP NLAWRDEPLR EFVSAATDLP VIVENDANVA AWGEFRYGAA
RDADDSMVMF TIGTGVGGGI VLGGELVRGA HGIAAELGHM LSVPDGHQCG CGRLGCIEQY
ASGSALVRFA QAAARQEPNR AAALLGQAGG DVDAITGRMV TAAARDGDPV STEAFAQVGH
WLGSGLADMA QILDPQVLVV GGGVVEAGEL LLGPTRCSFT EALAQRCRLP VAQISPAKLG
NDAGLIGAAD LARRV