Gene Sare_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4239 
Symbol 
ID5708089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4812614 
End bp4813795 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content74% 
IMG OID641273658 
ProductROK family protein 
Protein accessionYP_001539011 
Protein GI159039758 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0397832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0388456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCAG GACCGAGCCA GGACGATATC CGTCGGCAGA ACCTCGGAGC GTTGCTGCGG 
TACGTCCACC TGCACGGGGC CACGTCACGA GCCGAGCTGA CCACCACACT GGGGTTGAAC
CGCAGCACCA TCGGCGCACT CACCACCGAC CTCGCCGCGG CCGGCCTGGT GAGCGAGGGG
GCGCCGAAGG AAACCGGCCG GGCCGGACGA CCGTCGCTGG TCGTCCGGCC CGAGTCGGCC
CGGGTGTACG CGTACGCGTA CAGCATCGAG GTGGACCGGC TGCGGGTTGC GCGGATCGGG
CTCGGCGGCG GGGTGCTCGA CCGCCGGGAA CTGGAACGCC CACGTGGCAT CACCGCCGCC
GAAGCGGTGC CGCTGCTCGC CAGAGCGGCA ACGGAGATGC GACGGTCCGT GCCCGCCGAC
GCGATCTGCG TCGGCGCGGG CGTCGCCGTC TGCGGCATGG TCCGCCGCGA CGACGGGCTG
GTCCGTCTCG GCCCCACGAT GGGCTGGGTG GACGAGCCGA TCGGGGCGGC TATCGGCGCC
GAGCTGGGGC CGGACGTCCC GGTCGTCGTC GGCAACGTCG CCGACGTGGC GGCCTTCGCC
GAGCACGCCC GCGGCGCGGC CTCGGGCTGC GACAACGTGA TCTACCTGTA CGGCGATGTC
GGCGTCGGCG CCGGCATCAT CACCGGGGGG CGCCGGCTGA CCGGACACGG CGGCTACGGC
GGCGAGGTCG GGCACATGGT GGTCCTGCGG GACGGAGCCC GCTGCGAGTG TGGGTCGCGC
GGCTGCTGGG AAACCGAGAT CGGCGAGCAC GGCCTGCTGC GCGCCGCCGG TCGGTCCGAC
GCCCGGGGGC GGGACGCGCT GCTGGCCGTC TTCGACGCCG CCGACCGGGG CGACGCCCGA
GCCCAGACGG CGGTCCGTAC CGCCGGCGAC TGGCTCGGCT TCGGCGTGGC CAACCTCGTC
AACATCTTCA ACCCGGAGCT GGTCATCTTC GGTGGCACCA TGCGTGACCT CTACCTGGCC
GCCGCCGCCC AGGTGCGCAG CCGACTCAAC GCCAGCGCGC TGCCCGCCTG CGTGGAACAC
GTCCGGTTGC GCACGCCGAA GCTCGGTGAC GACGGCACCC TGATCGGCGC CGCCGAGCTC
GCCTTCGAGC GCCTTCTCGC CGACCCGCTC GACGTCGGGT GA
 
Protein sequence
MRAGPSQDDI RRQNLGALLR YVHLHGATSR AELTTTLGLN RSTIGALTTD LAAAGLVSEG 
APKETGRAGR PSLVVRPESA RVYAYAYSIE VDRLRVARIG LGGGVLDRRE LERPRGITAA
EAVPLLARAA TEMRRSVPAD AICVGAGVAV CGMVRRDDGL VRLGPTMGWV DEPIGAAIGA
ELGPDVPVVV GNVADVAAFA EHARGAASGC DNVIYLYGDV GVGAGIITGG RRLTGHGGYG
GEVGHMVVLR DGARCECGSR GCWETEIGEH GLLRAAGRSD ARGRDALLAV FDAADRGDAR
AQTAVRTAGD WLGFGVANLV NIFNPELVIF GGTMRDLYLA AAAQVRSRLN ASALPACVEH
VRLRTPKLGD DGTLIGAAEL AFERLLADPL DVG