Gene Sare_1092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1092 
Symbol 
ID5707013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1229508 
End bp1230683 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content73% 
IMG OID641270607 
ProductROK family protein 
Protein accessionYP_001535991 
Protein GI159036738 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0163084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.226847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA CCCGGCTCCC CGGCACCCCC CGCCTGTTGC GGGCGCTCAA CGACCGCGCG 
GCGCTGGAGC TGTTGTTGGA GCGGGGACCG CTGACCCGGG CGCGGCTGGG CGAGCTGACC
GGTCTCTCCA AAGTCACCGC CTCGCAGTTG GTCGAGCGGC TTGAGGAGCG TGGGCTGGTC
ACCCGGGTTG GTGAGCAGGC GGGCGGCCGG GGCCCGAACG CCCAGCTCTA CGCGGTCCGA
CCGGGCAGCG CCCACGTGGT CGGGGTGGAC TTCGGGGCCG AACGGGTAGT CGCTGCCTGC
GCGGACATCA CCGGGGCGGT GGTCGGCCGG GTGGAGCAGT CGACCCGTGA CACCGACGAC
CCGGTCGGCG TGGTGCACAG TGCCGTCGCC CTGGCCGCGA GCAGTGCCCA GGTCGAACTG
TCGACCGTAC GCCGGATCGT GCTGGGCGCT CCCGGCCTTG TTGATCCGGC CAGTGGTGAC
ATCACCTTCG CGTTCAACCT GCCGCGGTGG CACGCCGGCC TGCTCGGCGC GCTTCGTGAT
GATCTCCACA TCCCGGTGGT GTTCGAGAAC GACGTGAACC TGGTGGCGAT GGCCGAGGCG
CGGTCGGGCG CCGCGCAGGG CGTGCCCGAC TTCGTGCTGG TTTGGGTGGA CGCCGGTATC
GGTCTGGCGA TCGTCTTCGG CGGCCGGTTG CATCATGGCA GCACCGGCGC CGCCGGGGAG
ATTGGCTGGC TGCCGATGCC CGGTGCGCCG ATCCCGCGTG CCGCTTCGCA CCGAGCAAAG
CCCGCGTTTC AGCAACTCGT CGGCGGGGAG GCAGTCCGCG CGCTGGCCAG TGAACGCGGG
TATCCGGATG AGACGGCGGC CGGTGGGGTG GCAGCGGCCG TCGCCGACGG CGCGACCGGT
GGCCCGATGC TCGACGAGTT GGCCCGTCGG CTCGCGCTCG GCGTGGCGAG CACCTGCGTG
GTGCTGGATC CACCGCTGGT GGTGTTGGCC GGCGCGGTCG GCCGGGCCGG CGGTGCGGCG
CTGGCCGACC GAGTGCAGCA CGAGGTGGCG GCGATCGCCC CGGTCCGGCC CCGGGTGGTG
CCGACCGGGC TGACCGAGGA GCCGATCCTG CGCGGCGCGC TGCACACCGC CCTGGAGGCT
GTCCGGGACG AGGTGTTCGA CTCCACAACC GGCTGA
 
Protein sequence
MTTTRLPGTP RLLRALNDRA ALELLLERGP LTRARLGELT GLSKVTASQL VERLEERGLV 
TRVGEQAGGR GPNAQLYAVR PGSAHVVGVD FGAERVVAAC ADITGAVVGR VEQSTRDTDD
PVGVVHSAVA LAASSAQVEL STVRRIVLGA PGLVDPASGD ITFAFNLPRW HAGLLGALRD
DLHIPVVFEN DVNLVAMAEA RSGAAQGVPD FVLVWVDAGI GLAIVFGGRL HHGSTGAAGE
IGWLPMPGAP IPRAASHRAK PAFQQLVGGE AVRALASERG YPDETAAGGV AAAVADGATG
GPMLDELARR LALGVASTCV VLDPPLVVLA GAVGRAGGAA LADRVQHEVA AIAPVRPRVV
PTGLTEEPIL RGALHTALEA VRDEVFDSTT G