Gene Sare_4248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4248 
Symbol 
ID5708098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4820287 
End bp4821786 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content75% 
IMG OID641273667 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001539020 
Protein GI159039767 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.380487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0166005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCAG TGTGGCGGGT CGCGGACGTA CGCGCCGCGG AGGCGGGCCT GATGGCGGCG 
CTTCCATCCG GGACGCTGAT GCAGCGCGCC GCGGCCGGGC TCGCCCGCCG GTGTGCACGT
GTCCTGACCG ACCGGGGCGG CGTCTACGGT GCCTCGGTGT TGCTGCTGGT CGGCTCGGGT
GACAACGGCG GTGACGCACT CTTCGCCGGT GCCCGCCTGG CCCGGCGCGG GGCGGCGGTG
TCAGCCCTGC TGTTGTCCCC GGATCGGGTA CACGCCGAGG CGTTGACGGC GTTGCGAGCC
GCCGGCGGCC GGCTGGTCGA GCGCCCCCCG GCACGGGTGG ACCTGGTGGT CGACGGCATC
GTCGGGATCG GTGGCAGCGG CGGGCTCCGT GAGCCGGCGG AACAGCTCGC GGCAAGTCTG
GCGGGGTGCT GTGGGCGAGA CGGTGACCGG GCGACCGTGG TCGCGGTGGA TGTTCCCAGT
GGGGTGTCGG TCGACACGGG GCACGTGCCG CAGTCCGCCT CCGGACGACC GGCGGCGGTC
CACGCCGACG TGACAGTGAC CTTCGGCGCG TTGAAACCCG CGCTGGTGGT GGGACCGGCG
GCGCCGCTCG CCGGCCAGGT CGATCTGGTC GACATCGGAC TGGAGCCCTG GCTGCGTAGC
ACGCCGGCGC TACGCGTCAC CGAGTGGGCG GATGTGACCG GCTGGTGGCC CACTCCTGGT
CCGGCAACCG AAAAGTACAC CCGGGGCGTC GTCGGGGTTG CGACCGGCTC GGCCACCTAT
CCCGGCGCCG CGGTGCTCTC GGTCGCCGGT GCCCTGGCCG GCCCGACCGG CATGGTGCGA
TACGCCGGGA GCGCTCGGGT CGAGGTGCTG CGCCAGCACC CGTCGGTGAT CGCCACCGAC
CGGGTCGCCG ACGCCGGCCG GGTGCAGGCG TGGGTATGCG GTTCCGGGCT CGGTACCGAT
GACGAGGCGG CCGGGGAACT GCGGGCGGTG CTCGCGGCGC CGGTGCCGGC GGTGCTCGAC
GCGGACGCGT TGACCCTGCT CGTGGACGGA TCCCTCGCCC ACCTGCTGCG GCGACGCGAC
GCCCCGATCG TGGTCACCCC GCACGACCGG GAGTTCGCCC GGCTCTGCGG CGAGACCCCC
GGGACCGACC GGGTCGCCGC CGCGCTGCGC CTGGCCGCCT GGATGAACGC CGTGGTGCTA
CTCAAGGGCG ACCGGACGGT GATCGGCACG CCGGACGGCC GGGCGTATGT CAATCAGACC
GGAACGCCGG CCCTGGCCAC CGGTGGCACG GGCGATGTGC TGGCCGGACT GCTTGGCTCG
TTGCTCGCCG CGGGCCTCAA CCCGGAGCGA GCCGCCGCCG CCGCGGCGTA CCTGCACGGG
CTGGCCGGCC GGGAGGCGGC CCAGGGTGGC CCGGTCACCG CTCCCGATGT CGCCACCGCG
CTGCGCCCGG TGCTGGCTCG CGTCGGGTGG ATCGACGGGC GGGCTGGGCC GAACTGCTGA
 
Protein sequence
MRPVWRVADV RAAEAGLMAA LPSGTLMQRA AAGLARRCAR VLTDRGGVYG ASVLLLVGSG 
DNGGDALFAG ARLARRGAAV SALLLSPDRV HAEALTALRA AGGRLVERPP ARVDLVVDGI
VGIGGSGGLR EPAEQLAASL AGCCGRDGDR ATVVAVDVPS GVSVDTGHVP QSASGRPAAV
HADVTVTFGA LKPALVVGPA APLAGQVDLV DIGLEPWLRS TPALRVTEWA DVTGWWPTPG
PATEKYTRGV VGVATGSATY PGAAVLSVAG ALAGPTGMVR YAGSARVEVL RQHPSVIATD
RVADAGRVQA WVCGSGLGTD DEAAGELRAV LAAPVPAVLD ADALTLLVDG SLAHLLRRRD
APIVVTPHDR EFARLCGETP GTDRVAAALR LAAWMNAVVL LKGDRTVIGT PDGRAYVNQT
GTPALATGGT GDVLAGLLGS LLAAGLNPER AAAAAAYLHG LAGREAAQGG PVTAPDVATA
LRPVLARVGW IDGRAGPNC