Gene RoseRS_3410 details


Gene Information

Locus tag: RoseRS_3410
Symbol:
ID: 5210387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4280945
End bp: 4282615
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 64%
IMG OID: 640597005
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001277718
Protein GI: 148657513
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
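
The coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4280945
end_bp = 4282615
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1671 0 556
```

Under those assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.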


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.802721
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.621844
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA TCGTCACTGC TGAACAAATG CGCGCCATCG AAGAAGCGGC GGTCGCCCGT 
GGCGCTACAT GGTCAGGATT GATGGAACAG GCTGGCGCCG GTGTGGCGCG TGTCGCCCTC
GAAGTCTCCG GCGATCCTGT GGGTCGTCGC GCGCTGGTGC TGGTCGGTCC GGGCAACAAC
GGCGGGGATG GGTTGGTGGC AGCGCGTCTG CTGCACGATG CTGGCATGCA GGTAACGCTG
TTCGTCTGGC GACGACGTGA GACTGTCGAG GACATCAACT GGCGGTTGTG TCGCGAACGC
GCTATCACCG AACTGGCTGC CGCCGATGAT CCGCAGGGTG CTGCGCTGCG GGCATCGCTT
GCCAGGGTGG ATCTGGTGAT CGATGCGTTG CTGGGGTATG GCGCCAATCG CCCGGTGGAA
GGCGAACTGG CGATGATTAT CGCCACGCTG AATGCGGCGA GGGCAACCGA TGCGCGCCGG
ACGAAACCAT ACGTTCTGGC GATCGATGTG CCTACCGGTG TTCACGCCGA TAGCGGCGCC
GTGCTCGGCA ATGCAGTGCG CGCCGACCTG ACCGTTTCGA CCGGTCCTGT CAAGCGTGGG
TTGTTGTTCT ACCCGGCGCG CGCATACGCT GGCGTGCTGC GCAGCGTCGA TATCGGGCTT
TCGCCCGCCG ATCTGGAGAG TGTGATGACC GACATGATCG ATATAGAACT GGCGCGCTCG
CTCCTTCCAC CCCGTCCGCC CGACTCGCAC AAGGGTACGT TCGGCAAGGT GCTGGTCGTT
GCCGGATCGA TAAACTACCC CGGTGCAGCG ACGCTTGCGA CCGCCGGAGC GGCGCGTGTC
GGCGCCGGTC TGGTGACGCT TGCGGTTGGG CGGAGTCAGG TGTACAGTCC AGGGCGCCTC
CCCGAAATCA CGTTGCACAT CCTTCCCGAA GCCGAACCGG GGGTGGTCGG CGATGCCGCT
GCCGACGAGG TGCTCTCCAT CCTCGAAGGG TATCAGGCGC TGCTTGTCGG TCCTGGTTTG
GGGCGCGAGA AGGCGACACG TGCGTTTCTT GAGCGGTTGC TGGGATTGCA ATCACCGCGT
CATCGTGGAC AGATCGGGTT TCGGATCGCC GCAGCTGGCA GCGAAAAACC GGTGGTCAAG
CAGCGACCAG AATTGCCGTT TACTGTGATC GACGCCGATG GGTTGAATAT TCTGGCAGAC
CTGATCCATC ACCCTGAAGC GACCGATGCC GCGTCTGGCA CGATCTGGAA CCGTCTGCCG
CGCGGCAGGT GCGTGTTGAC GCCGCATCCA GGCGAGATGC GGCGTCTGCT TGGCGTCGAG
GAATTGACCG GTCATCCGGT CGATGTGGCG AAGGAAGCGG CGATGCACTG GCAGCAGGTA
GTCGTGCTCA AAGGCGCCAC GACGGTCATT GCCGATCCGG AGGGTCGAGT GCGCGTCAAC
GATGGAGGAA ACCCGGCGCT GGCAACCGCC GGAACCGGCG ACGTGCTGGC GGGCGCCATC
GCCGGGTTGC TGGCGCAGGG GCTGGCGCCG TTCGATGCAG CGACACTTGG CGTCTATCTC
CACAGCGCAG CCGGTCGCCT GGTGCGCGAC GAACTCGGCG ATATGGGCGC ACTTGCGGGT
GATCTGCTCC CGCGTCTACC GCTGGCGATC CGCGCACTAA AACAGCCATG A
 
Protein sequence
MSKIVTAEQM RAIEEAAVAR GATWSGLMEQ AGAGVARVAL EVSGDPVGRR ALVLVGPGNN 
GGDGLVAARL LHDAGMQVTL FVWRRRETVE DINWRLCRER AITELAAADD PQGAALRASL
ARVDLVIDAL LGYGANRPVE GELAMIIATL NAARATDARR TKPYVLAIDV PTGVHADSGA
VLGNAVRADL TVSTGPVKRG LLFYPARAYA GVLRSVDIGL SPADLESVMT DMIDIELARS
LLPPRPPDSH KGTFGKVLVV AGSINYPGAA TLATAGAARV GAGLVTLAVG RSQVYSPGRL
PEITLHILPE AEPGVVGDAA ADEVLSILEG YQALLVGPGL GREKATRAFL ERLLGLQSPR
HRGQIGFRIA AAGSEKPVVK QRPELPFTVI DADGLNILAD LIHHPEATDA ASGTIWNRLP
RGRCVLTPHP GEMRRLLGVE ELTGHPVDVA KEAAMHWQQV VVLKGATTVI ADPEGRVRVN
DGGNPALATA GTGDVLAGAI AGLLAQGLAP FDAATLGVYL HSAAGRLVRD ELGDMGALAG
DLLPRLPLAI RALKQP
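
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code. The sketch below is a minimal, hand-rolled translator for checking that relationship on the spaced 10-base blocks used in this record. The function name `translate_cds` and the variable names are hypothetical and are not part of any library.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(spaced_dna: str) -> str:
    """Translate a DNA string (whitespace allowed) until the first stop codon."""
    dna = "".join(spaced_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAGCAAGA TCGTCACTGC TGAACAAATG CGCGCCATCG AAGAAGCGGC GGTCGCCCGT"
last_row = "GATCTGCTCC CGCGTCTACC GCTGGCGATC CGCGCACTAA AACAGCCATG A"

print(translate_cds(first_row))  # MSKIVTAEQMRAIEEAAVAR
print(translate_cds(last_row))   # DLLPRLPLAIRALKQP
```

Both outputs match the corresponding ends of the protein sequence listed above. The last row can be translated on its own because the preceding 27 rows hold 60 bases each, so it starts on a codon boundary.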