Gene Rleg2_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1738 
Symbol 
ID6980475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1778807 
End bp1780279 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID643396461 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_002281251 
Protein GI209549334 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.203791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACC AACTGTCCGA CCTTCTTCTC ACGCCTGCCG AAATGGCCGC CGTCGACGCG 
GCCGCTGCCG CATCCGGTAT CGATTCTTTT GGCCTGATGG AAAGGGCAGG TGCGGCGGCT
GCGGCTGCGG CCCTGCGCCT TCATGCCGGA GCCCTGCGCT TCGTCGTGCT CTGCGGGCCG
GGCAACAATG GCGGCGACGC CTATGTCGCC GCACGACATC TGCAGGAGGG CGGGGCTCCA
GTGGCGCTCT TCCATCTTGG CGATCCCTCC AGGCTGAAGG GCGATGCAGC CCGTGCGAAA
GCCGGATGCG CGCTGCGGGG GGAACCGCTC CACCTCTATA GTCCCGAAAT CGGCGACGTC
GTCATTGACG GCCTGTTCGG CGCGGGGCTC GGCCGCGATG TGCCGGCCGA TGTCCGCGCG
GTGATCGATC GGGTCGCCGA GGCCGGTCTT CCCGTGCTTG CCATCGATCT GCCCTCCGGC
CTGGACGGCC GTACCGGCAG AGTGCTGGGA GCTGCCTTCC GCGCCAGCAA CACCATTACC
TTCATGACCC GCAAACCCGG CCATCTGCTG ATGCCGGGCA GGGAGCTTTG CGGTGAGTTG
GAGGTCTTCG ATATCGGCAT TCCCGCCCGC ATCATCAGGG CCGAGGCGAG TGGCGTCATC
GCCGAAAACA GGCCGGACGC CTGGAAGGGT GTGCTGCCGG CCGAGCAGCT GGAAACCCAC
AAATACAAGC GCGGTCATCT GGTCGTCTTC TCAGGCGAGG CTGATAAGAC GGGTGCGGCG
CGCATGTCGG CGATCTCGGG CCTGAAGGCG GGGGCCGGCC TAGTGACGAT CGCGGCCCCT
GATGCGGCGA TAGCCGCCAA TGCTGCGCAT CTCACCGCCG TCATGCTGCA TGCGATCGAT
GATGCGGCCG ACCTCGAAGA CTGGCTCACC GACAAGCGGC TGCAGACCTT CGTTCTCGGC
CCCGGTTTCG GCATCGGCGC CAGGGCGCGC GCCTTCGTCT CGGCGCTCGC CGAACGCCGG
CTGGTGCTCG ATGCCGACGG CATCTCCTCG TTCAAGGACG ATCCGCAGCA GCTTTTCGAT
CTTTTCGGCG GTGAGCCGCG CCTAGTGCTG ACGCCGCACG AGGGCGAATT TTCGCGGCTC
TTTCCCGATA TCGGCGGCGA CGAAGCGCTG GGGAAGGTGG ACAAGGCCCT GGCCGCCGCC
CGCCGCGCCA ACGCCGCGAT CGTCTATAAA GGCGCCGATA CCCTCATCGC CGCGCCGGAC
GGCCGTGCGC TGATCAATAC TAACGCTCCT GCCTGGCTTG CCACCGCCGG TTCCGGCGAC
GTGCTCGCCG GCATCATCGG CGGATTGCTC GCCCAGGGCC TGCCGGCCTT CGAGGCTGCG
GCCGCCGGCG TCTGGCTGCA TGGAGAGGCC GCCCACCGTG CCGGCAAGGG GCTGACGGCG
GAAGACCTCG CGGCTCATGT CTTGCCACTT TAA
 
Protein sequence
MSHQLSDLLL TPAEMAAVDA AAAASGIDSF GLMERAGAAA AAAALRLHAG ALRFVVLCGP 
GNNGGDAYVA ARHLQEGGAP VALFHLGDPS RLKGDAARAK AGCALRGEPL HLYSPEIGDV
VIDGLFGAGL GRDVPADVRA VIDRVAEAGL PVLAIDLPSG LDGRTGRVLG AAFRASNTIT
FMTRKPGHLL MPGRELCGEL EVFDIGIPAR IIRAEASGVI AENRPDAWKG VLPAEQLETH
KYKRGHLVVF SGEADKTGAA RMSAISGLKA GAGLVTIAAP DAAIAANAAH LTAVMLHAID
DAADLEDWLT DKRLQTFVLG PGFGIGARAR AFVSALAERR LVLDADGISS FKDDPQQLFD
LFGGEPRLVL TPHEGEFSRL FPDIGGDEAL GKVDKALAAA RRANAAIVYK GADTLIAAPD
GRALINTNAP AWLATAGSGD VLAGIIGGLL AQGLPAFEAA AAGVWLHGEA AHRAGKGLTA
EDLAAHVLPL