Gene Rleg_5083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5083 
Symbol 
ID8007676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp470319 
End bp471470 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content60% 
IMG OID644821998 
ProductROK family protein 
Protein accessionYP_002973258 
Protein GI241113423 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.122916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.209889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTATG ATCCGCTAGA CAATGTTGCC GCGCATTTTT TGCGAGACAC ACAACCGATG 
CGCGTGGCTT CCCGCAACGA GCGCGACCTG TTGCGGCTGA TCTGGAAGTC TCCGGGGATC
GAACGCTCCG ACCTGACAGA GCCGCTCGAT CTCACCCAGC AATCGTTGCA CAGGATCGTC
GCGCGGCTTC ATGAGCGAGG AATGCTGGTC TTTTCCCATT CGGAAACCCG GCGGCCGGGA
CCGCCCAGCC CGGAGCTGAC CCTTCGCAAA GACTGGTGTT TAACCCTGGG CATCTCCGTG
AATGTCGGTT CGATCGGCCT TTGCCTCATG GGTTTCGGCG AACCAATGGA AAACATGGAA
ATTCCACAGG CGGGCTCCTC GCTATCCGAT GAAATGGAGC GGATCGAAGC GGCGGTCGAA
GACATTCTTG CCCGCCGGGG CGCCAAACGG CGTGATGTCC TCGGCGTCGG TCTGGCCGTC
GCCGGCCACC GCATGCTGGA GACGGCCTTC AATTGCCCCT TGCCGCTCGC TCATTGGTCG
TTGATCGACC TGGCACCTTT GCTTGGAAAG CAACTTGGCC TGCCGGTCTG GGCAGACAAT
GTCGCCAGGA CCGCAGCCTT GGCGGAAGCC ATTTTCGGCG TTGGCCGCGA TGTCGCAGAT
TTCGCCTACA TCGCCCATCT TCATGGCTAT GGCGGCGGCC TGGTTTCCGG AGGCATGCCG
TTTCGCGGCA ATTTCGGCAA CGCCGGCGAA TTTTCCGTTC TCTTCGGGCG TCAGGACTAC
GAGGAACGCC CTGCACTCGG CGTCCTGCTT GAACATCTTC GTGCCAAAGG GCGCATGAAC
TTGACCCTGA GGGACCTGAA GAACGAGGAC CTGATGGACT GGGATGGCGT TGGCGAGTGG
GTCGACCGCG TCACGCCGGC CCACAATCGA GCCATCAATG CGATCTGCGC AATATTCGAC
CCGGCATTGA TCGTCCTCGG CGGCGAGCTT CCGCACTCAC TCGCGAGAAT GCTGATCGAA
CGGACCGAAT TCAACAATCT GCCTCGGCAT GGCGTTTTAC GCGACGTCCC CAGGCTTGAT
GTGGCGCAGA TAATCGATGC CCCCGGCGCA ATCGGCGCGG CCTTGATCCC GCTCTTCGAA
ACCGTTTTGT GA
 
Protein sequence
MKYDPLDNVA AHFLRDTQPM RVASRNERDL LRLIWKSPGI ERSDLTEPLD LTQQSLHRIV 
ARLHERGMLV FSHSETRRPG PPSPELTLRK DWCLTLGISV NVGSIGLCLM GFGEPMENME
IPQAGSSLSD EMERIEAAVE DILARRGAKR RDVLGVGLAV AGHRMLETAF NCPLPLAHWS
LIDLAPLLGK QLGLPVWADN VARTAALAEA IFGVGRDVAD FAYIAHLHGY GGGLVSGGMP
FRGNFGNAGE FSVLFGRQDY EERPALGVLL EHLRAKGRMN LTLRDLKNED LMDWDGVGEW
VDRVTPAHNR AINAICAIFD PALIVLGGEL PHSLARMLIE RTEFNNLPRH GVLRDVPRLD
VAQIIDAPGA IGAALIPLFE TVL