Gene Rleg_4939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4939 
Symbol 
ID8007389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp316886 
End bp318145 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID644821856 
ProductROK family protein 
Protein accessionYP_002973116 
Protein GI241113281 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTTTG AGAGCCAGCC GTCCTATTCG TTGGGCCAGC GCACCCCTGC CACCAGAGAA 
GTCGCCGCCG GCGAGAACAG CGGCAACGTT ACGCTTGTCT CCCAATCGGC ACTCGGCGCC
ATAAATCGCG GTCGCGTACT TCAAGCTTTG TATGATAACG GACCCAAAAG CCGGGCGGAT
CTCGCTCGAC TTGCGGGCGT CAACCGGACC ACGATCACCG GTATTGTGCA GCCGATGATC
GAAGACCAGC TTCTTATCGA GGGAGATGCG TCGCCTTCCG ACGTCAAAGG CGGCAAGCCG
GCTCGTCCCC TTTATTTTAA CCCCGATGCA CCGATGCTTG GCGCAGTCCT CCTTCTGCCC
GGCACGATAC AATCATGCCT CGTGGCTCTA ACTGGCGAGA TCAAGGCCGT TACGAAAGCT
GAGTTTGATC CGCATGGCGA CACAGAAGCA TTCATCGCTG TCATGACGAA GACGCTTACT
GCCACACTGT CTCAGGCCCA GCGGGCACCG TTTGGCATTG GCGTGGCTTC TGCTGGAATG
ATCGACAGTG ACAAAGGAAC AATTCTTACC GTCAACCTTG CTCCCGTTCT AACGGGACTA
CCTCTTGTAG CGATACTACA AGAACGCTTC TCTCTTCCCG TTGTTATCGA TCATCACCCT
CGTGCCTTGC TTGTTGGGGA CAGATGGTTC GGGCCCGGCC GCGGCCAACA AAATTTTGCC
GCGGTCTATA CCGGCGAGGT GCTTGGCGGC GCCTTCTTCA TCGACGGCAA GGTTTATCGT
GGACTCGCCG GATCCGGCGG TGAGCTCGGG CACAGCGTGG TTCAGATCGA CGGTGCCCTT
TGCAATTGTG GAAAGCACGG TTGCTGGGAG ACGGTAGCTG CCCTTCCGTG GCTACGAAAA
GAAGCCGTCC GAATGGGCTT ACCACATCCC CGAAGCGTCA CCTGTGCCAG ACTTGTCAAG
GAAACAGACG AAGGCTCGAA TGCGGCAGAG GAACTTCTCG ACCGTTATAC ACGCAACGTG
GCGTTCGGCA TCGTCAACCT GCAGCAAACA CTGTCCCTCA ACTCCTACGT CCTTCACGGA
GACGTCGCCG GAGGCGGAAT GAAGGCTGCG GAGCTGATCA GACGGCATGT CAAGCAGCTA
GTGGTAAAGA GACCTGGTCA GGAGATATCA ATCACAGTGA ATGGTATCGG CGAAGGCCAT
ACGGCTCTAC GTGGCGCCGC GGGTCTGGTT TTATCCAGCC ACCTAAAGCT AGTCATTTGA
 
Protein sequence
MKFESQPSYS LGQRTPATRE VAAGENSGNV TLVSQSALGA INRGRVLQAL YDNGPKSRAD 
LARLAGVNRT TITGIVQPMI EDQLLIEGDA SPSDVKGGKP ARPLYFNPDA PMLGAVLLLP
GTIQSCLVAL TGEIKAVTKA EFDPHGDTEA FIAVMTKTLT ATLSQAQRAP FGIGVASAGM
IDSDKGTILT VNLAPVLTGL PLVAILQERF SLPVVIDHHP RALLVGDRWF GPGRGQQNFA
AVYTGEVLGG AFFIDGKVYR GLAGSGGELG HSVVQIDGAL CNCGKHGCWE TVAALPWLRK
EAVRMGLPHP RSVTCARLVK ETDEGSNAAE ELLDRYTRNV AFGIVNLQQT LSLNSYVLHG
DVAGGGMKAA ELIRRHVKQL VVKRPGQEIS ITVNGIGEGH TALRGAAGLV LSSHLKLVI