Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5226 |
Symbol | |
ID | 8007121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 637733 |
End bp | 638698 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644822135 |
Product | ROK family protein |
Protein accession | YP_002973395 |
Protein GI | 241113560 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.515111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGG TTGCCATTGG AATCGACCTC GGTGGGACGC AGGTGCGCGC TGCCCTCGTC GACGAGCAGG GCAGGATTCT GGCGCGCGCA GCCGAACCGA CGGATGCGTT GGCGGGCCCC GACCGCGTGC TCGCCCAAAT CTGCGGCCTG ACCGACGGAC TGCTTGCAGC ATCGAACCCC GCTTCGGTGG TGGGCGTCGG CGTATCCGCT CCGGGCCCGC TCGATACGGT CGCCGGTGTC GCCTCGAATA TCCCGACCCT CTCCGGCTTT GTCGACTTTC CGCTGAAGGC GGAGTTGCAG AAGCGGTTTC CGTTTCCGGT CGACCTCGAG AATGATGCGA TTGCCGCTGC CATCGGTGAG TGGCAGTTCG GAGCCGGAAC GGGGCTCGAC AATTTGGTGT ATGTCACCGT GAGCACTGGC ATTGGCGGTG GCGTTGTGTC GGATGGTCGC GTCGTGCGCG GCCGCAAGGG CATGGCAGCC CATGTTGGGC ACATGTCGGT CGTGCCGAAC GGAGAGCTTT GCCCCTGCGG CAACAGGGGT TGTTTCGAGG CCTACGGATC CGGAACGGCA TTTGCGCGCC GCGCCCAAAT CAGGGCTGTG GAATCCAGCG CGACGACAAT AGGCAGCGAT GGCGGCGCCA TCGATAGCCG CAGCGTTTTT GCAGCAGCAA GAAATGGCGA TCGTCTCGCA AATCAACTGA TTGACGAGGA AGCGGAAATT CTCGGTCGCG GCTTCACCAG CCTGATCCAT ATCTTCAGTC CCGATATCAT CGTGATGGGA GGCGGTCTTT CCCACGAGTT CGACCGACTG CAACCCGGCA TTCAAGGCTA CATCACGCAA TGGGCAATGC CGGCATTCAA GGATGTCAGG GTAATGCTGG CGGCGTTGGA CCAGAACTCG GGCCTCGTCG GCGCAGCTGC TCTGGCGTTT CTGACCGGAA AGGTCCCGGC GGTCGATCAG ATTTAG
|
Protein sequence | MQQVAIGIDL GGTQVRAALV DEQGRILARA AEPTDALAGP DRVLAQICGL TDGLLAASNP ASVVGVGVSA PGPLDTVAGV ASNIPTLSGF VDFPLKAELQ KRFPFPVDLE NDAIAAAIGE WQFGAGTGLD NLVYVTVSTG IGGGVVSDGR VVRGRKGMAA HVGHMSVVPN GELCPCGNRG CFEAYGSGTA FARRAQIRAV ESSATTIGSD GGAIDSRSVF AAARNGDRLA NQLIDEEAEI LGRGFTSLIH IFSPDIIVMG GGLSHEFDRL QPGIQGYITQ WAMPAFKDVR VMLAALDQNS GLVGAAALAF LTGKVPAVDQ I
|
| |