Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5942 |
Symbol | |
ID | 8016362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 486158 |
End bp | 487399 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644828055 |
Product | glycosyl transferase family 28 |
Protein accession | YP_002979255 |
Protein GI | 241518627 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0547483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTAG CCATTCATGC GCTCGGCACG CGCGGAGACG TTCAACCCTA TGTCGCTCTG GCATTGGGAT TGATCGAGCG AGGACATCGA GTACAGCTCG CTGCTCCGGT TCAGTTCGAG AGCATGGTGC AAGACCACGG CATCGCATTT GCCCCCCTGC CTGGAGAGTT TCTCGCTCTT CTCGATACTC CGGAAGGAAA GGCGGCGATC GCCGGCAGCA AGGGCTTCAG TGCGGGTTTG AAGCTGCTAA AGTACGTCCG TCCGATGATG CGAACCCTGC TAGACGCGGA ATGGAGAGCA GCGCAGGCCT TCAACCCCGA CATCTTCGTG CATCATCCGA AGGCAATCGC GGTGCCACAC ATGGCGGAGG CGCTTCAGTG CCCATTTATT CTGGCCTCGC CCCTGCCTGG CTTTACGCCG ACCGCCACTT TTCCCAGCCC GATGTTGCCT TTCAGAGATC TGGGCTGGTT CAACCGGATC AGCCATATCG CGGCGATCAG GGGCGCGGAA CTTCTGTTCG GCACGTTGCT CTCGACCTGG AGGGTGGAAC AGCTTGGTCT GGCGCGACGC AGGACGCCAG CTATCGCTTC GAATGGCACG CTCTACGCCT ATAGTCGCCA TGTCGTGCCG GTCCCTCCGG ACTGGGGCAG TGACGTGCTG GTAAGTGGCT ACTGGTTTCT CGACAGCAAG AACTGGCGAC CTCCAGACGA TTTGGCAGCA TTCCTCGCGG ATGGGAAGCC GCCAATCTAC GTTGGCTTCG GAAGCATGCC GGGCGTCGAT CCGGGCCGAA TGACAGCCAC TGTTGTCGAG GCCCTCGCAA GGCAGGGCAA GCGGGGTATC TTGGCTTTGG GAGGCGGTGC TCTGGCCGCG GACCATAAAT CCGGTCATGT CCACGTCGTC CGCGACGCCC CCCACGACTG GTTGTTTCCC GAGGTGAGCG CGGTCATCCA CCACGGCGGC GCCGGAACGA CCGCGGCCGC TCTTCGGGCC GGCAAGCCTA TGATCATTTG CCCATTTTTC GGCGATCAAC CGTTCTGGGC AAGGCGTGTA ACAGACCTCG GCGTCGGACT GTCACTCGAT CGCAGAGCAT TGACCGTCGA GAGCCTGACA GATGCACTCG CAGCCATGGA CGATCCACAT ATGCGACGCC AGGCAGATGC CCTTGGCTCT AGGATTCGGG ACGAAGATGG GGTTGCGAAC GCAGTCGGTT TCATCGAGGC TGCTGCGGAC AAACTGCATT GA
|
Protein sequence | MRVAIHALGT RGDVQPYVAL ALGLIERGHR VQLAAPVQFE SMVQDHGIAF APLPGEFLAL LDTPEGKAAI AGSKGFSAGL KLLKYVRPMM RTLLDAEWRA AQAFNPDIFV HHPKAIAVPH MAEALQCPFI LASPLPGFTP TATFPSPMLP FRDLGWFNRI SHIAAIRGAE LLFGTLLSTW RVEQLGLARR RTPAIASNGT LYAYSRHVVP VPPDWGSDVL VSGYWFLDSK NWRPPDDLAA FLADGKPPIY VGFGSMPGVD PGRMTATVVE ALARQGKRGI LALGGGALAA DHKSGHVHVV RDAPHDWLFP EVSAVIHHGG AGTTAAALRA GKPMIICPFF GDQPFWARRV TDLGVGLSLD RRALTVESLT DALAAMDDPH MRRQADALGS RIRDEDGVAN AVGFIEAAAD KLH
|
| |