Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2973 |
Symbol | |
ID | 6981718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3033449 |
End bp | 3034429 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643397683 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002282466 |
Protein GI | 209550549 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.473581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGC AGCAAACATT CCCGCACTCA TCCGTCGGCA CCGATCAGGC AGGTGCACCG TTGAAGATCG CCGTCGGGGT ACTGACCTAT CGGCGTCTGG ACGGGATCGC CAAGCTGCTT GACGTCATGA CGCGACAGAT CAGGCACCCG GCCCGCCCCT TTCATCTCAC CATGGTGATC GTCGACAATG ATGCGGCCGG CAGCGCCAAG GCAACGGTGG AGGGCTTTGG CCAGACAGGC GCCTACGACC TGATCTACGT CGTCGAGCAA AACCAGGGCA TCCCTTTTGC GCGCAATCGC GCGCTGGATT CGGCACCGCC CGGCACCGAC CTCTTCTGCT TTCTCGACGA TGACGAATGG CCGGTCGACG GCTGGCTGGA CGCCATGCTG GAGACCCGCG AGAAAAACCG CGCCGATTGC GTCTACGGCC CCGTCCAGCC GGTCTATCCA GAAAATCCGC CGGAATATTT CATCAAGGCC AGAGTATTCG AGCGCAAGAA GAACATCGAC GGCCAGCGCA TCGGTTATGC GGCCTCGAAC AACGTCATGT TCGACTATCC GCTGATCCGT TCATGGAATC TTCGTTTTGA GGAAAAGATG CGCTTCACCG GCGGCACGGA CTATCTTTTC TTCAATCAGG CCATCCGCCG CGGCATGCAG ATTTTCTGGG CTGACAAGGC CCTGGTCTAC GACATCGTTC CCGCGAACCG GATGACCTGG AAATGGGTGT TGCAAAGACA GTACCGACTG GGCAATACCT TCGCCGTCAG CGAGGTTCTG CATGGCAACC TCAAGCGCAA GATCTATCGC GCCGCCTATG GCGCCACGAG GGTCGTGCTC GGACTGGTCA TGCTGCCTAC GATCTTGATT TCACCCTACT GGGGCATGCG CGCGCTCACC CATGTTCTGC GCGGCGCAGG CATGGTTAAC GGGATCCTCG GACATGCCTA TCAGGAATAC AAGCCCAATG CCGCCCACTG A
|
Protein sequence | MNQQQTFPHS SVGTDQAGAP LKIAVGVLTY RRLDGIAKLL DVMTRQIRHP ARPFHLTMVI VDNDAAGSAK ATVEGFGQTG AYDLIYVVEQ NQGIPFARNR ALDSAPPGTD LFCFLDDDEW PVDGWLDAML ETREKNRADC VYGPVQPVYP ENPPEYFIKA RVFERKKNID GQRIGYAASN NVMFDYPLIR SWNLRFEEKM RFTGGTDYLF FNQAIRRGMQ IFWADKALVY DIVPANRMTW KWVLQRQYRL GNTFAVSEVL HGNLKRKIYR AAYGATRVVL GLVMLPTILI SPYWGMRALT HVLRGAGMVN GILGHAYQEY KPNAAH
|
| |