Gene Information |
Locus tag | Smed_4598 |
Symbol | |
ID | 5318508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1098063 |
End bp | 1099217 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640776399 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_001313331 |
Protein GI | 150376735 |
COG category | [R] General function prediction only |
COG ID | [COG4671] Predicted glycosyl transferase |
TIGRFAM ID | |
|
|
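The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that the listed Gene Length supports) and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only:

```python
# Values copied from the record above.
start_bp = 1098063
end_bp = 1099217

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1155
print(protein_length)   # 384
```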
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.312674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.707093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCAGCC CCCGCGTCCT TTTCTATGTC CAGCACCTTC TCGGTATAGG GCATCTGGCG CGCGCCAGCC GTATAGCCGA AGCCCTCATC AGGCGGGATT TCGACGTCAC GATGGTGACG GGCGGTACGC CGGTACCGGG CTTCCCGGGG GATGGCGTGA AGACCATCGC TCTGCCGCCG GTGACGGCAG GCGACAAGGG CTTTTCGGGG CTCGTCGACG GCGAGGGCCG CGCGGTCACG GCGGCTTTCC AGGAACATCG CCGGGATCTG CTGCTCGAGA TCTACCGCCG TGTGGTGCCT GAAGTCGTCA TCATAGAGGC CTTCCCCTTC GGGCGCCGGC AGATGCGGTT CGAGCTGCTG CCGCTGCTCG CCGAGATTGC CGCGAGTGAC CGGCCGCCGC TCGTCGCGAC GTCGCTGCGC GACATTCTGC AGGAGAGGCT CAAACCCGGG AGAGCCGAGG AAACCGTCGA AATCGTCAAG AACCATTTCG ACCTCGTACT CGTGCATGGC GATCCCGGAT TCGCGCGCAT CGAGGAGACG TTTCCCCTTG CGGGCGAAAT CAGCGGCAAG GTGGTATACA CCGGCCTCGT AGCGCCGCCG CGGCCGACTG GAGTGCCCGA GAAATTTGAC GTCGTTGTTT CGGCGGGTGG AGGAGCCGTC GGCAGCGCGC TGATTGGCGC GGCACTTGCG GCTGCCAAGC TTTTGCCGAA CGCACTTCGC TGGTGCCTTG TGACCGGCCC GAACCTGCCG CAGGCGGATT TCGACGCATT CGCGGCTGCG GCACCGCCGG GCGTAAGCCT CTTTCGGTTC CGGCGGGATT TCGGGGGCCT GCTTGCCGGT GCCCGCCTTT CCATCTCTCA AGCGGGCTAC AACACAGTGT GCGACATTTT GCGTGCCGGG TGCGCTTGCC TGCTCGTTCC CTTTACCGCG GGCGGCGAAA CCGAACAGCG TATGCGGGCC GCACGGCTTG AAGAGCTGGA CCTTGCCGGC GTTCTGCCGG AGGAGGGGAT CACGCCCGAG CTGCTGGCCG CGAAGGTCGG CGCGATGCTT GCCCGCCCGA AACCGGCCAT CCCGCCGCTG GACCTCGACG GAGCCGCCGG AACGGCGAGG ACCATCGAAG AGCGGCTTCC GGGCAGGCGG TCTTCAAAGG TTTAG
|
Protein sequence | MSSPRVLFYV QHLLGIGHLA RASRIAEALI RRDFDVTMVT GGTPVPGFPG DGVKTIALPP VTAGDKGFSG LVDGEGRAVT AAFQEHRRDL LLEIYRRVVP EVVIIEAFPF GRRQMRFELL PLLAEIAASD RPPLVATSLR DILQERLKPG RAEETVEIVK NHFDLVLVHG DPGFARIEET FPLAGEISGK VVYTGLVAPP RPTGVPEKFD VVVSAGGGAV GSALIGAALA AAKLLPNALR WCLVTGPNLP QADFDAFAAA APPGVSLFRF RRDFGGLLAG ARLSISQAGY NTVCDILRAG CACLLVPFTA GGETEQRMRA ARLEELDLAG VLPEEGITPE LLAAKVGAML ARPKPAIPPL DLDGAAGTAR TIEERLPGRR SSKV
|
| |
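The gene sequence begins with TTG while the protein sequence begins with M, which is the expected behaviour when an alternative initiation codon is read as methionine under translation table 11. The sketch below illustrates this on the first 30 bases of the Gene sequence field only. The codon table string is the standard amino acid assignment; the set of start codons is my understanding of the table 11 initiators and should be treated as an assumption, and the function names `translate` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon-to-amino-acid assignment, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons accepted under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as methionine."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator is always methionine, whatever the codon
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the Gene sequence field above.
prefix = "TTGAGCAGCC CCCGCGTCCT TTTCTATGTC"
print(translate(prefix))  # MSSPRVLFYV
```

Passing the complete Gene sequence field to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of the record.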