Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4710 |
Symbol | |
ID | 5318860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1227899 |
End bp | 1230148 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776508 |
Product | glycosyl transferase family protein |
Protein accession | YP_001313440 |
Protein GI | 150376844 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.436323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.224848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTCG ACGACCGCAT GGGGGCCGCC AGCGGCATTT TCGAGGACGC GAAGGTTTTT CGGCTCGGTT CCGAAGTGGC CGTATTGATC TCGGACCTCA AGAAAAAACT GCCGGCCGCA GCAAAACACA GGCTCGTCAT GCCGGAGCAA CCCGTGCCGC TGGTCAGCAC CATCCTTCCT CTGGCGGAAG GCGGTCAGCG GGTCCTTTGG GCCATGCGCC CAGGGGGCGA ACCGCGCCGC GGACAGATTT CCGTCGATAG CGACTTCGTC CGAACGGTTG TCCTGCAGCC CGTCGGCGAA CTTCCGCCCC TCGACGTGGA AGCCCTGTTT GCCGCGCTAA CACCGGAGGG CTGCGTCAAG TTTGTGAACA CTCTGCTTAC TGTTTGGCGG AGCGCCTTCC GCCTGTCCCG GGATGCATTC TTCATAGGCC TCGTCGAGGA CGCCCTTCAT GCGCTGACAT CGAGGCCGCG GCCTGCAAAG ATCGCTTGCC CGATCATGCC GGGCCGGTAT CTGATCGAGA CCGCGATAAC TCCCGACTTC GGTGAGATCA GCGCGGTCTA TTCGCTCGGT GCCAATGCCG TTTTGCCGCT GGAGTCGCAA GCTGTGACCG GCCCAGATCG CGAGCACGAC CTGCGCCCCT GCCACTTCAT TGTCGAGTCT CCTCGACACC CTCAATCCTA TGTTCTCGTC GGAAAACGAG GCGTGGCTGT TCGCGAACTC TCCTCCGGAA AACCACATTA CACCAGTCTC CAGGCATGGT GGGCCGAACG GGGTGGCGCG CCGGAGCTGC GCGAATTCGT CGTCCGATGG CTTTCGACGA CACCGGAGGG CGGACATTCG ACCGCCGTCG ACCTCCAGCT TCGGACACCG CTTCCGGCAA GGCGGATCGA CAGATCATCG ATGCATCCTT CCGCAGAAGT GGATCTCGCA TTGAGCCTTT CGGGCGGTCT GCTCGCCGGG GGCTGGTCCC ATGATCCAAC AGCCACGCTC GCCGGCATCG ACTACCTGAA TGAGAACGGC ACGGCGGTGC CCCTCGACGG CAATTGGTAC GAGTTTCCTG CCTGGGCACG CGGAACGGAC GACAATACGC GCGCCGACGT AACCGGCTTC GTCGCCTGGC TGCCGTTAAA CGAGGCGCCG GGCGCCTTGC TCCAGCCGCG CTTCCAGATG CGGCTCGCTT CCGGCGTCGT AAAACCGCTC GTGCCGAAGC CACAACCCTT CGAAGCTTCG GCGCAGCGCA ACCGCATCCT GCGCGCGGTG CCGCCCCAGC ACGCAGTCGA CCTGGCCTTC CGGACGATCC TTGCCCCCGC GCTGCAGGAC GTAGAACATC GATTGGGCAG AACTGCCACG GTCGACTATA CCAAGGATTA CGACCTGCCG CAGACGGCGC CCCTGGTTTC GATCGTCGTG CCGCTTTACC GTGTTCTCGA TTTTCTGCGG TTCCAGCTCT CCGGCATGGC GACCGACCGC TGGCTCGCAA AGAATGTCGA GATCATCTAC GTTCTGGATT CACCCGAGAT CCAGGACGAG ACGGAGCATC TTCTTGGCGG GCTGCATCTC CTCCACGGGC TGCCGATGAA GCTCGTGGTG ATGAGCCGCA ACGGGGGCTA TGCCCGGGCC TGCAATGCCG GCGCGCGCTT CGCCCGCGGC GCGGTCATCG TCATGCTGAA CTCCGACGTC GTTCCGTCAG CAGCCGGCTG GCTGCAGAGG CTCATCCGGC CGCTCATGGA ACAGAAGAGC CTCGGCGCCA TCGGCCCGAA GCTGATCTTC GAGGACGGAT CGCTCCAGCA CGCGGGACTC TATTTCGCTC GCGATCAGCG CGGCATATGG CTCAACCACC ATTTCCACAA GGGCATGCCA GGCGACTATG CGCCCGCTCA ATTTTCCAGG AGCGTCCCGG GGATCACAGG TGCGTGCCTC GTCACGCGTC GGGAGACCTA CGAGCGCGTG GATGGATATA CGGAGGATTA CGTAATCGGC GACTACGAGG ATAGCGACTT GTGCCTGAAG ATCCGCCGGT GCGGCTATGA CATCGTCTAC GAGCCGTCTG CGTGTCTTTA TCATTTCGAA CGCCGCTCGA TCCGCCGCAG CGAGGACTAC ATGCGCGGCG TCGCCAGCCA GTATAACTCC TGGCTGCACA CGGAGCGGTG GGATGACGAT ATCGAGGCGC TGATGGCAGC ATATCTCGGC AGCGGCATCC AAGGGCTCGT CCGCCAAGAG GCCGGTGCCA CAGCGAGGAG CGCCGCATGA
|
Protein sequence | MTFDDRMGAA SGIFEDAKVF RLGSEVAVLI SDLKKKLPAA AKHRLVMPEQ PVPLVSTILP LAEGGQRVLW AMRPGGEPRR GQISVDSDFV RTVVLQPVGE LPPLDVEALF AALTPEGCVK FVNTLLTVWR SAFRLSRDAF FIGLVEDALH ALTSRPRPAK IACPIMPGRY LIETAITPDF GEISAVYSLG ANAVLPLESQ AVTGPDREHD LRPCHFIVES PRHPQSYVLV GKRGVAVREL SSGKPHYTSL QAWWAERGGA PELREFVVRW LSTTPEGGHS TAVDLQLRTP LPARRIDRSS MHPSAEVDLA LSLSGGLLAG GWSHDPTATL AGIDYLNENG TAVPLDGNWY EFPAWARGTD DNTRADVTGF VAWLPLNEAP GALLQPRFQM RLASGVVKPL VPKPQPFEAS AQRNRILRAV PPQHAVDLAF RTILAPALQD VEHRLGRTAT VDYTKDYDLP QTAPLVSIVV PLYRVLDFLR FQLSGMATDR WLAKNVEIIY VLDSPEIQDE TEHLLGGLHL LHGLPMKLVV MSRNGGYARA CNAGARFARG AVIVMLNSDV VPSAAGWLQR LIRPLMEQKS LGAIGPKLIF EDGSLQHAGL YFARDQRGIW LNHHFHKGMP GDYAPAQFSR SVPGITGACL VTRRETYERV DGYTEDYVIG DYEDSDLCLK IRRCGYDIVY EPSACLYHFE RRSIRRSEDY MRGVASQYNS WLHTERWDDD IEALMAAYLG SGIQGLVRQE AGATARSAA
|
| |