Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5686 |
Symbol | |
ID | 5319988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 652389 |
End bp | 655631 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777413 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001314345 |
Protein GI | 150377750 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAC GTTTGCGCAT AGTGCTTGCG ACCGACAGCG TGGATCCATC CGGAATGGGT GAGCACATGC TGACGCTCGG TCGGGCGCTC CAGGGCCAGT TCGATGTGAC GCTTGCCGCC ATCGATGGGA TCGAAGCCGC TCTGCTGACA AGGGCTGCGC GTTGCGGCGT CGCGGTCAAA GGCATTGATG ACAATGCATC GTTCGAACAT TGGCTGGAGT TTTCGGGCGT TTCTTTGCTC CACGTTCACG CTGGCATCGG TTGGGAGGGG CATGAGATAG CGCGCGCCGG GGTCGCCTGC GGCATCCCGG TTATCCGTAC GGAGCATCTC CCCTATCTGC TCACCGATGC CGAGCAGAAG GAGCGCTATG CCAGAGAAAG TGATGCTCTT ACGCATCATA TTGTCGTTTC GGAAGCATCG AAGTTAAGCT TCGAGAGCAA GGGGGTCAAG GGCAACCGAA TGACGGTCGT CCGCAATGGA ATCTTTCCGC TAGGCGCACG GGACATTGCC GGTCGCAGCA AAGTCGCACT TGGGCTCGAC GGGAAAAGCG TCCTCATTAC GGTGGCGCGC TTTTCTAAGC AGAAGGACCA CGCCACCCTG ATTAAGGCAA TGCCGGCAGT ATTAGCCGCG GACCCGTCGG TCGTTTTGCT TCTGGTAGGC AAAGGCGAGG AATTGGAGGC CGTTCGGGCC CTGGTCGAAG ACCTGTCGCT TGGCCCGCAT GTCCAATTTC TGGGGCATCG CATCGAGGTC GACCAGTTGA TGGGCAACGC CGACCTTTTC GTCCTGCCGT CGCGATTCGA AGGGCTTCCC TTAGCAGTGC TCGAAGCAAT GTCGATCGGA CTACCGGTCG TCGCAACCCG GATCGGCGGC ACCGTCGAGG CGCTTGGATC GGAACATCCG TTCCTGGCTG AGTGCGAAGA TCCGTCCTCA TTGGCTCGCG TGCTGATCGA AGCTTTGAGC GACCCGGAAC GGGCGAAAAC GATCGGCCGG GCCGGCCGGG CCAGGTTCGA CACTGAATTT TCAGCGCGGC GAATGGCGGA CGAAACCGCG GCTGTCTATC GGCGATTTCT TTCCGAGCGG ACGGAGAATA AACAAGGACA CCATTTTATG GACAAGACAC GTGTCGGCTT CATTGGCGTC GGGGGCATCG CCCACCGGCA TCTTGATATT TTCGCCGGTT TCGAGGACGT GGCGCTCGTT GCTTTTGCAG ACCCGGACTT CGCACGCGCC GAGCATGCCG CGTCGCGGTT CGGTGCGAAG GCGTTCGAGA GCCACAGTGC GATGCTGGAG AAAGAGGCGC TGGATGCGGT CTATATCTGC ATTCCCCCCT TTGCCCATGG CGACGCCGAA CGGGATCTGA TCGCGCGTGG CATCCCGTTC TTTGTCGAGA AGCCCATTAC GCTCGATATC GAGCTGGCGG AGGAACTCTG TGCCGCGATT GAAGCCGCAA AGCTGATCAC GGCTGTCGGT TATCATTGGC GCAATCTCGA TACAGTAGAG GAGGCGCGCC GGCTTCTGGC CGAAAATCCA GCTCAACTTC TTTCGGGCTA TTGGCTGGAC CAGACGCCAC CGCCGCAATG GTGGTGGAAG AACGACCGCT CAGGCGGCCA GATGGTTGAG CAGGCGACCC ACATCATTGA TCTGGCGCGA TACCTCATTG GTGAAGTCAC CGACGTTTAT GGCCGCGTCG GCTTCAAAGA CCGCCCCGAA TTTCCAGGCC TCGATGTGCC GGCGGTGACC ACTGCGAGCC TCACCTTCCA ATCGGGCGTT ATCGGCAACA TCTCCTCGAC ATGTCTTCTC GGCTGGAGCC ACAGGGTCGG ACTGAACATC TTCGCCGACC GACTTGCGAT CGAGCTGACC GACCATGACA TCATGATCGA TGTGGGTGCT GGTCGACCAG TGCGGCATGC CCAGGGCGAT CCCGTCTGGC GAGAAGATCG CGATTTTGTC GACGCCGTGC GCGGGGGAAA TAATAACATT CGCTGTGCCT ACGCGGATGC GCTGGCGACG CACCGGCTCG CGCTGGCGGT CGCTTCCTCG GCGCGCAGCG GCGAGCCGAT AAAGCTCGAT CCGCCTGCCA TAGTCCGCAA CGCTGTGACG ACGCTGCAGT ATCCACCGTC GTCCGAAGCC TGTCAGGGAT TATCCCCAGG CCACCGTGCC ATTCGCTCTC TTGGCATTGA AGGGCCCGGA AAAGCATTCT TCTTCGACTA TCAAGAGGGT CCGCCCGCTG ACGGCCATGT CCGGTTGGAA ACGCTCTTTA CCGGCTTTTC CGCCGGCACC GAACTTACCT TCATGAAGAA CACTAATCCT TACTTCCACT CCCGTTTCGA TAGCGAGCGT GGCGTGTTCT TCGAGAACGA GCCGGATCTT CACTACCCGG TGCCTTTTCT GGGATACATG GAAGTGGCGC GCGTCTCGGA GTCGAAGGCT GCAGGCTTCA AGGAAGGGGA TGTCGTTGCC GCGACCTACG CGCACAAAAG CGGCCATACT GCCGATCCAT TCCTCGACTT GCTGGTGCCC TTGCCAGCCG AAATCGATCC CGTTCTTGGC GTCTTCGTAG CGCAAATGGG CCCTATCGCG GCCAACGGCA TTCTACATGC GGATGCGGAC GCTCTGGGGA GCAATGTATC CTGTCTCGGA GTGGGCGTTG CGGGGCGTCA CGTGGTCGTG CTCGGAGGAG GCACGGTTGG ATTGATGACG GCGCTTTTTG CGCAGAAGGC GGGGGCCTTG GAAATCGTCG TTGCCGATCC TTCGGCGTTT CGGCGGAACA AGGCGCGGGG CATGGGTTTG ATTGCGATGA CCGAGGATGA GGTCTGGCAG CACGCGAAAA CGCGATGGCA TAACGGCGGA AGTGATCGCG GCGCCGATGT CGTGTTTCAG ACACGGGCGC ATCCGTGGAG CCTCCACGTC GCGCTGAAGG CACTGCGCCC GCAGGGCACG GTCATTGATC TCGCATTCTA TCAGGGCGGC GCCGAGCGGC TGCGGCTAGG CGAAGAGTTT CATCACAATG GCCTCAACAT CCGCTGCGCA CAGATTAACC GCGTACCCAG AGGACTGGCG CCGCTGTGGA ACAGGCGCCG GCTTGCCGAG GAAACGGTCC AGCTCTTAAA AATCTACGGA GCTCTCATTC GCGAGCATAT GATCACGCAT GTCGTTCCAT TCGACGATGG GCCAAAATTC CTCGCTGATC TGGTGGAGAA CCGGCCGGAA TTCCTGCAGA TCGTCTTCAA GGTCAGCGGA TGA
|
Protein sequence | MNKRLRIVLA TDSVDPSGMG EHMLTLGRAL QGQFDVTLAA IDGIEAALLT RAARCGVAVK GIDDNASFEH WLEFSGVSLL HVHAGIGWEG HEIARAGVAC GIPVIRTEHL PYLLTDAEQK ERYARESDAL THHIVVSEAS KLSFESKGVK GNRMTVVRNG IFPLGARDIA GRSKVALGLD GKSVLITVAR FSKQKDHATL IKAMPAVLAA DPSVVLLLVG KGEELEAVRA LVEDLSLGPH VQFLGHRIEV DQLMGNADLF VLPSRFEGLP LAVLEAMSIG LPVVATRIGG TVEALGSEHP FLAECEDPSS LARVLIEALS DPERAKTIGR AGRARFDTEF SARRMADETA AVYRRFLSER TENKQGHHFM DKTRVGFIGV GGIAHRHLDI FAGFEDVALV AFADPDFARA EHAASRFGAK AFESHSAMLE KEALDAVYIC IPPFAHGDAE RDLIARGIPF FVEKPITLDI ELAEELCAAI EAAKLITAVG YHWRNLDTVE EARRLLAENP AQLLSGYWLD QTPPPQWWWK NDRSGGQMVE QATHIIDLAR YLIGEVTDVY GRVGFKDRPE FPGLDVPAVT TASLTFQSGV IGNISSTCLL GWSHRVGLNI FADRLAIELT DHDIMIDVGA GRPVRHAQGD PVWREDRDFV DAVRGGNNNI RCAYADALAT HRLALAVASS ARSGEPIKLD PPAIVRNAVT TLQYPPSSEA CQGLSPGHRA IRSLGIEGPG KAFFFDYQEG PPADGHVRLE TLFTGFSAGT ELTFMKNTNP YFHSRFDSER GVFFENEPDL HYPVPFLGYM EVARVSESKA AGFKEGDVVA ATYAHKSGHT ADPFLDLLVP LPAEIDPVLG VFVAQMGPIA ANGILHADAD ALGSNVSCLG VGVAGRHVVV LGGGTVGLMT ALFAQKAGAL EIVVADPSAF RRNKARGMGL IAMTEDEVWQ HAKTRWHNGG SDRGADVVFQ TRAHPWSLHV ALKALRPQGT VIDLAFYQGG AERLRLGEEF HHNGLNIRCA QINRVPRGLA PLWNRRRLAE ETVQLLKIYG ALIREHMITH VVPFDDGPKF LADLVENRPE FLQIVFKVSG
|
| |