Gene Information |
Locus tag | Smed_2993 |
Symbol | |
ID | 5323870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3139813 |
End bp | 3143724 |
Gene Length | 3912 bp |
Protein Length | 1303 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640791944 |
Product | glycosyltransferase 36 |
Protein accession | YP_001328657 |
Protein GI | 150398190 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
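The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Smed_2993).
length = gene_length_bp(3139813, 3143724)
print(length)                     # 3912
print(protein_length_aa(length))  # 1303
```

Both results match the Gene Length (3912 bp) and Protein Length (1303 aa) fields listed in the record.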
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAT CAAATCAAGG CCGCAGACCG CCCGAGGTGA CGAGATACGA CAGCACGCCT GAGCTGGCGG ATACAGGCCC TGCCATCGCC CTGCTTTCCA ACGGCCGTTA CAGCGTCATG GTTACTGATG CGGGCGCGGG CTGTGCCATC TGGCGGAGCC TTGACGTCAC GCGGTGGCGC GAAGATGTCA CGCGCGATTG CTGGGGGCAG TTTTGCTACG TCCGAGAACA GGCGGAAAAA ACGGTGTGGT CTATCGGCAA GCAACCCGTT TGCAGGCCCG CAAACAGTTA CGCGCACTCT TTTCACGGAG ACCGGGCCGA GTTCTCCTGT CGTTCCGAAG ACATTGAAAT CTCCTGGAAA GTCTGCGTCG CGCAGGATGC CGACGCGGAA GTGCGTGAGC TTACCGTGGT CAACCACGGC GGACGAGAGC GGACTCTCGA TCTTACAAGC TATTCCGAGG TCTGCCTGAA CGACCGCCGC GCGGACAGGG CGCATCCGGC CTTTGCCAAG CTCTTCGTGG AGACGCATCA CGATGACGCG ACGGGCGCTT TGTTTGCGCG ACGCCGGCCG CGCGATGCGG CAGAGCCACC CGTCTGGGCC GTCCATGTTT CATCATCAGA TCAACAGATC CGGGCAGAAG TCGCCCATGA GACCGATCGG CTCAAATTCC TCGGACGGGG CCGCACGACG GCCGACCCCG CGATGCTGGA CCCAGGTGCC ATGCTTTCCA AGTCGACCGG GCCGGTGCTC GATGCGATCT TCTGCCTGCA AAGGATTGTC CATCTGAAAC CCAAAGGCAA GGTGCGCGCG GCCTTTGTCA CGGGAGCGAC AGACAATCAA CAGGCGGCCC AGGAAATCGC TGAACGCTAC TCGACAATCG AGGCGGCGGA CCGCGCCTTT TCCGAAGCGC TGGAGGCGTA TCGAGGCGAG CTGCAAAGCT CGAACTTAAC CGCCGATGAC GTGTCGCTGT TCAACCAGCT TGCCGGGAGC ATCCTGTTTG CAAATCCGGC CATGCGATTG GCCGGGGCAC TGAAGAAAAA TCCACTGGAT AGGGGAGTGC TCTGGTCTCA TGGAATTTCG GGTGATCTTC CGATCATGCT GGTTTGCGTA AATGGCGACG GGGATGCCGC TCTCATTCGC GAAGTGAGGA TGGCGCACGA CTTCGCCCGC CGCCGAGGGC TGCTGTTCGA TCTTGCCCTA TATGACAGGC GTGGCGCGCG CAGCGCTGAG CGGCTTGCCG AAGTCCTGCG AGCCGGCCCG CAAGCCGGAA TGCCGGGCAA GCCAGGCGGA ATCTTTGTCC TTTCTGCAGC GACGGCCCCG AGCGATCACC TAAATGCAAT CACGGCCGCA GCGCGGATCG TCTTGCCGGA TGGCGGCAAG TCTCTCGCAA GCCTGCTCGA CCGGCAATCC CCCGCCGCAT CACTTCCCGC ACGTATTCGG GCCGCTCAAA CGGGCGGAGA GCCCCCGGAG CCGGGTTCGA CCACATCTAC TAAAGACTTG CTGTACTGGA ACGGATTCGG GGGGTTCACC CCCGGGGGAC GCGAATATGT CGTCATGGTG GACGGCATCA AGTCGCGAGC CCCGGCCTTG CCGCCGGCAC CTTGGAGCAA TGTCTTGGCA AACCCGGACT TCGGCTGCCT GACGACTGAG GCCGGGCTCG GCTACACCTG GTGGGGCAAC AGCCAAATGA ACCGGTTGAC GCCGTGGTCG AACGACCCCG TTTCGGATAC GCCAGGCGAG GTCATCTACT TGAGGGACGA GGAGAGCGGC GATGTCTGGA CTCCGACGCC TCTGCCACTC 
GGACATGAGA CGTCCGTGAC TGTGCATCAC GGCCAGGGCT ACAGTCGCTA TGTAAGCCGC AGCCGACGCT TGAGCCACGA ATTGACGGTG CATGTTCCGC CGGAGGATCC GGTTAAAGTC ATGCATCTGA GGCTGAGCAA TGACGACGCC CGAACGCGCC ATCTGACCGC AACGTACTTC GCTGAATGGG TTCTCGGAAC CCAGCGTGAC GACACGGCAA TGCAAATCGT CTGCGAACGT GATGCGAAAT CAAATGCGAT CATTGCACGG AACCCATGGG CCGGCGATTT CGCGAAAAGG CTGGCGTTTG CGGCCGCCAG CCAGCCTCCG AGGTCGATGA CATCCGACCG CGCGGAATTT CTTGGGAAAC ACGGTTCAGT GTCCTCGCCA GCGGCCCTGG GGCGAACCGA TTTGGCAGAA AGCTTCGGTC CCTTGCAGGA CCCCTGTGCC GCGCTGATGG TGGAGATCTC CCTGGCACCC GGCGAGAGCA AGGAAGTTAC CCTTGTTCTG GGGCAGGCCC GCACCCGCGA GCAAGTCCGC AGACTCGTGC GCGATTATGC CGACCCGCAA CGTGCGATCG AAAGCTTCGC GGCCACATGC TGCCAATGGA ATGATATTCT GGATACGATT CAGGTCTCGA CACCGGACGT CGGCATGAAT CTGATGATGA ATCGCTGGCT GCCGTATCAG GTCCTGGCTT GTCGCGTCTG GGGGCGCTCT GGCTTTTATC AGTCGGGGGG CGCATTCGGT TTCAGGGACC AGCTTCAGGA TGTGATGGCA TTGGTCCACA GCGCGCCGGA CGAGACACGC GCACACATTC TTCGAGCAGC CGCGCGTCAA TTCGCGGAGG GAGACGTTCA ACACTGGTGG CACCCGCCGT CCGGCGTCGG CGTGCGCACC CGGATTACCG ACGACCTCTA TTTTCTGCCT TTCGTCGTTC ACCACTACGT TTCGACGACT GGCGATGTCG ACCTTCTTGA TGAGCAGGTG TCTTTCATAA CGTCACCGGT CCTCAAGGAA GGCCAGGAGG AGGACTTCGG CAAACCTGAC GTCGGCGAAC GGACCGACAC CCTCTACGAA CACTGTATCC GTGCATTGGA ATACGGCTTC CGGCTCGGCG AGCACGGTTT GCCGCTCATG GGAACGGGCG ATTGGAACGA CGGCATGAAC AAGGTCGGCG CCGAGGGACG AGGTGAAAGT GTTTGGAACG GCTGGTTCTT CCTGACGGTT CTGAAATCAT TTGCGACGAT CGCGTCATCG CGTGGAGACG AAAGCCGCGA AACCTGGTGC TGCGAACGTG CTGAAGGATT GCGCGCGGCC CTGGAAGCAC ACGCCTGGGA CGGCTCCTGG TATCGCCGCG CCTATTTTGA CGACGGCACG CCGCTCGGCT CCTCTTCGAA CGACGAGTGC CAGATCGACG CCCTTCCGCA GGCCTGGGCC GTCATCTCTG GCGAAGCCGA CGAGGAGCGG GCGTCGAAGG CGATGAGCGC GGCCTATCAA AGACTCGTGC GCCGGCAGGA CAAGCTGATC CAGTTGTTCG ATCCGCCATT CGACAAGGGT TCGCTGCAGC CGGGCTACAT CAAGGGCTAT GTTCCCGGCA TACGAGAAAA TGGCGGGCAA TATACCCATG CGGCTGCCTG GGTCGTGCTG GCCACTGCGC TGCAAGGCGA CGGCGAACGG GCGCTGGAGC TTTGGAACCT GCTCAATCCG ATCAATCACG CGGCGACGAA GCAGGAGGCT CAGCATTACA GGGTAGAGCC ATATGTCGTC AGCGCCGATG TCTACGGCGC ACCGCCAAAT ACCGGCCGCG 
GTGGCTGGAC ATGGTATACG GGAGCAGCAG GCTGGTTGTA TCGCGTCGCG CTGGAGGCGA TGCTCGGTTT TCGGCGGCAG GCCCAGTTTC TTTCGATCGA ACCATGTGTA CCAGCCGACT GGCCAGAGTT CGAAATCAAA TACAGGCACG GATCATCGAC ATACCGCATC CACGTGGACA ACCCGGCCGG CGTCTGTCGC GGGGTACGTT CGATTCTCCT CGATGACACA CCGCTGGCGG ACACAAAGGT ACCGTTAACA GATGATGGTC GTTTCCACGA CGTCCGAGTG ATCCTTGGCT AG
|
Protein sequence | MSKSNQGRRP PEVTRYDSTP ELADTGPAIA LLSNGRYSVM VTDAGAGCAI WRSLDVTRWR EDVTRDCWGQ FCYVREQAEK TVWSIGKQPV CRPANSYAHS FHGDRAEFSC RSEDIEISWK VCVAQDADAE VRELTVVNHG GRERTLDLTS YSEVCLNDRR ADRAHPAFAK LFVETHHDDA TGALFARRRP RDAAEPPVWA VHVSSSDQQI RAEVAHETDR LKFLGRGRTT ADPAMLDPGA MLSKSTGPVL DAIFCLQRIV HLKPKGKVRA AFVTGATDNQ QAAQEIAERY STIEAADRAF SEALEAYRGE LQSSNLTADD VSLFNQLAGS ILFANPAMRL AGALKKNPLD RGVLWSHGIS GDLPIMLVCV NGDGDAALIR EVRMAHDFAR RRGLLFDLAL YDRRGARSAE RLAEVLRAGP QAGMPGKPGG IFVLSAATAP SDHLNAITAA ARIVLPDGGK SLASLLDRQS PAASLPARIR AAQTGGEPPE PGSTTSTKDL LYWNGFGGFT PGGREYVVMV DGIKSRAPAL PPAPWSNVLA NPDFGCLTTE AGLGYTWWGN SQMNRLTPWS NDPVSDTPGE VIYLRDEESG DVWTPTPLPL GHETSVTVHH GQGYSRYVSR SRRLSHELTV HVPPEDPVKV MHLRLSNDDA RTRHLTATYF AEWVLGTQRD DTAMQIVCER DAKSNAIIAR NPWAGDFAKR LAFAAASQPP RSMTSDRAEF LGKHGSVSSP AALGRTDLAE SFGPLQDPCA ALMVEISLAP GESKEVTLVL GQARTREQVR RLVRDYADPQ RAIESFAATC CQWNDILDTI QVSTPDVGMN LMMNRWLPYQ VLACRVWGRS GFYQSGGAFG FRDQLQDVMA LVHSAPDETR AHILRAAARQ FAEGDVQHWW HPPSGVGVRT RITDDLYFLP FVVHHYVSTT GDVDLLDEQV SFITSPVLKE GQEEDFGKPD VGERTDTLYE HCIRALEYGF RLGEHGLPLM GTGDWNDGMN KVGAEGRGES VWNGWFFLTV LKSFATIASS RGDESRETWC CERAEGLRAA LEAHAWDGSW YRRAYFDDGT PLGSSSNDEC QIDALPQAWA VISGEADEER ASKAMSAAYQ RLVRRQDKLI QLFDPPFDKG SLQPGYIKGY VPGIRENGGQ YTHAAAWVVL ATALQGDGER ALELWNLLNP INHAATKQEA QHYRVEPYVV SADVYGAPPN TGRGGWTWYT GAAGWLYRVA LEAMLGFRRQ AQFLSIEPCV PADWPEFEIK YRHGSSTYRI HVDNPAGVCR GVRSILLDDT PLADTKVPLT DDGRFHDVRV ILG
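The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence, the following removes the block spacing and translates codons. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on translation table 11 sharing its amino-acid assignments with the standard code, differing only in permitted start codons. The helper names `ungroup` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(sequence: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = ungroup(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGTCGAAAT CAAATCAAGG CCGCAGACCG"
print(translate(first_blocks))  # MSKSNQGRRP
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` would be the natural next step for checking the whole record.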