Gene Smed_2993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2993 
Symbol 
ID5323870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3139813 
End bp3143724 
Gene Length3912 bp 
Protein Length1303 aa 
Translation table11 
GC content61% 
IMG OID640791944 
Productglycosyltransferase 36 
Protein accessionYP_001328657 
Protein GI150398190 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAT CAAATCAAGG CCGCAGACCG CCCGAGGTGA CGAGATACGA CAGCACGCCT 
GAGCTGGCGG ATACAGGCCC TGCCATCGCC CTGCTTTCCA ACGGCCGTTA CAGCGTCATG
GTTACTGATG CGGGCGCGGG CTGTGCCATC TGGCGGAGCC TTGACGTCAC GCGGTGGCGC
GAAGATGTCA CGCGCGATTG CTGGGGGCAG TTTTGCTACG TCCGAGAACA GGCGGAAAAA
ACGGTGTGGT CTATCGGCAA GCAACCCGTT TGCAGGCCCG CAAACAGTTA CGCGCACTCT
TTTCACGGAG ACCGGGCCGA GTTCTCCTGT CGTTCCGAAG ACATTGAAAT CTCCTGGAAA
GTCTGCGTCG CGCAGGATGC CGACGCGGAA GTGCGTGAGC TTACCGTGGT CAACCACGGC
GGACGAGAGC GGACTCTCGA TCTTACAAGC TATTCCGAGG TCTGCCTGAA CGACCGCCGC
GCGGACAGGG CGCATCCGGC CTTTGCCAAG CTCTTCGTGG AGACGCATCA CGATGACGCG
ACGGGCGCTT TGTTTGCGCG ACGCCGGCCG CGCGATGCGG CAGAGCCACC CGTCTGGGCC
GTCCATGTTT CATCATCAGA TCAACAGATC CGGGCAGAAG TCGCCCATGA GACCGATCGG
CTCAAATTCC TCGGACGGGG CCGCACGACG GCCGACCCCG CGATGCTGGA CCCAGGTGCC
ATGCTTTCCA AGTCGACCGG GCCGGTGCTC GATGCGATCT TCTGCCTGCA AAGGATTGTC
CATCTGAAAC CCAAAGGCAA GGTGCGCGCG GCCTTTGTCA CGGGAGCGAC AGACAATCAA
CAGGCGGCCC AGGAAATCGC TGAACGCTAC TCGACAATCG AGGCGGCGGA CCGCGCCTTT
TCCGAAGCGC TGGAGGCGTA TCGAGGCGAG CTGCAAAGCT CGAACTTAAC CGCCGATGAC
GTGTCGCTGT TCAACCAGCT TGCCGGGAGC ATCCTGTTTG CAAATCCGGC CATGCGATTG
GCCGGGGCAC TGAAGAAAAA TCCACTGGAT AGGGGAGTGC TCTGGTCTCA TGGAATTTCG
GGTGATCTTC CGATCATGCT GGTTTGCGTA AATGGCGACG GGGATGCCGC TCTCATTCGC
GAAGTGAGGA TGGCGCACGA CTTCGCCCGC CGCCGAGGGC TGCTGTTCGA TCTTGCCCTA
TATGACAGGC GTGGCGCGCG CAGCGCTGAG CGGCTTGCCG AAGTCCTGCG AGCCGGCCCG
CAAGCCGGAA TGCCGGGCAA GCCAGGCGGA ATCTTTGTCC TTTCTGCAGC GACGGCCCCG
AGCGATCACC TAAATGCAAT CACGGCCGCA GCGCGGATCG TCTTGCCGGA TGGCGGCAAG
TCTCTCGCAA GCCTGCTCGA CCGGCAATCC CCCGCCGCAT CACTTCCCGC ACGTATTCGG
GCCGCTCAAA CGGGCGGAGA GCCCCCGGAG CCGGGTTCGA CCACATCTAC TAAAGACTTG
CTGTACTGGA ACGGATTCGG GGGGTTCACC CCCGGGGGAC GCGAATATGT CGTCATGGTG
GACGGCATCA AGTCGCGAGC CCCGGCCTTG CCGCCGGCAC CTTGGAGCAA TGTCTTGGCA
AACCCGGACT TCGGCTGCCT GACGACTGAG GCCGGGCTCG GCTACACCTG GTGGGGCAAC
AGCCAAATGA ACCGGTTGAC GCCGTGGTCG AACGACCCCG TTTCGGATAC GCCAGGCGAG
GTCATCTACT TGAGGGACGA GGAGAGCGGC GATGTCTGGA CTCCGACGCC TCTGCCACTC
GGACATGAGA CGTCCGTGAC TGTGCATCAC GGCCAGGGCT ACAGTCGCTA TGTAAGCCGC
AGCCGACGCT TGAGCCACGA ATTGACGGTG CATGTTCCGC CGGAGGATCC GGTTAAAGTC
ATGCATCTGA GGCTGAGCAA TGACGACGCC CGAACGCGCC ATCTGACCGC AACGTACTTC
GCTGAATGGG TTCTCGGAAC CCAGCGTGAC GACACGGCAA TGCAAATCGT CTGCGAACGT
GATGCGAAAT CAAATGCGAT CATTGCACGG AACCCATGGG CCGGCGATTT CGCGAAAAGG
CTGGCGTTTG CGGCCGCCAG CCAGCCTCCG AGGTCGATGA CATCCGACCG CGCGGAATTT
CTTGGGAAAC ACGGTTCAGT GTCCTCGCCA GCGGCCCTGG GGCGAACCGA TTTGGCAGAA
AGCTTCGGTC CCTTGCAGGA CCCCTGTGCC GCGCTGATGG TGGAGATCTC CCTGGCACCC
GGCGAGAGCA AGGAAGTTAC CCTTGTTCTG GGGCAGGCCC GCACCCGCGA GCAAGTCCGC
AGACTCGTGC GCGATTATGC CGACCCGCAA CGTGCGATCG AAAGCTTCGC GGCCACATGC
TGCCAATGGA ATGATATTCT GGATACGATT CAGGTCTCGA CACCGGACGT CGGCATGAAT
CTGATGATGA ATCGCTGGCT GCCGTATCAG GTCCTGGCTT GTCGCGTCTG GGGGCGCTCT
GGCTTTTATC AGTCGGGGGG CGCATTCGGT TTCAGGGACC AGCTTCAGGA TGTGATGGCA
TTGGTCCACA GCGCGCCGGA CGAGACACGC GCACACATTC TTCGAGCAGC CGCGCGTCAA
TTCGCGGAGG GAGACGTTCA ACACTGGTGG CACCCGCCGT CCGGCGTCGG CGTGCGCACC
CGGATTACCG ACGACCTCTA TTTTCTGCCT TTCGTCGTTC ACCACTACGT TTCGACGACT
GGCGATGTCG ACCTTCTTGA TGAGCAGGTG TCTTTCATAA CGTCACCGGT CCTCAAGGAA
GGCCAGGAGG AGGACTTCGG CAAACCTGAC GTCGGCGAAC GGACCGACAC CCTCTACGAA
CACTGTATCC GTGCATTGGA ATACGGCTTC CGGCTCGGCG AGCACGGTTT GCCGCTCATG
GGAACGGGCG ATTGGAACGA CGGCATGAAC AAGGTCGGCG CCGAGGGACG AGGTGAAAGT
GTTTGGAACG GCTGGTTCTT CCTGACGGTT CTGAAATCAT TTGCGACGAT CGCGTCATCG
CGTGGAGACG AAAGCCGCGA AACCTGGTGC TGCGAACGTG CTGAAGGATT GCGCGCGGCC
CTGGAAGCAC ACGCCTGGGA CGGCTCCTGG TATCGCCGCG CCTATTTTGA CGACGGCACG
CCGCTCGGCT CCTCTTCGAA CGACGAGTGC CAGATCGACG CCCTTCCGCA GGCCTGGGCC
GTCATCTCTG GCGAAGCCGA CGAGGAGCGG GCGTCGAAGG CGATGAGCGC GGCCTATCAA
AGACTCGTGC GCCGGCAGGA CAAGCTGATC CAGTTGTTCG ATCCGCCATT CGACAAGGGT
TCGCTGCAGC CGGGCTACAT CAAGGGCTAT GTTCCCGGCA TACGAGAAAA TGGCGGGCAA
TATACCCATG CGGCTGCCTG GGTCGTGCTG GCCACTGCGC TGCAAGGCGA CGGCGAACGG
GCGCTGGAGC TTTGGAACCT GCTCAATCCG ATCAATCACG CGGCGACGAA GCAGGAGGCT
CAGCATTACA GGGTAGAGCC ATATGTCGTC AGCGCCGATG TCTACGGCGC ACCGCCAAAT
ACCGGCCGCG GTGGCTGGAC ATGGTATACG GGAGCAGCAG GCTGGTTGTA TCGCGTCGCG
CTGGAGGCGA TGCTCGGTTT TCGGCGGCAG GCCCAGTTTC TTTCGATCGA ACCATGTGTA
CCAGCCGACT GGCCAGAGTT CGAAATCAAA TACAGGCACG GATCATCGAC ATACCGCATC
CACGTGGACA ACCCGGCCGG CGTCTGTCGC GGGGTACGTT CGATTCTCCT CGATGACACA
CCGCTGGCGG ACACAAAGGT ACCGTTAACA GATGATGGTC GTTTCCACGA CGTCCGAGTG
ATCCTTGGCT AG
 
Protein sequence
MSKSNQGRRP PEVTRYDSTP ELADTGPAIA LLSNGRYSVM VTDAGAGCAI WRSLDVTRWR 
EDVTRDCWGQ FCYVREQAEK TVWSIGKQPV CRPANSYAHS FHGDRAEFSC RSEDIEISWK
VCVAQDADAE VRELTVVNHG GRERTLDLTS YSEVCLNDRR ADRAHPAFAK LFVETHHDDA
TGALFARRRP RDAAEPPVWA VHVSSSDQQI RAEVAHETDR LKFLGRGRTT ADPAMLDPGA
MLSKSTGPVL DAIFCLQRIV HLKPKGKVRA AFVTGATDNQ QAAQEIAERY STIEAADRAF
SEALEAYRGE LQSSNLTADD VSLFNQLAGS ILFANPAMRL AGALKKNPLD RGVLWSHGIS
GDLPIMLVCV NGDGDAALIR EVRMAHDFAR RRGLLFDLAL YDRRGARSAE RLAEVLRAGP
QAGMPGKPGG IFVLSAATAP SDHLNAITAA ARIVLPDGGK SLASLLDRQS PAASLPARIR
AAQTGGEPPE PGSTTSTKDL LYWNGFGGFT PGGREYVVMV DGIKSRAPAL PPAPWSNVLA
NPDFGCLTTE AGLGYTWWGN SQMNRLTPWS NDPVSDTPGE VIYLRDEESG DVWTPTPLPL
GHETSVTVHH GQGYSRYVSR SRRLSHELTV HVPPEDPVKV MHLRLSNDDA RTRHLTATYF
AEWVLGTQRD DTAMQIVCER DAKSNAIIAR NPWAGDFAKR LAFAAASQPP RSMTSDRAEF
LGKHGSVSSP AALGRTDLAE SFGPLQDPCA ALMVEISLAP GESKEVTLVL GQARTREQVR
RLVRDYADPQ RAIESFAATC CQWNDILDTI QVSTPDVGMN LMMNRWLPYQ VLACRVWGRS
GFYQSGGAFG FRDQLQDVMA LVHSAPDETR AHILRAAARQ FAEGDVQHWW HPPSGVGVRT
RITDDLYFLP FVVHHYVSTT GDVDLLDEQV SFITSPVLKE GQEEDFGKPD VGERTDTLYE
HCIRALEYGF RLGEHGLPLM GTGDWNDGMN KVGAEGRGES VWNGWFFLTV LKSFATIASS
RGDESRETWC CERAEGLRAA LEAHAWDGSW YRRAYFDDGT PLGSSSNDEC QIDALPQAWA
VISGEADEER ASKAMSAAYQ RLVRRQDKLI QLFDPPFDKG SLQPGYIKGY VPGIRENGGQ
YTHAAAWVVL ATALQGDGER ALELWNLLNP INHAATKQEA QHYRVEPYVV SADVYGAPPN
TGRGGWTWYT GAAGWLYRVA LEAMLGFRRQ AQFLSIEPCV PADWPEFEIK YRHGSSTYRI
HVDNPAGVCR GVRSILLDDT PLADTKVPLT DDGRFHDVRV ILG