Gene Smed_5002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5002 
Symbol 
ID5318651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1518056 
End bp1520416 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content57% 
IMG OID640776784 
ProductTPR repeat-containing protein 
Protein accessionYP_001313716 
Protein GI150377120 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4092] Predicted glycosyltransferase involved in capsule biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000902541 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGGTATG AGCAACCCGC CCTGGTCCGG AACTCGGCTG CCGAGTCCCC GGTCCTTACC 
GTACTGATAA CGATCAGGGT CGCAGCACAT TACGACATGA TCGAGCGGCT TCGCTTCAGG
TCGTTAGACA CTCGGATGCC CGACTCTGTG ACCTTCCTTG TAATTGACGA GGGCTCCAGC
TTTGAGGACG CAGAACGATT GAAGGCCGCA TGTGCCGAGA TCGGGTTCGA CTACGTTCGC
CACAACACCG AGAAGGACCT GTTCTGCCTC GCAGGCGCCC GAAACATCGG CGCTTCACTG
TCCCGTTCGC AATTCATCAT GTTCGAGGAT GTCGACCTTT TTCCTTATCC CGGTTTCTAT
CAGGACATTA TCGACGAGAT CCACGTTCAG GGGCTCGATA CGCACTCGAA CCGCTTCCTT
ACTGTCCCCT GCCTCTATCT GAGCGAACAA GCGACCGAGG CGGCCCTTGC TGGAACGCTA
TCGAAGAATG AAATTCTTCA CGACCACCTG ACTGGCGGGC CGCTTGTTGA TTTCGCCCTA
CCCGCCAGTT CGGCGCTCGT TATCAACCGC ATGTTTTATC TGTCGATCGG CGGCTCGAAC
GATCAGTTCA AGGGTTGGGG GCTGGAAGAT CTGGAACTCG CCTACCGGCT GACGCGTGGC
ACCAACAAGT TCATGTCTCC GAAGGACCAT CGTTGGCTCA TCGAAGGTGG ATACGCCCAC
AACCCAACTT ATCGAGGTTG GCGCGCACAG TTCCGCTTGC ATGGCGAGCT TCTCGCCCGC
AAATCGATCT ACATCTTTCA CGCCCACCAC CCGAGAGACA CGGCCTGGCG CAACCCGGAA
CTCCATCGGG CAAACAAAGC TATATTTCAG AGCAGCAATG AGTCCTTCGA CGCCAAGGGC
CACACATTGC CCTCTCTGCC GGATGCCAAG CGCGGCCGGT CGTTGATATT CGGGCGCGGC
ACCTTCGCCT ATAACCCCGC TCTCCTGCCT TTGTGGGGCG ATCTCGAGGT GAAGGGGTAT
CAAGACTTCC AGGAGCGGGA CATCATCGAG CATGTCCGCG CCAACGGCAT CAACCGGGTG
ATTTTCACAA ACCCCTACGC CAATGACATG CGGCTTTCCG TTTACAACCG AGTGCGATCG
GCGGGAATTC CCTTCTACGT CGTTGAGCGT GGCGCGCTTA CCGACTCCAT GTTCATTGAC
GATACCGGCT TTTGCTGTGA GAGCACGCGC TATCGCCGCG AGCACTGGCC TGCAGAACTT
GACGCCGGTC GTCTGCGGCG GGTGCAGGAT TACATTTCGG CCGAAACTTC CTCGGGCTCG
GCTCTTGAAA AACAGGGTGA GCGCATCGGC GCACGTGCCG CGCTTCAAAA ACTCGGCATC
CCGCCGCAAA AGAAGGTCCT GTTTGTTCCT TTCCAATCCC GCTCGGACAC GACCGTCAAT
CACTTCGCCG GCCCGATCGG ATCATTCGAT AACTTCGTTG ACCTCGTTCG GGACGTGACA
AAGAGGCTGC CACCCGATTG GGTGGTGGTC TTCAAAAAGC ATCCGCTTTC GTCAGTTCAA
GAGACCGTCC CCGGCGCCAT CGACGTCGGC AACATGCACA TCAAGGACCT GATGGAACTC
ACGGATTACG TTCTGCTTAT GAACTCCGGC GTAGGCGTCC TGTCTATCCT GTTCGACCGG
CCGTGCATCT ATACCGCCCA AGCCTTCTAT GCTGACGACG CTCTGAACCG TGGCGCATCC
AAGGCTGGCG ACGTCCTGCG CCTTCTTAAA GAGGGCTTCA CCGTCGACCA AGACAGCCGG
CTCCGGTTCT TGTCCTACTT ACTCGAAGAT TTCTACAGCT TCGGCAAATT CACGGTCACC
GAGAAGTCGT ACACGGACAA CGCTACTCTC TCGATAACAG AGAGGATTGA CTACTACCGC
GTAAACCTCC TTGGTGCGCG CGTCCTCGAC ACCGACGACC GCGAGCGGAT CCTCAACCCC
AATGCTCCCA TCTACGATCT TTTCCGTGAG TGGATACGGA GTAATAAGCA CACAGAGCGG
CAGGCTTCCC CATCTGGAGC CCCCCAAAGT GCGGATGAGG CCTCGCGCAA AGGACGGAGT
GCGTTTCACG CCAAGAGGTA TCAGGAGGCC GCCCTCATGT TTGACGCCGC CTGCTCGATA
CAGCCCAATA GAGCAACCCA TTACCGCGCA GCCGCCGAGG CGCTTTTCGC AATGGGGAAC
AGGAACATCG CCCTTTCACG TCTCGAAAAA GCGCGCCTCC TCGCCCCCGA CAACAAGTCA
ATCAAGCGCC GGATACGTGA GATGAAGAGG CCGCCCTTCC TTCAGTTCCT CTCGAAGCCC
TTCCCCATAG CAAAGGATTA G
 
Protein sequence
MRYEQPALVR NSAAESPVLT VLITIRVAAH YDMIERLRFR SLDTRMPDSV TFLVIDEGSS 
FEDAERLKAA CAEIGFDYVR HNTEKDLFCL AGARNIGASL SRSQFIMFED VDLFPYPGFY
QDIIDEIHVQ GLDTHSNRFL TVPCLYLSEQ ATEAALAGTL SKNEILHDHL TGGPLVDFAL
PASSALVINR MFYLSIGGSN DQFKGWGLED LELAYRLTRG TNKFMSPKDH RWLIEGGYAH
NPTYRGWRAQ FRLHGELLAR KSIYIFHAHH PRDTAWRNPE LHRANKAIFQ SSNESFDAKG
HTLPSLPDAK RGRSLIFGRG TFAYNPALLP LWGDLEVKGY QDFQERDIIE HVRANGINRV
IFTNPYANDM RLSVYNRVRS AGIPFYVVER GALTDSMFID DTGFCCESTR YRREHWPAEL
DAGRLRRVQD YISAETSSGS ALEKQGERIG ARAALQKLGI PPQKKVLFVP FQSRSDTTVN
HFAGPIGSFD NFVDLVRDVT KRLPPDWVVV FKKHPLSSVQ ETVPGAIDVG NMHIKDLMEL
TDYVLLMNSG VGVLSILFDR PCIYTAQAFY ADDALNRGAS KAGDVLRLLK EGFTVDQDSR
LRFLSYLLED FYSFGKFTVT EKSYTDNATL SITERIDYYR VNLLGARVLD TDDRERILNP
NAPIYDLFRE WIRSNKHTER QASPSGAPQS ADEASRKGRS AFHAKRYQEA ALMFDAACSI
QPNRATHYRA AAEALFAMGN RNIALSRLEK ARLLAPDNKS IKRRIREMKR PPFLQFLSKP
FPIAKD